Papers
Evaluating Generalization Capability of Language Models across Abductive, Deductive and Inductive Logical Reasoning
Yu Sheng, Wanting Wen, Linjing Li et al.
Evaluating Human Perception and Bias in AI-Generated Humor
Narendra Nath Joshi
Evaluating Large Language Models for In-Context Learning of Linguistic Patterns In Unseen Low Resource Languages
Hongpu Zhu, Yuqi Liang, Wenjing Xu et al.
Evaluating Large Language Models on Health-Related Claims Across Arabic Dialects
Abdulsalam Obaid Alharbi, Abdullah Alsuhaibani, Abdulrahman Abdullah Alalawi et al.
Evaluating Model Alignment with Human Perception: A Study on Shitsukan in LLMs and LVLMs
Daiki Shiono, Ana Brassard, Yukiko Ishizuki et al.
Evaluating Open-Source ASR Systems: Performance Across Diverse Audio Conditions and Error Correction Methods
Saki Imai, Tahiya Chowdhury, Amanda J. Stent
Evaluating Pixel Language Models on Non-Standardized Languages
Alberto Muñoz-Ortiz, Verena Blaschke, Barbara Plank
Evaluating RAG Pipelines for Arabic Lexical Information Retrieval: A Comparative Study of Embedding and Generation Models
Raghad Al-Rasheed, Abdullah Al Muaddi, Hawra Aljasim et al.
Evaluating Readability Metrics for German Medical Text Simplification
Karen Scholz, Markus Wenzel
Evaluating Sampling Strategies for Similarity-Based Short Answer Scoring: a Case Study in Thailand
Pachara Boonsarngsuk, Pacharapon Arpanantikul, Supakorn Hiranwipas et al.
Evaluating Structural and Linguistic Quality in Urdu DRS Parsing and Generation through Bidirectional Evaluation
Muhammad Saad Amin, Luca Anselma, Alessandro Mazzei
Evaluating the Capabilities of Large Language Models for Multi-label Emotion Understanding
Tadesse Destaw Belay, Israel Abebe Azime, Abinew Ali Ayele et al.
Evaluating the Consistency of LLM Evaluators
Noah Lee, Jiwoo Hong, James Thorne
Evaluating Transformers for OCR Post-Correction in Early Modern Dutch Theatre
Florian Debaene, Aaron Maladry, Els Lefever et al.
Evaluation of Large Language Models on Arabic Punctuation Prediction
Asma Ali Al Wazrah, Afrah Altamimi, Hawra Aljasim et al.
Evolver: Chain-of-Evolution Prompting to Boost Large Multimodal Models for Hateful Meme Detection
Jinfa Huang, Jinsheng Pan, Zhongwei Wan et al.
EvoPrompt: Evolving Prompts for Enhanced Zero-Shot Named Entity Recognition with Large Language Models
Zeliang Tong, Zhuojun Ding, Wei Wei
ExMute: A Context-Enriched Multimodal Dataset for Hateful Memes
Riddhiman Swanan Debnath, Nahian Beente Firuj, Abdul Wadud Shakib et al.
Explain-Analyze-Generate: A Sequential Multi-Agent Collaboration Method for Complex Reasoning
WenYuan Gu, JiaLe Han, HaoWen Wang et al.
Explaining Relationships Among Research Papers
Xiangci Li, Jessica Ouyang
Explanation Regularisation through the Lens of Attributions
Pedro Ferreira, Ivan Titov, Wilker Aziz
Exploiting Task Reversibility of DRS Parsing and Generation: Challenges and Insights from a Multi-lingual Perspective
Muhammad Saad Amin, Luca Anselma, Alessandro Mazzei
Exploiting the Index Gradients for Optimization-Based Jailbreaking on Large Language Models
Jiahui Li, Yongchang Hao, Haoyu Xu et al.
Exploiting Word Sense Disambiguation in Large Language Models for Machine Translation
Van-Hien Tran, Raj Dabre, Hour Kaing et al.