Papers
Evaluating LLM-Prompting for Sequence Labeling Tasks in Computational Literary Studies
Axel Pichler, Janis Pagel, Nils Reiter
Evaluating LLMs for Quotation Attribution in Literary Texts: A Case Study of LLaMa3
Gaspard Michel, Elena V. Epure, Romain Hennequin et al.
Evaluating Morphological Compositional Generalization in Large Language Models
Mete Ismayilzada, Defne Circi, Jonne Sälevä et al.
Evaluating Multimodal Generative AI with Korean Educational Standards
Sanghee Park, Geewook Kim
Evaluating Numeracy of Language Models as a Natural Language Inference Task
Rahmad Mahendra, Damiano Spina, Lawrence Cavedon et al.
Evaluating Robustness of LLMs to Numerical Variations in Mathematical Reasoning
Yuli Yang, Hiroaki Yamada, Takenobu Tokunaga
Evaluating Self-Generated Documents for Enhancing Retrieval-Augmented Generation with Large Language Models
Jiatao Li, Xinyu Hu, Xunjian Yin et al.
Evaluating Small Language Models for News Summarization: Implications and Factors Influencing Performance
Borui Xu, Yao Chen, Zeyi Wen et al.
Evaluating Text Style Transfer Evaluation: Are There Any Reliable Metrics?
Sourabrata Mukherjee, Atul Kr. Ojha, John P. McCrae et al.
Evaluating the Performance of Large Language Models via Debates
Behrad Moniri, Hamed Hassani, Edgar Dobriban
Evaluating the Performance of RAG Methods for Conversational AI in the Airport Domain
Yuyang Li, P. J. M. Kerbusch, R. H. R. Pruim et al.
Evaluating the Prompt Steerability of Large Language Models
Erik Miehling, Michael Desmond, Karthikeyan Natesan Ramamurthy et al.
Evaluating Vision-Language Models for Emotion Recognition
Sree Bhattacharyya, James Z. Wang
Evaluation of LLMs-based Hidden States as Author Representations for Psychological Human-Centered NLP Tasks
Nikita Soni, Pranav Chitale, Khushboo Singh et al.
Evaluation of Multilingual Image Captioning: How far can we get with CLIP models?
Goncalo Emanuel Cavaco Gomes, Chrysoula Zerva, Bruno Martins
EventFull: Complete and Consistent Event Relation Annotation
Alon Eirew, Eviatar Nachshoni, Aviv Slobodkin et al.
EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms
Siyu Yuan, Kaitao Song, Jiangjie Chen et al.
Examining and Adapting Time for Multilingual Classification via Mixture of Temporal Experts
Weisi Liu, Guangzeng Han, Xiaolei Huang
Examining Spanish Counseling with MIDAS: a Motivational Interviewing Dataset in Spanish
Aylin Ece Gunal, Bowen Yi, John D. Piette et al.
Expertly Informed, Generatively Summarized: A Hybrid RAG Approach to Informed Consent Summarization with Auxiliary Expert Knowledge
Autumn Toney-Wails, Ryan Wails, Caleb Smith
Explainability for NLP in Pharmacovigilance: A Study on Adverse Event Report Triage in Swedish
Luise Dürlich, Erik Bergman, Maria Larsson et al.
Explainable ICD Coding via Entity Linking
Leonor Barreiros, Isabel Coutinho, Gonçalo Correia et al.
Explanation based In-Context Demonstrations Retrieval for Multilingual Grammatical Error Correction
Wei Li, Wen Luo, Guangyue Peng et al.
Exploiting Edited Large Language Models as General Scientific Optimizers
Qitan Lv, Tianyu Liu, Hong Wang
Exploratory Study into Relations between Cognitive Distortions and Emotional Appraisals
Navneet Agarwal, Kairit Sirts