Papers

6,952 papers found
Evaluating LLMs for Quotation Attribution in Literary Texts: A Case Study of LLaMa3
Gaspard Michel, Elena V. Epure, Romain Hennequin et al.
2025 NAACL
Evaluating Morphological Compositional Generalization in Large Language Models
Mete Ismayilzada, Defne Circi, Jonne Sälevä et al.
2025 NAACL
Evaluating Numeracy of Language Models as a Natural Language Inference Task
Rahmad Mahendra, Damiano Spina, Lawrence Cavedon et al.
2025 NAACL
2025 NAACL
Evaluating Text Style Transfer Evaluation: Are There Any Reliable Metrics?
Sourabrata Mukherjee, Atul Kr. Ojha, John P. McCrae et al.
2025 NAACL
Evaluating the Performance of Large Language Models via Debates
Behrad Moniri, Hamed Hassani, Edgar Dobriban
2025 NAACL
Evaluating the Prompt Steerability of Large Language Models
Erik Miehling, Michael Desmond, Karthikeyan Natesan Ramamurthy et al.
2025 NAACL
2025 NAACL
Evaluation of Multilingual Image Captioning: How far can we get with CLIP models?
Goncalo Emanuel Cavaco Gomes, Chrysoula Zerva, Bruno Martins
2025 NAACL
EventFull: Complete and Consistent Event Relation Annotation
Alon Eirew, Eviatar Nachshoni, Aviv Slobodkin et al.
2025 NAACL
2025 NAACL
2025 NAACL
Explainable ICD Coding via Entity Linking
Leonor Barreiros, Isabel Coutinho, Gonçalo Correia et al.
2025 NAACL