Papers

6,514 papers found
Evaluating Large Language Models on Health-Related Claims Across Arabic Dialects
Abdulsalam obaid Alharbi, Abdullah Alsuhaibani, Abdulrahman Abdullah Alalawi et al.
2025 COLING
2025 COLING
Evaluating Pixel Language Models on Non-Standardized Languages
Alberto Muñoz-Ortiz, Verena Blaschke, Barbara Plank
2025 COLING
Evaluating Sampling Strategies for Similarity-Based Short Answer Scoring: a Case Study in Thailand
Pachara Boonsarngsuk, Pacharapon Arpanantikul, Supakorn Hiranwipas et al.
2025 COLING
Evaluating the Capabilities of Large Language Models for Multi-label Emotion Understanding
Tadesse Destaw Belay, Israel Abebe Azime, Abinew Ali Ayele et al.
2025 COLING
Evaluating the Consistency of LLM Evaluators
Noah Lee, Jiwoo Hong, James Thorne
2025 COLING
Evaluating Transformers for OCR Post-Correction in Early Modern Dutch Theatre
Florian Debaene, Aaron Maladry, Els Lefever et al.
2025 COLING
Evaluation of Large Language Models on Arabic Punctuation Prediction
Asma Ali Al Wazrah, Afrah Altamimi, Hawra Aljasim et al.
2025 COLING
ExMute: A Context-Enriched Multimodal Dataset for Hateful Memes
Riddhiman Swanan Debnath, Nahian Beente Firuj, Abdul Wadud Shakib et al.
2025 COLING
2025 COLING
Explanation Regularisation through the Lens of Attributions
Pedro Ferreira, Ivan Titov, Wilker Aziz
2025 COLING