Research Explorer

Reasoning or Knowledge: Stratified Evaluation of Biomedical LLMs

Rahul Thapa, Qingyang Wu, Kevin Wu et al.

2026 EACL

AfriVox: Probing Multilingual and Accent Robustness of Speech LLMs

Busayo Awobade, Mardhiyah Sanni, Tassallah Abdullahi et al.

2026 EACL

PTEB: Towards Robust Text Embedding Evaluation via Stochastic Paraphrasing at Evaluation Time with LLMs

Manuel Frank, Haithem Afli

2026 EACL

How Good Are LLMs at Processing Tool Outputs?

Kiran Kate, Yara Rizk, Poulami Ghosh et al.

2026 EACL

Tug-of-war between idioms’ figurative and literal interpretations in LLMs

Soyoung Oh, Xinting Huang, Mathis Pink et al.

2026 EACL

MALicious INTent Dataset and Inoculating LLMs for Enhanced Disinformation Detection

Arkadiusz Modzelewski, Witold Sosnowski, Eleni Papadopulos et al.

2026 EACL

ARC: Argument Representation and Coverage Analysis for Zero-Shot Long Document Summarization with Instruction Following LLMs

Mohamed Elaraby, Diane Litman

2026 EACL

When Can We Trust LLMs in Mental Health? Large-Scale Benchmarks for Reliable LLM Evaluation

Abeer Badawi, Elahe Rahimi, Md Tahmid Rahman Laskar et al.

2026 EACL

Word Surprisal Correlates with Sentential Contradiction in LLMs

Ning Shi, Bradley Hauer, David Basil et al.

2026 EACL

Where Do LLMs Compose Meaning? A Layerwise Analysis of Compositional Robustness

Nura Aljaafari, Danilo Carvalho, Andre Freitas

2026 EACL

Unlocking Latent Discourse Translation in LLMs Through Quality-Aware Decoding

Wafaa Mohammed, Vlad Niculae, Chrysoula Zerva

2026 EACL

Calibrating Beyond English: Language Diversity for Better Quantized Multilingual LLMs

Everlyn Asiko Chimoto, Mostafa Elhoushi, Bruce Bassett

2026 EACL

Can you map it to English? The Role of Cross-Lingual Alignment in the Multilingual Performance of LLMs

Kartik Ravisankar, HyoJung Han, Sarah Wiegreffe et al.

2026 EACL

DITTO: A Spoofing Attack Framework on Watermarked LLMs via Knowledge Distillation

Hyeseon An, Shinwoo Park, Suyeon Woo et al.

2026 EACL

From Delegates to Trustees: How Optimizing for Long-Term Interests Shapes Bias and Alignment in LLMs

Suyash Fulay, Jocelyn Zhu, Michiel A. Bakker

2026 EACL

Biasless Language Models Learn Unnaturally: How LLMs Fail to Distinguish the Possible from the Impossible

Imry Ziv, Nur Lan, Emmanuel Chemla

2026 EACL

Multilingual Amnesia: On the Transferability of Unlearning in Multilingual LLMs

Alireza Dehghanpour Farashah, Aditi Khandelwal, Marylou Fauchard et al.

2026 EACL

Beyond Math: Stories as a Testbed for Memorization-Constrained Reasoning in LLMs

Yuxuan Jiang, Francis Ferraro

2026 EACL

Neural Breadcrumbs: Membership Inference Attacks on LLMs Through Hidden State and Attention Pattern Analysis

Disha Makhija, Manoj Ghuhan Arivazhagan, Vinayshekhar Bannihatti Kumar et al.

2026 EACL

Tracking the Limits of Knowledge Propagation: How LLMs Fail at Multi-Step Reasoning with Conflicting Knowledge

Yiyang Feng, Zeming Chen, Haotian Wu et al.

2026 EACL

Do Audio LLMs Really LISTEN, or Just Transcribe? Measuring Lexical vs. Acoustic Emotion Cues Reliance

Jingyi Chen, Zhimeng Guo, Jiyun Chun et al.

2026 EACL

Mary, the Cheeseburger-Eating Vegetarian: Do LLMs Recognize Incoherence in Narratives?

Karin De Langis, Püren Öncel, Ryan Peters et al.

2026 EACL

Strong Memory, Weak Control: An Empirical Study of Executive Functioning in LLMs

Karin de Langis, Jong Inn Park, Bin Hu et al.

2026 EACL

Can LLMs reason over extended multilingual contexts? Towards long-context evaluation beyond retrieval over haystacks

Amey Hengle, Prasoon Bajpai, Soham Dan et al.

2026 EACL

Knowing When to Abstain: Medical LLMs Under Clinical Uncertainty

Sravanthi Machcha, Sushrita Yerra, Sahil Gupta et al.

2026 EACL

Papers