Research Explorer

Hire Your Anthropologist! Rethinking Culture Benchmarks Through an Anthropological Lens

Mai Alkhamissi, Yunze Xiao, Badr AlKhamissi et al.

2026 EACL

H-MEM: Hierarchical Memory for High-Efficiency Long-Term Reasoning in LLM Agents

Haoran Sun, Shaoning Zeng, Bob Zhang

2026 EACL

H-Mem: Hybrid Multi-Dimensional Memory Management for Long-Context Conversational Agents

Zihe Ye, Jingyuan Huang, Weixin Chen et al.

2026 EACL

Hospitality-VQA: Decision-Oriented Informativeness Evaluation for Vision–Language Models

Jeongwoo Lee, Baek Duhyeong, Eungyeol Han et al.

2026 EACL

HotelQuEST: Balancing Quality and Efficiency in Agentic Search

Guy Hadad, Shadi Iskander, Sofia Tolmach et al.

2026 EACL

How DDAIR you? Disambiguated Data Augmentation for Intent Recognition

Galo Castillo-López, Alexis Lombard, Nasredine Semmar et al.

2026 EACL

How Do Language Models Acquire Character-Level Information?

Soma Sato, Ryohei Sasano

2026 EACL

How Do Lexical Senses Correspond Between Spoken German and German Sign Language?

Melis Çelikkol, Wei Zhao

2026 EACL

How Do LLMs Generate Contrastive Sentiments? A Mechanistic Perspective

Van Bach Nguyen, Jörg Schlötterer, Christin Seifert

2026 EACL

How effective are VLMs in assisting humans in inferring the quality of mental models from Multimodal short answers?

Pritam Sil, Durgaprasad Karnam, Vinay Reddy Venumuddala et al.

2026 EACL

How Far Can Pretrained LLMs Go in Symbolic Music? Controlled Comparisons of Supervised and Preference-based Adaptation

Deepak Kumar, Emmanouil Karystinaios, Gerhard Widmer et al.

2026 EACL

How Good Are LLMs at Processing Tool Outputs?

Kiran Kate, Yara Rizk, Poulami Ghosh et al.

2026 EACL

How Important is ‘Perfect’ English for Machine Translation Prompts?

Patrícia Schmidtová, Niyati Bafna, Seth Aycock et al.

2026 EACL

How Many Ratings per Item are Necessary for Reliable Significance Testing?

Christopher M Homan, Flip Korn, Deepak Pandita et al.

2026 EACL

How Much Pretraining Does Structured Data Need?

Daniel Fadlon, Kfir Bar

2026 EACL

How multilingual are multilingual LLMs? A case study in Northern Sámi-Finnish Translation

Jonne Sälevä, Constantine Lignos

2026 EACL

How Quantization Shapes Bias in Large Language Models

Federico Marcuzzi, Xuefei Ning, Roy Schwartz et al.

2026 EACL

How Reliable are Confidence Estimators for Large Reasoning Models? A Systematic Benchmark on High-Stakes Domains

Reza Khanmohammadi, Erfan Miahi, Simerjot Kaur et al.

2026 EACL

How Robust Are Router-LLMs? Analysis of the Fragility of LLM Routing Capabilities

Aly M. Kassem, Bernhard Schölkopf, Zhijing Jin

2026 EACL

How Should We Model the Probability of a Language?

Rasul Dent, Pedro Ortiz Suarez, Thibault Clérice et al.

2026 EACL

How to Contextualize Empirical Data for Risk Analysis with LLMs: A Case Study of Power Outages

Haiyun Huang, Yukun Li, Marco A Pretell et al.

2026 EACL

How to Efficiently Explore Noisy Historical Data? Leveraging Corpus Pre-Targeting to Enhance Graph-based RAG

Donghan Bian, Marie Puren, Florian Cafiero

2026 EACL

How to Make LMs Strong Node Classifiers?

Zhe Xu, Kaveh Hassani, Si Zhang et al.

2026 EACL

Humans and transformer LMs: Abstraction drives language learning

Jasper Jian, Christopher D Manning

2026 EACL

HumMusQA: A Human-written Music Understanding QA Benchmark Dataset

Benno Weck, Pablo Puentes, Andrea Poltronieri et al.

2026 EACL
