Research Explorer

LLMs are not Zero-Shot Reasoners for Biomedical Information Extraction

Aishik Nagar, Viktor Schlegel, Thanh-Tung Nguyen et al.

2025 NAACL

Exploring Limitations of LLM Capabilities with Multi-Problem Evaluation

Zhengxiang Wang, Jordan Kodner, Owen Rambow

2025 NAACL

Self Knowledge-Tracing for Tool Use (SKT-Tool): Helping LLM Agents Understand Their Capabilities in Tool Use

Joshua Vigel, Renpei Cai, Eleanor Chen et al.

2025 NAACL

Evaluating Robustness of LLMs to Numerical Variations in Mathematical Reasoning

Yuli Yang, Hiroaki Yamada, Takenobu Tokunaga

2025 NAACL

Leveraging Domain Knowledge at Inference Time for LLM Translation: Retrieval versus Generation

Bryan Li, Jiaming Luo, Eleftheria Briakou et al.

2025 NAACL

LLM Reasoning Engine: Specialized Training for Enhanced Mathematical Reasoning

Shuguang Chen, Guang Lin

2025 NAACL

RouteNator: A Router-Based Multi-Modal Architecture for Generating Synthetic Training Data for Function Calling LLMs

Vibha Belavadi, Tushar Vatsa, Dewang Sultania et al.

2025 NAACL

Towards Effectively Leveraging Execution Traces for Program Repair with Code LLMs

Mirazul Haque, Petr Babkin, Farima Farmahinifarahani et al.

2025 NAACL

AI Conversational Interviewing: Transforming Surveys with LLMs as Adaptive Interviewers

Alexander Wuttke, Matthias Aßenmacher, Christopher Klamm et al.

2025 NAACL

Prompting the Past: Exploring Zero-Shot Learning for Named Entity Recognition in Historical Texts Using Prompt-Answering LLMs

Crina Tudor, Beata Megyesi, Robert Östling

2025 NAACL

LLMs for Translation: Historical, Low-Resourced Languages and Contemporary AI Models

Merve Tekgürler

2025 NAACL

Using LLMs to Advance Idiom Corpus Construction

Doğukan Arslan, Hüseyin Anıl Çakmak, Gülşen Eryiğit et al.

2025 NAACL

Assessing Crowdsourced Annotations with LLMs: Linguistic Certainty as a Proxy for Trustworthiness

Tianyi Li, Divya Sree, Tatiana Ringenberg

2025 NAACL

On Psychology of AI – Does Primacy Effect Affect ChatGPT and Other LLMs?

Mika Hämäläinen

2025 NAACL

A Comparative Analysis of Ethical and Safety Gaps in LLMs using Relative Danger Coefficient

Yehor Tereshchenko, Mika Hämäläinen

2025 NAACL

VLG-BERT: Towards Better Interpretability in LLMs through Visual and Linguistic Grounding

Toufik Mechouma, Ismail Biskri, Serge Robert

2025 NAACL

A Comprehensive Evaluation of Cognitive Biases in LLMs

Simon Malberg, Roman Poletukhin, Carolin M. Schuster et al.

2025 NAACL

Fearful Falcons and Angry Llamas: Emotion Category Annotations of Arguments by Humans and LLMs

Lynn Greschner, Roman Klinger

2025 NAACL

Balancing Privacy and Utility in Personal LLM Writing Tasks: An Automated Pipeline for Evaluating Anonymizations

Stefan Pasch, Min Chul Cha

2025 NAACL

Named Entity Inference Attacks on Clinical LLMs: Exploring Privacy Risks and the Impact of Mitigation Strategies

Adam Sutton, Xi Bai, Kawsar Noor et al.

2025 NAACL

Prompt and circumstance: A word-by-word LLM prompting approach to interlinear glossing for low-resource languages

Micha Elsner, David Liu

2025 NAACL

Line of Duty: Evaluating LLM Self-Knowledge via Consistency in Feasibility Boundaries

Sahil Kale, Vijaykant Nadadur

2025 NAACL

Multi-lingual Multi-turn Automated Red Teaming for LLMs

Abhishek Singhania, Christophe Dupuy, Shivam Sadashiv Mangale et al.

2025 NAACL

Summary the Savior: Harmful Keyword and Query-based Summarization for LLM Jailbreak Defense

Shagoto Rahman, Ian Harris

2025 NAACL

Monte Carlo Temperature: a robust sampling strategy for LLM’s uncertainty quantification methods

Nicola Cecere, Andrea Bacciu, Ignacio Fernández-Tobías et al.

2025 NAACL
