Research Explorer

Establishing Trustworthy LLM Evaluation via Shortcut Neuron Analysis

Kejian Zhu, Shangqing Tu, Zhuoran Jin et al.

2025 ACL

Do Large Language Models have an English Accent? Evaluating and Improving the Naturalness of Multilingual LLMs

Yanzhu Guo, Simone Conia, Zelin Zhou et al.

2025 ACL

Enhancing Character-Level Understanding in LLMs through Token Internal Structure Learning

Zhu Xu, Zhiqiang Zhao, Zihan Zhang et al.

2025 ACL

Confidence v.s. Critique: A Decomposition of Self-Correction Capability for LLMs

Zhe Yang, Yichang Zhang, Yudong Wang et al.

2025 ACL

Automating Legal Interpretation with LLMs: Retrieval, Generation, and Evaluation

Kangcheng Luo, Quzhe Huang, Cong Jiang et al.

2025 ACL

Game Development as Human-LLM Interaction

Jiale Hong, Hongqiu Wu, Hai Zhao

2025 ACL

Can LLMs Simulate L2-English Dialogue? An Information-Theoretic Analysis of L1-Dependent Biases

Rena Gao, Xuetong Wu, Tatsuki Kuribayashi et al.

2025 ACL

Auto-Arena: Automating LLM Evaluations with Agent Peer Battles and Committee Discussions

Ruochen Zhao, Wenxuan Zhang, Yew Ken Chia et al.

2025 ACL

How Humans and LLMs Organize Conceptual Knowledge: Exploring Subordinate Categories in Italian

Andrea Pedrotti, Giulia Rambelli, Caterina Villani et al.

2025 ACL

Stepwise Reasoning Disruption Attack of LLMs

Jingyu Peng, Maolin Wang, Xiangyu Zhao et al.

2025 ACL

Uncertainty Propagation on LLM Agent

Qiwei Zhao, Dong Li, Yanchi Liu et al.

2025 ACL

Are the Hidden States Hiding Something? Testing the Limits of Factuality-Encoding Capabilities in LLMs

Giovanni Servedio, Alessandro De Bellis, Dario Di Palma et al.

2025 ACL

HD-NDEs: Neural Differential Equations for Hallucination Detection in LLMs

Qing Li, Jiahui Geng, Zongxiong Chen et al.

2025 ACL

CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis

Bohan Zhang, Xiaokang Zhang, Jing Zhang et al.

2025 ACL

Can Graph Descriptive Order Affect Solving Graph Problems with LLMs?

Yuyao Ge, Shenghua Liu, Baolong Bi et al.

2025 ACL

GIFT-SW: Gaussian noise Injected Fine-Tuning of Salient Weights for LLMs

Maxim Zhelnin, Viktor Moskvoretskii, Egor Shvetsov et al.

2025 ACL

Biased LLMs can Influence Political Decision-Making

Jillian Fisher, Shangbin Feng, Robert Aron et al.

2025 ACL

TheoremExplainAgent: Towards Video-based Multimodal Explanations for LLM Theorem Understanding

Max Ku, Cheuk Hei Chong, Jonathan Leung et al.

2025 ACL

FineReason: Evaluating and Improving LLMs’ Deliberate Reasoning through Reflective Puzzle Solving

Guizhen Chen, Weiwen Xu, Hao Zhang et al.

2025 ACL

The TIP of the Iceberg: Revealing a Hidden Class of Task-in-Prompt Adversarial Attacks on LLMs

Sergey Berezin, Reza Farahbakhsh, Noel Crespi

2025 ACL

Drift: Enhancing LLM Faithfulness in Rationale Generation via Dual-Reward Probabilistic Inference

Jiazheng Li, Hanqi Yan, Yulan He

2025 ACL

Fairness through Difference Awareness: Measuring Desired Group Discrimination in LLMs

Angelina Wang, Michelle Phan, Daniel E. Ho et al.

2025 ACL

DRAG: Distilling RAG for SLMs from LLMs to Transfer Knowledge and Mitigate Hallucination via Evidence and Graph-based Distillation

Jennifer Chen, Aidar Myrzakhan, Yaxin Luo et al.

2025 ACL

Rolling the DICE on Idiomaticity: How LLMs Fail to Grasp Context

Maggie Mi, Aline Villavicencio, Nafise Sadat Moosavi

2025 ACL

MathFusion: Enhancing Mathematical Problem-solving of LLM through Instruction Fusion

Qizhi Pei, Lijun Wu, Zhuoshi Pan et al.

2025 ACL

Papers