Research Explorer

Analyzing LLMs’ Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations

Chenghao Xiao, Hou Pong Chan, Hao Zhang et al.

2025 ACL

Dialogue-RAG: Enhancing Retrieval for LLMs via Node-Linking Utterance Rewriting

Qiwei Li, Teng Xiao, Zuchao Li et al.

2025 ACL

Evaluating LLMs for Portuguese Sentence Simplification with Linguistic Insights

Arthur Mariano Rocha De Azevedo Scalercio, Elvis A. De Souza, Maria José Bocorny Finatto et al.

2025 ACL

Leveraging In-Context Learning for Political Bias Testing of LLMs

Patrick Haller, Jannis Vamvas, Rico Sennrich et al.

2025 ACL

LLMs know their vulnerabilities: Uncover Safety Gaps through Natural Distribution Shifts

Qibing Ren, Hao Li, Dongrui Liu et al.

2025 ACL

Can We Further Elicit Reasoning in LLMs? Critic-Guided Planning with Retrieval-Augmentation for Solving Challenging Tasks

Xingxuan Li, Weiwen Xu, Ruochen Zhao et al.

2025 ACL

Help Me Write a Story: Evaluating LLMs’ Ability to Generate Writing Feedback

Hannah Rashkin, Elizabeth Clark, Fantine Huot et al.

2025 ACL

HumT DumT: Measuring and controlling human-like language in LLMs

Myra Cheng, Sunny Yu, Dan Jurafsky

2025 ACL

Do LLMs Understand Dialogues? A Case Study on Dialogue Acts

Ayesha Qamar, Jonathan Tong, Ruihong Huang

2025 ACL

Can LLMs Deceive CLIP? Benchmarking Adversarial Compositionality of Pre-trained Multimodal Representation via Text Updates

Jaewoo Ahn, Heeseung Yun, Dayoon Ko et al.

2025 ACL

MTSA: Multi-turn Safety Alignment for LLMs through Multi-round Red-teaming

Weiyang Guo, Jing Li, Wenya Wang et al.

2025 ACL

InductionBench: LLMs Fail in the Simplest Complexity Class

Wenyue Hua, Tyler Wong, Fei Sun et al.

2025 ACL

Multi-document Summarization through Multi-document Event Relation Graph Reasoning in LLMs: a case study in Framing Bias Mitigation

Yuanyuan Lei, Ruihong Huang

2025 ACL

StitchLLM: Serving LLMs, One Block at a Time

Bodun Hu, Shuozhe Li, Saurabh Agarwal et al.

2025 ACL

From Informal to Formal – Incorporating and Evaluating LLMs on Natural Language Requirements to Verifiable Formal Proofs

Jialun Cao, Yaojie Lu, Meiziniu Li et al.

2025 ACL

Exposing the Achilles’ Heel: Evaluating LLMs Ability to Handle Mistakes in Mathematical Reasoning

Joykirat Singh, Akshay Nambi, Vibhav Vineet

2025 ACL

Understanding the Dark Side of LLMs’ Intrinsic Self-Correction

Qingjie Zhang, Di Wang, Haoting Qian et al.

2025 ACL

LLMs Caught in the Crossfire: Malware Requests and Jailbreak Challenges

Haoyang Li, Huan Gao, Zhiyuan Zhao et al.

2025 ACL

PIPER: Benchmarking and Prompting Event Reasoning Boundary of LLMs via Debiasing-Distillation Enhanced Tuning

Zhicong Lu, Changyuan Tian, Peiguang Li et al.

2025 ACL

LLMs Trust Humans More, That’s a Problem! Unveiling and Mitigating the Authority Bias in Retrieval-Augmented Generation

Yuxuan Li, Xinwei Guo, Jiashi Gao et al.

2025 ACL

Cross-Lingual Auto Evaluation for Assessing Multilingual LLMs

Sumanth Doddapaneni, Mohammed Safi Ur Rahman Khan, Dilip Venkatesh et al.

2025 ACL

Synergizing Unsupervised Episode Detection with LLMs for Large-Scale News Events

Priyanka Kargupta, Yunyi Zhang, Yizhu Jiao et al.

2025 ACL

From Perceptions to Decisions: Wildfire Evacuation Decision Prediction with Behavioral Theory-informed LLMs

Ruxiao Chen, Chenguang Wang, Yuran Sun et al.

2025 ACL

Assessing Reliability and Political Bias In LLMs’ Judgements of Formal and Material Inferences With Partisan Conclusions

Reto Gubelmann, Ghassen Karray

2025 ACL

A Theory of Response Sampling in LLMs: Part Descriptive and Part Prescriptive

Sarath Sivaprasad, Pramod Kaushik, Sahar Abdelnabi et al.

2025 ACL
