Research Explorer

From Informal to Formal – Incorporating and Evaluating LLMs on Natural Language Requirements to Verifiable Formal Proofs

Jialun Cao, Yaojie Lu, Meiziniu Li et al.

2025 ACL

Exposing the Achilles’ Heel: Evaluating LLMs Ability to Handle Mistakes in Mathematical Reasoning

Joykirat Singh, Akshay Nambi, Vibhav Vineet

2025 ACL

Understanding the Dark Side of LLMs’ Intrinsic Self-Correction

Qingjie Zhang, Di Wang, Haoting Qian et al.

2025 ACL

Just a Scratch: Enhancing LLM Capabilities for Self-harm Detection through Intent Differentiation and Emoji Interpretation

Soumitra Ghosh, Gopendra Vikram Singh, Shambhavi et al.

2025 ACL

Contrastive Learning on LLM Back Generation Treebank for Cross-domain Constituency Parsing

Peiming Guo, Meishan Zhang, Jianling Li et al.

2025 ACL

LLM×MapReduce: Simplified Long-Sequence Processing using Large Language Models

Zihan Zhou, Chong Li, Xinyi Chen et al.

2025 ACL

LLMs Caught in the Crossfire: Malware Requests and Jailbreak Challenges

Haoyang Li, Huan Gao, Zhiyuan Zhao et al.

2025 ACL

Rethinking the Role of Prompting Strategies in LLM Test-Time Scaling: A Perspective of Probability Theory

Yexiang Liu, Zekun Li, Zhi Fang et al.

2025 ACL

Agentic Reasoning: A Streamlined Framework for Enhancing LLM Reasoning with Agentic Tools

Junde Wu, Jiayuan Zhu, Yuyuan Liu et al.

2025 ACL

PIPER: Benchmarking and Prompting Event Reasoning Boundary of LLMs via Debiasing-Distillation Enhanced Tuning

Zhicong Lu, Changyuan Tian, Peiguang Li et al.

2025 ACL

LLMs Trust Humans More, That’s a Problem! Unveiling and Mitigating the Authority Bias in Retrieval-Augmented Generation

Yuxuan Li, Xinwei Guo, Jiashi Gao et al.

2025 ACL

Cross-Lingual Auto Evaluation for Assessing Multilingual LLMs

Sumanth Doddapaneni, Mohammed Safi Ur Rahman Khan, Dilip Venkatesh et al.

2025 ACL

Reconsidering LLM Uncertainty Estimation Methods in the Wild

Yavuz Faruk Bakman, Duygu Nur Yaldiz, Sungmin Kang et al.

2025 ACL

Synergizing Unsupervised Episode Detection with LLMs for Large-Scale News Events

Priyanka Kargupta, Yunyi Zhang, Yizhu Jiao et al.

2025 ACL

The Task Shield: Enforcing Task Alignment to Defend Against Indirect Prompt Injection in LLM Agents

Feiran Jia, Tong Wu, Xin Qin et al.

2025 ACL

From Perceptions to Decisions: Wildfire Evacuation Decision Prediction with Behavioral Theory-informed LLMs

Ruxiao Chen, Chenguang Wang, Yuran Sun et al.

2025 ACL

Assessing Reliability and Political Bias In LLMs’ Judgements of Formal and Material Inferences With Partisan Conclusions

Reto Gubelmann, Ghassen Karray

2025 ACL

A Theory of Response Sampling in LLMs: Part Descriptive and Part Prescriptive

Sarath Sivaprasad, Pramod Kaushik, Sahar Abdelnabi et al.

2025 ACL

Stochastic Chameleons: Irrelevant Context Hallucinations Reveal Class-Based (Mis)Generalization in LLMs

Ziling Cheng, Meng Cao, Marc-Antoine Rondeau et al.

2025 ACL

SubLIME: Subset Selection via Rank Correlation Prediction for Data-Efficient LLM Evaluation

Gayathri Saranathan, Cong Xu, Mahammad Parwez Alam et al.

2025 ACL

Activating Distributed Visual Region within LLMs for Efficient and Effective Vision-Language Training and Inference

Siyuan Wang, Dianyi Wang, Chengxing Zhou et al.

2025 ACL

Multi-Modality Expansion and Retention for LLMs through Parameter Merging and Decoupling

Junlin Li, Guodong Du, Jing Li et al.

2025 ACL

Hidden in Plain Sight: Evaluation of the Deception Detection Capabilities of LLMs in Multimodal Settings

Md Messal Monem Miah, Adrita Anika, Xi Shi et al.

2025 ACL

Exploiting Contextual Knowledge in LLMs through 𝒱-usable Information based Layer Enhancement

Xiaowei Yuan, Zhao Yang, Ziyang Huang et al.

2025 ACL

Unintended Harms of Value-Aligned LLMs: Psychological and Empirical Insights

Sooyung Choi, Jaehyeok Lee, Xiaoyuan Yi et al.

2025 ACL

Papers