Research Explorer

How Numerical Precision Affects Arithmetical Reasoning Capabilities of LLMs

Guhao Feng, Kai Yang, Yuntian Gu et al.

2025 ACL

BayesKD: Bayesian Knowledge Distillation for Compact LLMs in Constrained Fine-tuning Scenarios

Wei Li, Lujun Li, Mark G. Lee et al.

2025 ACL

Detecting and Mitigating Challenges in Zero-Shot Video Summarization with Video LLMs

Luca Cagliero, Lorenzo Vaiani, Eliana Pastor et al.

2025 ACL

Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs

Runchu Tian, Yanghao Li, Yuepeng Fu et al.

2025 ACL

Variable Layerwise Quantization: A Simple and Effective Approach to Quantize LLMs

Razvan-Gabriel Dumitru, Vikas Yadav, Rishabh Maheshwary et al.

2025 ACL

Probing the Geometry of Truth: Consistency and Generalization of Truth Directions in LLMs Across Logical Transformations and Question Answering Tasks

Yuntai Bao, Xuhong Zhang, Tianyu Du et al.

2025 ACL

Profiling News Media for Factuality and Bias Using LLMs and the Fact-Checking Methodology of Human Experts

Zain Muhammad Mujahid, Dilshod Azizov, Maha Tufail Agro et al.

2025 ACL

SHARP: Unlocking Interactive Hallucination via Stance Transfer in Role-Playing LLMs

Chuyi Kong, Ziyang Luo, Hongzhan Lin et al.

2025 ACL

SWE-Fixer: Training Open-Source LLMs for Effective and Efficient GitHub Issue Resolution

Chengxing Xie, Bowen Li, Chang Gao et al.

2025 ACL

From Misleading Queries to Accurate Answers: A Three-Stage Fine-Tuning Method for LLMs

Guocong Li, Weize Liu, Yihang Wu et al.

2025 ACL

UAQFact: Evaluating Factual Knowledge Utilization of LLMs on Unanswerable Questions

Chuanyuan Tan, Wenbiao Shao, Hao Xiong et al.

2025 ACL

Domain Regeneration: How well do LLMs match syntactic properties of text domains?

Da Ju, Hagen Blix, Adina Williams

2025 ACL

Training Long-Context LLMs Efficiently via Chunk-wise Optimization

Wenhao Li, Yuxin Zhang, Gen Luo et al.

2025 ACL

A Case Study of Cross-Lingual Zero-Shot Generalization for Classical Languages in LLMs

V.S.D.S.Mahesh Akavarapu, Hrishikesh Terdalkar, Pramit Bhattacharyya et al.

2025 ACL

Cross-Lingual Transfer of Debiasing and Detoxification in Multilingual LLMs: An Extensive Investigation

Vera Neplenbroek, Arianna Bisazza, Raquel Fernández

2025 ACL

Scaling LLMs’ Social Reasoning: Sprinkle Cognitive “Aha Moment” into Fundamental Long-thought Logical Capabilities

Guiyang Hou, Wenqi Zhang, Zhe Zheng et al.

2025 ACL

LongFaith: Enhancing Long-Context Reasoning in LLMs with Faithful Synthetic Data

Cehao Yang, Xueyuan Lin, Chengjin Xu et al.

2025 ACL

BanStereoSet: A Dataset to Measure Stereotypical Social Biases in LLMs for Bangla

Mahammed Kamruzzaman, Abdullah Al Monsur, Shrabon Kumar Das et al.

2025 ACL

ComparisonQA: Evaluating Factuality Robustness of LLMs Through Knowledge Frequency Control and Uncertainty

Qing Zong, Zhaowei Wang, Tianshi Zheng et al.

2025 ACL

There’s No Such Thing as Simple Reasoning for LLMs

Nurul Fajrin Ariyani, Zied Bouraoui, Richard Booth et al.

2025 ACL

Arbiters of Ambivalence: Challenges of using LLMs in No-Consensus tasks

Bhaktipriya Radharapu, Manon Revel, Megan Ung et al.

2025 ACL

Unlearning Backdoor Attacks for LLMs with Weak-to-Strong Knowledge Distillation

Shuai Zhao, Xiaobao Wu, Cong-Duy T Nguyen et al.

2025 ACL

LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-Context QA

Jiajie Zhang, Yushi Bai, Xin Lv et al.

2025 ACL

Is LLM an Overconfident Judge? Unveiling the Capabilities of LLMs in Detecting Offensive Language with Annotation Disagreement

Junyu Lu, Kai Ma, Kaichun Wang et al.

2025 ACL

Self-Tuning: Instructing LLMs to Effectively Acquire New Knowledge through Self-Teaching

Xiaoying Zhang, Baolin Peng, Ye Tian et al.

2025 ACL
