Research Explorer

BanStereoSet: A Dataset to Measure Stereotypical Social Biases in LLMs for Bangla

Mahammed Kamruzzaman, Abdullah Al Monsur, Shrabon Kumar Das et al.

2025 ACL

The Two Paradigms of LLM Detection: Authorship Attribution vs. Authorship Verification

Janek Bevendorff, Matti Wiegmann, Emmelie Richter et al.

2025 ACL

RemoteRAG: A Privacy-Preserving LLM Cloud RAG Service

Yihang Cheng, Lan Zhang, Junyang Wang et al.

2025 ACL

ComparisonQA: Evaluating Factuality Robustness of LLMs Through Knowledge Frequency Control and Uncertainty

Qing Zong, Zhaowei Wang, Tianshi Zheng et al.

2025 ACL

Fraud-R1 : A Multi-Round Benchmark for Assessing the Robustness of LLM Against Augmented Fraud and Phishing Inducements

Shu Yang, Shenzhe Zhu, Zeyu Wu et al.

2025 ACL

System Prompt Hijacking via Permutation Triggers in LLM Supply Chains

Lu Yan, Siyuan Cheng, Xuan Chen et al.

2025 ACL

There’s No Such Thing as Simple Reasoning for LLMs

Nurul Fajrin Ariyani, Zied Bouraoui, Richard Booth et al.

2025 ACL

Beyond Semantic Entropy: Boosting LLM Uncertainty Quantification with Pairwise Semantic Similarity

Dang Nguyen, Ali Payani, Baharan Mirzasoleiman

2025 ACL

Arbiters of Ambivalence: Challenges of using LLMs in No-Consensus tasks

Bhaktipriya Radharapu, Manon Revel, Megan Ung et al.

2025 ACL

Efficient but Vulnerable: Benchmarking and Defending LLM Batch Prompting Attack

Murong Yue, Ziyu Yao

2025 ACL

Re-TASK: Revisiting LLM Tasks from Capability, Skill, and Knowledge Perspectives

Zhihu Wang, Shiwan Zhao, Yu Wang et al.

2025 ACL

Unlearning Backdoor Attacks for LLMs with Weak-to-Strong Knowledge Distillation

Shuai Zhao, Xiaobao Wu, Cong-Duy T Nguyen et al.

2025 ACL

LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-Context QA

Jiajie Zhang, Yushi Bai, Xin Lv et al.

2025 ACL

Combining the Best of Both Worlds: A Method for Hybrid NMT and LLM Translation

Zhanglin Wu, Daimeng Wei, Xiaoyu Chen et al.

2025 ACL

ClaimPKG: Enhancing Claim Verification via Pseudo-Subgraph Generation with Lightweight Specialized LLM

Hoang Pham, Thanh-Do Nguyen, Khac-Hoai Nam Bui

2025 ACL

Self-Tuning: Instructing LLMs to Effectively Acquire New Knowledge through Self-Teaching

Xiaoying Zhang, Baolin Peng, Ye Tian et al.

2025 ACL

Memory or Reasoning? Explore How LLMs Compute Mixed Arithmetic Expressions

Chengzhi Li, Heyan Huang, Ping Jian et al.

2025 ACL

CA-GAR: Context-Aware Alignment of LLM Generation for Document Retrieval

Heng Yu, Junfeng Kang, Rui Li et al.

2025 ACL

CipherBank: Exploring the Boundary of LLM Reasoning Capabilities through Cryptography Challenge

Yu Li, Qizhi Pei, Mengyuan Sun et al.

2025 ACL

Which Retain Set Matters for LLM Unlearning? A Case Study on Entity Unlearning

Hwan Chang, Hwanhee Lee

2025 ACL

Mitigate Position Bias in LLMs via Scaling a Single Hidden States Channel

Yijiong Yu, Huiqiang Jiang, Xufang Luo et al.

2025 ACL

Why Not Act on What You Know? Unleashing Safety Potential of LLMs via Self-Aware Guard Enhancement

Peng Ding, Jun Kuang, ZongYu Wang et al.

2025 ACL

Beyond Reactive Safety: Risk-Aware LLM Alignment via Long-Horizon Simulation

Chenkai Sun, Denghui Zhang, ChengXiang Zhai et al.

2025 ACL

Probability-Consistent Preference Optimization for Enhanced LLM Reasoning

Yunqiao Yang, Houxing Ren, Zimu Lu et al.

2025 ACL

CAVGAN: Unifying Jailbreak and Defense of LLMs via Generative Adversarial Attacks on their Internal Representations

Xiaohu Li, Yunfeng Ning, Zepeng Bao et al.

2025 ACL

Papers