Research Explorer

DyePack: Provably Flagging Test Set Contamination in LLMs Using Backdoors

Yize Cheng, Wenxiao Wang, Mazda Moayeri et al.

2025 EMNLP

Jailbreak LLMs through Internal Stance Manipulation

Shuangjie Fu, Du Su, Beining Huang et al.

2025 EMNLP

Benchmark Profiling: Mechanistic Diagnosis of LLM Benchmarks

Dongjun Kim, Gyuho Shim, Yongchan Chun et al.

2025 EMNLP

Improving Chemical Understanding of LLMs via SMILES Parsing

Yunhui Jang, Jaehyung Kim, Sungsoo Ahn

2025 EMNLP

The State of Multilingual LLM Safety Research: From Measuring The Language Gap To Mitigating It

Zheng Xin Yong, Beyza Ermis, Marzieh Fadaee et al.

2025 EMNLP

From Capabilities to Performance: Evaluating Key Functional Properties of LLM Architectures in Penetration Testing

Lanxiao Huang, Daksh Dave, Tyler Cody et al.

2025 EMNLP

The Staircase of Ethics: Probing LLM Value Priorities through Multi-Step Induction to Complex Moral Dilemmas

Ya Wu, Qiang Sheng, Danding Wang et al.

2025 EMNLP

Sparse Neurons Carry Strong Signals of Question Ambiguity in LLMs

Zhuoxuan Zhang, Jinhao Duan, Edward Kim et al.

2025 EMNLP

Comparing human and LLM politeness strategies in free production

Haoran Zhao, Robert D. Hawkins

2025 EMNLP

CARMA: Enhanced Compositionality in LLMs via Advanced Regularisation and Mutual Information Alignment

Nura Aljaafari, Danilo Carvalho, Andre Freitas

2025 EMNLP

Can LLMs simulate the same correct solutions to free-response math problems as real students?

Yuya Asano, Diane Litman, Erin Walker

2025 EMNLP

Evaluating Behavioral Alignment in Conflict Dialogue: A Multi-Dimensional Comparison of LLM Agents and Humans

Deuksin Kwon, Kaleen Shrestha, Bin Han et al.

2025 EMNLP

Attention Eclipse: Manipulating Attention to Bypass LLM Safety-Alignment

Pedram Zaree, Md Abdullah Al Mamun, Quazi Mishkatul Alam et al.

2025 EMNLP

Implicit Values Embedded in How Humans and LLMs Complete Subjective Everyday Tasks

Arjun Arunasalam, Madison Pickering, Z. Berkay Celik et al.

2025 EMNLP

Let’s Reason Formally: Natural-Formal Hybrid Reasoning Enhances LLM’s Math Capability

Ruida Wang, Yuxin Li, Yi R. Fung et al.

2025 EMNLP

Fair or Framed? Political Bias in News Articles Generated by LLMs

Junho Yoo, Youhyun Shin

2025 EMNLP

Calibrating LLMs for Text-to-SQL Parsing by Leveraging Sub-clause Frequencies

Terrance Liu, Shuyi Wang, Daniel Preotiuc-Pietro et al.

2025 EMNLP

REACT: Representation Extraction And Controllable Tuning to Overcome Overfitting in LLM Knowledge Editing

Haitian Zhong, Yuhuan Liu, Ziyang Xu et al.

2025 EMNLP

PychoAgent: Psychology-driven LLM Agents for Explainable Panic Prediction on Social Media during Sudden Disaster Events

Mengzhu Liu, Zhengqiu Zhu, Chuan Ai et al.

2025 EMNLP

Stepwise Reasoning Checkpoint Analysis: A Test Time Scaling Method to Enhance LLMs’ Reasoning

Zezhong Wang, Xingshan Zeng, Weiwen Liu et al.

2025 EMNLP

RJE: A Retrieval-Judgment-Exploration Framework for Efficient Knowledge Graph Question Answering with LLMs

Can Lin, Zhengwang Jiang, Ling Zheng et al.

2025 EMNLP

Bias Mitigation or Cultural Commonsense? Evaluating LLMs with a Japanese Dataset

Taisei Yamamoto, Ryoma Kumon, Danushka Bollegala et al.

2025 EMNLP

Chameleon LLMs: User Personas Influence Chatbot Personality Shifts

Jane Xing, Tianyi Niu, Shashank Srivastava

2025 EMNLP

SynC-LLM: Generation of Large-Scale Synthetic Circuit Code with Hierarchical Language Models

Shang Liu, Yao Lu, Wenji Fang et al.

2025 EMNLP

Why Stop at One Error? Benchmarking LLMs as Data Science Code Debuggers for Multi-Hop and Multi-Bug Errors

Zhiyu Yang, Shuo Wang, Yukun Yan et al.

2025 EMNLP

Papers