Research Explorer

Layer-Aware Representation Filtering: Purifying Finetuning Data to Preserve LLM Safety Alignment

Hao Li, Lijun Li, Zhenghao Lu et al.

2025 EMNLP

A Rigorous Evaluation of LLM Data Generation Strategies for Low-Resource Languages

Tatiana Anikina, Jan Cegin, Jakub Simko et al.

2025 EMNLP

A Middle Path for On-Premises LLM Deployment: Preserving Privacy Without Sacrificing Model Confidentiality

Hanbo Huang, Yihan Li, Bowen Jiang et al.

2025 EMNLP

A Training-Free Length Extrapolation Approach for LLMs: Greedy Attention Logit Interpolation

Yan Li, Tianyi Zhang, Zechuan Li et al.

2025 EMNLP

IndoSafety: Culturally Grounded Safety for LLMs in Indonesian Languages

Muhammad Falensi Azmi, Muhammad Dehan Al Kautsar, Alfan Farizki Wicaksono et al.

2025 EMNLP

Steering LLM Reasoning Through Bias-Only Adaptation

Viacheslav Sinii, Alexey Gorbatovski, Artem Cherepanov et al.

2025 EMNLP

FB-Bench: A Fine-Grained Multi-Task Benchmark for Evaluating LLMs’ Responsiveness to Human Feedback

Youquan Li, Miao Zheng, Fan Yang et al.

2025 EMNLP

LM-Searcher: Cross-domain Neural Architecture Search with LLMs via Unified Numerical Encoding

Yuxuan Hu, Jihao Liu, Ke Wang et al.

2025 EMNLP

Bitune: Leveraging Bidirectional Attention to Improve Decoder-Only LLMs

Dawid Jan Kopiczko, Tijmen Blankevoort, Yuki M Asano

2025 EMNLP

Disambiguation in Conversational Question Answering in the Era of LLMs and Agents: A Survey

Mehrab Tanjim, Yeonjun In, Xiang Chen et al.

2025 EMNLP

Enhancing LLM Text Detection with Retrieved Contexts and Logits Distribution Consistency

Zhaoheng Huang, Yutao Zhu, Ji-Rong Wen et al.

2025 EMNLP

AgentPro: Enhancing LLM Agents with Automated Process Supervision

Yuchen Deng, Shichen Fan, Naibo Wang et al.

2025 EMNLP

Learn and Unlearn: Addressing Misinformation in Multilingual LLMs

TaiMing Lu, Philipp Koehn

2025 EMNLP

PRISM: Efficient Long-Range Reasoning With Short-Context LLMs

Dulhan Jayalath, James Bradley Wendt, Nicholas Monath et al.

2025 EMNLP

Primus: A Pioneering Collection of Open-Source Datasets for Cybersecurity LLM Training

Yao-Ching Yu, Tsun-Han Chiang, Cheng-Wei Tsai et al.

2025 EMNLP

Bit-Flip Error Resilience in LLMs: A Comprehensive Analysis and Defense Framework

Yuhang Chen, Zhen Tan, Ajay Kumar Jaiswal et al.

2025 EMNLP

Calibrating LLM Confidence by Probing Perturbed Representation Stability

Reza Khanmohammadi, Erfan Miahi, Mehrsa Mardikoraem et al.

2025 EMNLP

CIFLEX: Contextual Instruction Flow for Sub-task Execution in Multi-Turn Interactions with a Single On-Device LLM

Juntae Lee, Jihwan Bang, Seunghan Yang et al.

2025 EMNLP

Latent Inter-User Difference Modeling for LLM Personalization

Yilun Qiu, Tianhao Shi, Xiaoyan Zhao et al.

2025 EMNLP

SelfRACG: Enabling LLMs to Self-Express and Retrieve for Code Generation

Qian Dong, Jia Chen, Qingyao Ai et al.

2025 EMNLP

AdamS: Momentum Itself Can Be A Normalizer for LLM Pretraining and Post-training

Huishuai Zhang, Bohan Wang, Luoxin Chen

2025 EMNLP

Demystifying Synthetic Data in LLM Pre-training: A Systematic Study of Scaling Laws, Benefits, and Pitfalls

Feiyang Kang, Newsha Ardalani, Michael Kuchnik et al.

2025 EMNLP

From Scores to Steps: Diagnosing and Improving LLM Performance in Evidence-Based Medical Calculations

Benlu Wang, Iris Xia, Yifan Zhang et al.

2025 EMNLP

Bridging External and Parametric Knowledge: Mitigating Hallucination of LLMs with Shared-Private Semantic Synergy in Dual-Stream Knowledge

Yi Sui, Chaozhuo Li, Chen Zhang et al.

2025 EMNLP

Identifying Unlearned Data in LLMs via Membership Inference Attacks

Advit Deepak, Megan Mou, Jing Huang et al.

2025 EMNLP
