Research Explorer

AgentPro: Enhancing LLM Agents with Automated Process Supervision

Yuchen Deng, Shichen Fan, Naibo Wang et al.

2025 EMNLP

Learn and Unlearn: Addressing Misinformation in Multilingual LLMs

TaiMing Lu, Philipp Koehn

2025 EMNLP

PRISM: Efficient Long-Range Reasoning With Short-Context LLMs

Dulhan Jayalath, James Bradley Wendt, Nicholas Monath et al.

2025 EMNLP

Primus: A Pioneering Collection of Open-Source Datasets for Cybersecurity LLM Training

Yao-Ching Yu, Tsun-Han Chiang, Cheng-Wei Tsai et al.

2025 EMNLP

Bit-Flip Error Resilience in LLMs: A Comprehensive Analysis and Defense Framework

Yuhang Chen, Zhen Tan, Ajay Kumar Jaiswal et al.

2025 EMNLP

Calibrating LLM Confidence by Probing Perturbed Representation Stability

Reza Khanmohammadi, Erfan Miahi, Mehrsa Mardikoraem et al.

2025 EMNLP

CIFLEX: Contextual Instruction Flow for Sub-task Execution in Multi-Turn Interactions with a Single On-Device LLM

Juntae Lee, Jihwan Bang, Seunghan Yang et al.

2025 EMNLP

Latent Inter-User Difference Modeling for LLM Personalization

Yilun Qiu, Tianhao Shi, Xiaoyan Zhao et al.

2025 EMNLP

SelfRACG: Enabling LLMs to Self-Express and Retrieve for Code Generation

Qian Dong, Jia Chen, Qingyao Ai et al.

2025 EMNLP

AdamS: Momentum Itself Can Be A Normalizer for LLM Pretraining and Post-training

Huishuai Zhang, Bohan Wang, Luoxin Chen

2025 EMNLP

Demystifying Synthetic Data in LLM Pre-training: A Systematic Study of Scaling Laws, Benefits, and Pitfalls

Feiyang Kang, Newsha Ardalani, Michael Kuchnik et al.

2025 EMNLP

From Scores to Steps: Diagnosing and Improving LLM Performance in Evidence-Based Medical Calculations

Benlu Wang, Iris Xia, Yifan Zhang et al.

2025 EMNLP

Bridging External and Parametric Knowledge: Mitigating Hallucination of LLMs with Shared-Private Semantic Synergy in Dual-Stream Knowledge

Yi Sui, Chaozhuo Li, Chen Zhang et al.

2025 EMNLP

Identifying Unlearned Data in LLMs via Membership Inference Attacks

Advit Deepak, Megan Mou, Jing Huang et al.

2025 EMNLP

LLMs cannot spot math errors, even when allowed to peek into the solution

Kv Aditya Srivatsa, Kaushal Kumar Maurya, Ekaterina Kochmar

2025 EMNLP

Can LLMs be Good Graph Judge for Knowledge Graph Construction?

Haoyu Huang, Chong Chen, Zeang Sheng et al.

2025 EMNLP

NileChat: Towards Linguistically Diverse and Culturally Aware LLMs for Local Communities

Abdellah El Mekki, Houdaifa Atou, Omer Nacar et al.

2025 EMNLP

Collaborative Beam Search: Enhancing LLM Reasoning via Collective Consensus

Yangyifan Xu, Shuo Ren, Jiajun Zhang

2025 EMNLP

Stimulate the Critical Thinking of LLMs via Debiasing Discussion

Ruiyu Xiao, Lei Wu, Yuanxing Liu et al.

2025 EMNLP

Polysemantic Dropout: Conformal OOD Detection for Specialized LLMs

Ayush Gupta, Ramneet Kaur, Anirban Roy et al.

2025 EMNLP

Facilitating Cognitive Accessibility with LLMs: A Multi-Task Approach to Easy-to-Read Text Generation

François Ledoyen, Gaël Dias, Jeremie Pantin et al.

2025 EMNLP

Toxicity Red-Teaming: Benchmarking LLM Safety in Singapore’s Low-Resource Languages

Yujia Hu, Ming Shan Hee, Preslav Nakov et al.

2025 EMNLP

Self-Augmented Preference Alignment for Sycophancy Reduction in LLMs

Chien Hung Chen, Hen-Hsen Huang, Hsin-Hsi Chen

2025 EMNLP

Towards Advanced Mathematical Reasoning for LLMs via First-Order Logic Theorem Proving

Chuxue Cao, Mengze Li, Juntao Dai et al.

2025 EMNLP

CityEQA: A Hierarchical LLM Agent on Embodied Question Answering Benchmark in City Space

Yong Zhao, Kai Xu, Zhengqiu Zhu et al.

2025 EMNLP
