Research Explorer

Safeguard Fine-Tuned LLMs Through Pre- and Post-Tuning Model Merging

Hua Farn, Hsuan Su, Shachi H. Kumar et al.

2025 EMNLP

Butterfly Effects in Toolchains: A Comprehensive Analysis of Failed Parameter Filling in LLM Tool-Agent Systems

Qian Xiong, Yuekai Huang, Ziyou Jiang et al.

2025 EMNLP

FinLFQA: Evaluating Attributed Text Generation of LLMs in Financial Long-Form Question Answering

Yitao Long, Tiansheng Hu, Yilun Zhao et al.

2025 EMNLP

Zero-shot Graph Reasoning via Retrieval Augmented Framework with LLMs

Hanqing Li, Sharika Mahadevan, Kiran Jyothi Sheena et al.

2025 EMNLP

Dissecting Logical Reasoning in LLMs: A Fine-Grained Evaluation and Supervision Study

Yujun Zhou, Jiayi Ye, Zipeng Ling et al.

2025 EMNLP

Faster and Better LLMs via Latency-Aware Test-Time Scaling

Zili Wang, Tianyu Zhang, Haoli Bai et al.

2025 EMNLP

PolBiX: Detecting LLMs’ Political Bias in Fact-Checking through X-phemisms

Charlott Jakob, David Harbecke, Patrick Parschan et al.

2025 EMNLP

Low-Hallucination and Efficient Coreference Resolution with LLMs

Yujian Gan, Yuan Liang, Jinxia Xie et al.

2025 EMNLP

Your Mileage May Vary: How Empathy and Demographics Shape Human Preferences in LLM Responses

Yishan Wang, Amanda Cercas Curry, Flor Miriam Plaza-del-Arco

2025 EMNLP

Choosing a Model, Shaping a Future: Comparing LLM Perspectives on Sustainability and its Relationship with AI

Annika Bush, Meltem Aksoy, Markus Pauly et al.

2025 EMNLP

KurTail : Kurtosis-based LLM Quantization

Mohammad Sadegh Akhondzadeh, Aleksandar Bojchevski, Evangelos Eleftheriou et al.

2025 EMNLP

LLMs Reproduce Stereotypes of Sexual and Gender Minorities

Ruby Ostrow, Adam Lopez

2025 EMNLP

Accept or Deny? Evaluating LLM Fairness and Performance in Loan Approval across Table-to-Text Serialization Approaches

Israel Abebe Azime, Deborah D. Kanubala, Tejumade Afonja et al.

2025 EMNLP

Understanding and Improving Information Preservation in Prompt Compression for LLMs

Weronika Łajewska, Momchil Hardalov, Laura Aina et al.

2025 EMNLP

Beyond Surface Alignment: Rebuilding LLMs Safety Mechanism via Probabilistically Ablating Refusal Direction

Yuanbo Xie, Yingjie Zhang, Tianyun Liu et al.

2025 EMNLP

Distributed LLM Serving on Consumer-Grade GPUs by Reconciling Computation and Communication

Lewei Jin, Kui Zhang, Yongqi Chen et al.

2025 EMNLP

SafeToolBench: Pioneering a Prospective Benchmark to Evaluating Tool Utilization Safety in LLMs

Hongfei Xia, Hongru Wang, Zeming Liu et al.

2025 EMNLP

Beneath the Facade: Probing Safety Vulnerabilities in LLMs via Auto-Generated Jailbreak Prompts

Heehyeon Kim, Kyeongryul Lee, Joyce Jiyoung Whang

2025 EMNLP

Can Role Vectors Affect LLM Behaviour?

Daniele Potertì, Andrea Seveso, Fabio Mercorio

2025 EMNLP

Layer Duplication in LLMs

Neo Eyal, Nachum Dershowitz, Kfir Bar

2025 EMNLP

InFact: Informativeness Alignment for Improved LLM Factuality

Roi Cohen, Russa Biswas, Gerard de Melo

2025 EMNLP

Problem Solved? Information Extraction Design Space for Layout-Rich Documents using LLMs

Gaye Colakoglu, Gürkan Solmaz, Jonathan Fürst

2025 EMNLP

Following Occam’s Razor: Dynamic Combination of Structured Knowledge for Multi-Hop Question Answering using LLMs

Wei Chen, Zhi Zheng, Lili Zhao et al.

2025 EMNLP

AssistedDS: Benchmarking How External Domain Knowledge Assists LLMs in Automated Data Science

An Luo, Xun Xian, Jin Du et al.

2025 EMNLP

No Free Lunch: Retrieval-Augmented Generation Undermines Fairness in LLMs, Even for Vigilant Users

Mengxuan Hu, Hongyi Wu, Ronghang Zhu et al.

2025 EMNLP

Papers