Research Explorer

LLMEval-Med: A Real-world Clinical Benchmark for Medical LLMs with Physician Validation

Ming Zhang, Yujiong Shen, Zelin Li et al.

2025 EMNLP

LlmFixer: Fix the Helpfulness of Defensive Large Language Models

Zelong Yu, Xiaoming Zhang, Litian Zhang et al.

2025 EMNLP

LLM-Guided Co-Training for Text Classification

Md Mezbaur Rahman, Cornelia Caragea

2025 EMNLP

LLM-Guided Semantic Relational Reasoning for Multimodal Intent Recognition

Qianrui Zhou, Hua Xu, Yifan Wang et al.

2025 EMNLP

LLM-Independent Adaptive RAG: Let the Question Speak for Itself

Maria Marina, Nikolay Ivanov, Sergey Pletenev et al.

2025 EMNLP

LLMInit: A Free Lunch from Large Language Models for Selective Initialization of Recommendation

Weizhi Zhang, Liangwei Yang, Wooseong Yang et al.

2025 EMNLP

LLM Jailbreak Detection for (Almost) Free!

Guorui Chen, Yifan Xia, Xiaojun Jia et al.

2025 EMNLP

LLM×MapReduce-V3: Enabling Interactive In-Depth Survey Generation through a MCP-Driven Hierarchically Modular Agent System

Yu Chao, Siyu Lin, Xiaorong Wang et al.

2025 EMNLP

LLM-OREF: An Open Relation Extraction Framework Based on Large Language Models

Hongyao Tu, Liang Zhang, Yujie Lin et al.

2025 EMNLP

LLMs are Better Than You Think: Label-Guided In-Context Learning for Named Entity Recognition

Fan Bai, Hamid Hassanzadeh, Ardavan Saeedi et al.

2025 EMNLP

LLMs are Privacy Erasable

Zipeng Ye, Wenjian Luo

2025 EMNLP

LLMs as annotators of argumentation

Anna Lindahl

2025 EMNLP

LLMs as a synthesis between symbolic and distributed approaches to language

Gemma Boleda

2025 EMNLP

LLMs as World Models: Data-Driven and Human-Centered Pre-Event Simulation for Disaster Impact Assessment

Lingyao Li, Dawei Li, Zhenhui Ou et al.

2025 EMNLP

LLMs Behind the Scenes: Enabling Narrative Scene Illustration

Melissa Roemmele, John Joon Young Chung, Taewook Kim et al.

2025 EMNLP

LLMs Can Compensate for Deficiencies in Visual Representations

Sho Takishita, Jay Gala, Abdelrahman Mohamed et al.

2025 EMNLP

LLMs cannot spot math errors, even when allowed to peek into the solution

Kv Aditya Srivatsa, Kaushal Kumar Maurya, Ekaterina Kochmar

2025 EMNLP

LLMs Don’t Know Their Own Decision Boundaries: The Unreliability of Self-Generated Counterfactual Explanations

Harry Mayne, Ryan Othniel Kearns, Yushi Yang et al.

2025 EMNLP

LLMs for Bayesian Optimization in Scientific Domains: Are We There Yet?

Rushil Gupta, Jason Hartford, Bang Liu

2025 EMNLP

LLMs on a Budget? Say HOLA

Zohaib Hasan Siddiqui, Jiechao Gao, Ebad Shabbir et al.

2025 EMNLP

LLMsPark: A Benchmark for Evaluating Large Language Models in Strategic Gaming Contexts

Junhao Chen, Jingbo Sun, Xiang Li et al.

2025 EMNLP

LLMs Reproduce Stereotypes of Sexual and Gender Minorities

Ruby Ostrow, Adam Lopez

2025 EMNLP

LM2Protein: A Structure-to-Token Protein Large Language Model

Chang Zhou, Yuheng Shan, Pengan Chen et al.

2025 EMNLP

LMR-BENCH: Evaluating LLM Agent’s Ability on Reproducing Language Modeling Research

Shuo Yan, Ruochen Li, Ziming Luo et al.

2025 EMNLP

LM-Searcher: Cross-domain Neural Architecture Search with LLMs via Unified Numerical Encoding

Yuxuan Hu, Jihao Liu, Ke Wang et al.

2025 EMNLP

Papers