Research Explorer

AutoCVSS: Assessing the Performance of LLMs for Automated Software Vulnerability Scoring

Davide Sanvito, Giovanni Arriciati, Giuseppe Siracusano et al.

2025 EMNLP

Graph of Attacks with Pruning: Optimizing Stealthy Jailbreak Prompt Generation for Enhanced LLM Content Moderation

Daniel Schwartz, Dmitriy Bespalov, Zhe Wang et al.

2025 EMNLP

Memory-Efficient Backpropagation for Fine-Tuning LLMs on Resource-Constrained Mobile Devices

Congzheng Song, Xinyu Tang

2025 EMNLP

Benchmarking LLM Faithfulness in RAG with Evolving Leaderboards

Manveer Singh Tamber, Forrest Sheng Bao, Chenyu Xu et al.

2025 EMNLP

Group Preference Alignment: Customizing LLM Responses from In-Situ Conversations Only When Needed

Ishani Mondal, Jack W. Stokes, Sujay Kumar Jauhar et al.

2025 EMNLP

Can LLMs Narrate Tabular Data? An Evaluation Framework for Natural Language Representations of Text-to-SQL System Outputs

Jyotika Singh, Weiyi Sun, Amit Agarwal et al.

2025 EMNLP

Auto prompting without training labels: An LLM cascade for product quality assessment in e-commerce catalogs

Soham Satyadharma, Fatemeh Sheikholeslami, Swati Kaul et al.

2025 EMNLP

LLMs on a Budget? Say HOLA

Zohaib Hasan Siddiqui, Jiechao Gao, Ebad Shabbir et al.

2025 EMNLP

Learning from LLM Agents: In-Context Generative Models for Text Casing in E-Commerce Ads

Yingxue Zhou, Tan Zhu, Tao Zeng et al.

2025 EMNLP

AutoQual: An LLM Agent for Automated Discovery of Interpretable Features for Review Quality Assessment

Xiaochong Lan, Jie Feng, Yinxing Liu et al.

2025 EMNLP

JSON Whisperer: Efficient JSON Editing with LLMs

Sarel Duanis, Asnat Greenstein-Messica, Eliya Habba

2025 EMNLP

Spot the BlindSpots: Systematic Identification and Quantification of Fine-Grained LLM Biases in Contact Center Call Summarization

Kawin Mayilvaghanan, Siddhant Gupta, Ayush Kumar

2025 EMNLP

TOBUGraph: Knowledge Graph-Based Retrieval for Enhanced LLM Performance Beyond RAG

Savini Kashmira, Jayanaka L. Dantanarayana, Joshua Brodsky et al.

2025 EMNLP

Format Inertia: A Failure Mechanism of LLMs in Medical Pre-Consultation

Seungseop Lim, Gibaeg Kim, Wooseok Han et al.

2025 EMNLP

Encouraging Good Processes Without the Need for Good Answers: Reinforcement Learning for LLM Agent Planning

Zhiwei Li, Yong Hu, Wenqing Wang

2025 EMNLP

Scaling Down, Serving Fast: Compressing and Deploying Efficient LLMs for Recommendation Systems

Kayhan Behdin, Ata Fatahibaarzi, Qingquan Song et al.

2025 EMNLP

Group, Embed and Reason: A Hybrid LLM and Embedding Framework for Semantic Attribute Alignment

Shramona Chakraborty, Shashank Mujumdar, Nitin Gupta et al.

2025 EMNLP

How Accurate Are LLMs at Multi-Question Answering on Conversational Transcripts?

Xiliang Zhu, Shi Zong, David Rossouw

2025 EMNLP

Beyond Pointwise Scores: Decomposed Criteria-Based Evaluation of LLM Responses

Fangyi Yu, Nabeel Seedat, Drahomira Herrmannova et al.

2025 EMNLP

Scalable and Cost Effective High-Cardinality Classification with LLMs via Multi-View Label Representations and Retrieval Augmentation

Anup Pattnaik, Sasanka Vutla, Hamvir Dev et al.

2025 EMNLP

Zero-knowledge LLM hallucination detection and mitigation through fine-grained cross-model consistency

Aman Goel, Daniel Schwartz, Yanjun Qi

2025 EMNLP

LLMInit: A Free Lunch from Large Language Models for Selective Initialization of Recommendation

Weizhi Zhang, Liangwei Yang, Wooseong Yang et al.

2025 EMNLP

LLM Agents Implement an NLG System from Scratch: Building Interpretable Rule-Based RDF-to-Text Generators

Mateusz Lango, Ondrej Dusek

2025 EMNLP

Leveraging LLMs to Streamline the Review of Public Funding Applications

João DS Marques, Andre Vicente Duarte, André Mendes Marques de Carvalho et al.

2025 EMNLP

AttributeForge: An Agentic LLM Framework for Automated Product Schema Modeling

Yunhan Huang, Klevis Ramo, Andrea Iovine et al.

2025 EMNLP
