conftrace_

Papers

5,479 papers found · 435 more without abstracts hidden
Mind the Blind Spots: A Focus-Level Evaluation Framework for LLM Reviews
Hyungyu Shin, Jingyu Tang, Yoonjoo Lee et al.
2025 EMNLP
AgentDiagnose: An Open Toolkit for Diagnosing LLM Agent Trajectories
Tianyue Ou, Wanyao Guo, Apurva Gandhi et al.
2025 EMNLP
MedTutor: A Retrieval-Augmented LLM System for Case-Based Medical Education
Dongsuk Jang, Ziyao Shangguan, Kyle Tegtmeyer et al.
2025 EMNLP
TruthTorchLM: A Comprehensive Library for Predicting Truthfulness in LLM Outputs
Duygu Nur Yaldiz, Yavuz Faruk Bakman, Sungmin Kang et al.
2025 EMNLP
SAGE: A Generic Framework for LLM Safety Evaluation
Madhur Jindal, Hari Shrawgi, Parag Agrawal et al.
2025 EMNLP
Aligning LLMs for Multilingual Consistency in Enterprise Applications
Amit Agarwal, Hansa Meghwani, Hitesh Laxmichand Patel et al.
2025 EMNLP
ProCut: LLM Prompt Compression via Attribution Estimation
Zhentao Xu, Fengyi Li, Albert C. Chen et al.
2025 EMNLP
AutoCVSS: Assessing the Performance of LLMs for Automated Software Vulnerability Scoring
Davide Sanvito, Giovanni Arriciati, Giuseppe Siracusano et al.
2025 EMNLP
Benchmarking LLM Faithfulness in RAG with Evolving Leaderboards
Manveer Singh Tamber, Forrest Sheng Bao, Chenyu Xu et al.
2025 EMNLP
LLMs on a Budget? Say HOLA
Zohaib Hasan Siddiqui, Jiechao Gao, Ebad Shabbir et al.
2025 EMNLP
JSON Whisperer: Efficient JSON Editing with LLMs
Sarel Duanis, Asnat Greenstein-Messica, Eliya Habba
2025 EMNLP