conftrace_

Papers

5,479 papers found · 435 more without abstracts hidden Show all

F²Bench: An Open-ended Fairness Evaluation Benchmark for LLMs with Factuality Considerations

Tian Lan, Jiang Li, Yemin Wang et al.

2025 EMNLP

CodeMixBench: Evaluating Code-Mixing Capabilities of LLMs Across 18 Languages

Yilun Yang, Yekun Chai

2025 EMNLP

Unveiling Internal Reasoning Modes in LLMs: A Deep Dive into Latent Reasoning vs. Factual Shortcuts with Attribute Rate Ratio

Yiran Yang, Haifeng Sun, Jingyu Wang et al.

2025 EMNLP

LLMs Behind the Scenes: Enabling Narrative Scene Illustration

Melissa Roemmele, John Joon Young Chung, Taewook Kim et al.

2025 EMNLP

FilBench: Can LLMs Understand and Generate Filipino?

Lester James Validad Miranda, Elyanah Aco, Conner G. Manuel et al.

2025 EMNLP

Code to Think, Think to Code: A Survey on Code-Enhanced Reasoning and Reasoning-Driven Code Intelligence in LLMs

Dayu Yang, Tianyang Liu, Daoan Zhang et al.

2025 EMNLP

User Feedback in Human-LLM Dialogues: A Lens to Understand Users But Noisy as a Learning Signal

Yuhan Liu, Michael JQ Zhang, Eunsol Choi

2025 EMNLP

Read to Hear: A Zero-Shot Pronunciation Assessment Using Textual Descriptions and LLMs

Yu-Wen Chen, Melody Ma, Julia Hirschberg

2025 EMNLP

Ask Patients with Patience: Enabling LLMs for Human-Centric Medical Dialogue with Grounded Reasoning

Jiayuan Zhu, Jiazhen Pan, Yuyuan Liu et al.

2025 EMNLP

Unleashing the Reasoning Potential of LLMs by Critique Fine-Tuning on One Problem

Yubo Wang, Ping Nie, Kai Zou et al.

2025 EMNLP

SAND: Boosting LLM Agents with Self-Taught Action Deliberation

Yu Xia, Yiran Jenny Shen, Junda Wu et al.

2025 EMNLP

LLMs as World Models: Data-Driven and Human-Centered Pre-Event Simulation for Disaster Impact Assessment

Lingyao Li, Dawei Li, Zhenhui Ou et al.

2025 EMNLP

Mind the Value-Action Gap: Do LLMs Act in Alignment with Their Values?

Hua Shen, Nicholas Clark, Tanu Mitra

2025 EMNLP

FANS: Formal Answer Selection for LLM Natural Language Math Reasoning Using Lean4

Jiarui Yao, Ruida Wang, Tong Zhang

2025 EMNLP

Humanizing Machines: Rethinking LLM Anthropomorphism Through a Multi-Level Framework of Design

Yunze Xiao, Lynnette Hui Xian Ng, Jiarui Liu et al.

2025 EMNLP

TokenSkip: Controllable Chain-of-Thought Compression in LLMs

Heming Xia, Chak Tou Leong, Wenjie Wang et al.

2025 EMNLP

Why Do Some Inputs Break Low-Bit LLM Quantization?

Ting-Yun Chang, Muru Zhang, Jesse Thomason et al.

2025 EMNLP

Exploring Changes in Nation Perception with Nationality-Assigned Personas in LLMs

Mahammed Kamruzzaman, Gene Louis Kim

2025 EMNLP

RAG-Instruct: Boosting LLMs with Diverse Retrieval-Augmented Instructions

Wanlong Liu, Junying Chen, Ke Ji et al.

2025 EMNLP

SmartBench: Is Your LLM Truly a Good Chinese Smartphone Assistant?

Xudong Lu, Haohao Gao, Renshou Wu et al.

2025 EMNLP

Multimedia Event Extraction with LLM Knowledge Editing

Jiaao Yu, Yijing Lin, Zhipeng Gao et al.

2025 EMNLP

Exploring the Impact of Personality Traits on LLM Bias and Toxicity

Shuo Wang, Renhao Li, Xi Chen et al.

2025 EMNLP

BannerAgency: Advertising Banner Design with Multimodal LLM Agents

Heng Wang, Yotaro Shimose, Shingo Takamatsu

2025 EMNLP

Training LLMs to be Better Text Embedders through Bidirectional Reconstruction

Chang Su, Dengliang Shi, Siyuan Huang et al.

2025 EMNLP

CoLA: Compute-Efficient Pre-Training of LLMs via Low-Rank Activation

Ziyue Liu, Ruijie Zhang, Zhengyang Wang et al.

2025 EMNLP