conftrace_

Papers

5,914 papers found · incl. 435 without abstracts Only with abstracts

Can LLMs Reason Abstractly Over Math Word Problems Without CoT? Disentangling Abstract Formulation From Arithmetic Computation

Ziling Cheng, Meng Cao, Leila Pishdad et al.

2025 EMNLP

ESGenius: Benchmarking LLMs on Environmental, Social, and Governance (ESG) and Sustainability Knowledge

Chaoyue He, Xin Zhou, Yi Wu et al.

2025 EMNLP

WISE: Weak-Supervision-Guided Step-by-Step Explanations for Multimodal LLMs in Image Classification

Yiwen Jiang, Deval Mehta, Siyuan Yan et al.

2025 EMNLP

Calibration Across Layers: Understanding Calibration Evolution in LLMs

Abhinav Joshi, Areeb Ahmad, Ashutosh Modi

2025 EMNLP

FLRC: Fine-grained Low-Rank Compressor for Efficient LLM Inference

Yu-Chen Lu, Chong-Yan Chen, Chi-Chih Chang et al.

2025 EMNLP

CoEvo: Coevolution of LLM and Retrieval Model for Domain-Specific Information Retrieval

Ang Li, Yiquan Wu, Yinghao Hu et al.

2025 EMNLP

Conan-Embedding-v2: Training an LLM from Scratch for Text Embeddings

Shiyu Li, Yang Tang, Ruijie Liu et al.

2025 EMNLP

Vision-and-Language Navigation with Analogical Textual Descriptions in LLMs

Yue Zhang, Tianyi Ma, Zun Wang et al.

2025 EMNLP

BTC-SAM: Leveraging LLMs for Generation of Bias Test Cases for Sentiment Analysis Models

Zsolt T. Kardkovács, Lynda Djennane, Anna Field et al.

2025 EMNLP

Controllable Memorization in LLMs via Weight Pruning

Chenjie Ni, Zhepeng Wang, Runxue Bao et al.

2025 EMNLP

DCIS: Efficient Length Extrapolation of LLMs via Divide-and-Conquer Scaling Factor Search

Lei Yang, Shaoyang Xu, Jianxiang Peng et al.

2025 EMNLP

DyePack: Provably Flagging Test Set Contamination in LLMs Using Backdoors

Yize Cheng, Wenxiao Wang, Mazda Moayeri et al.

2025 EMNLP

Jailbreak LLMs through Internal Stance Manipulation

Shuangjie Fu, Du Su, Beining Huang et al.

2025 EMNLP

Benchmark Profiling: Mechanistic Diagnosis of LLM Benchmarks

Dongjun Kim, Gyuho Shim, Yongchan Chun et al.

2025 EMNLP

Improving Chemical Understanding of LLMs via SMILES Parsing

Yunhui Jang, Jaehyung Kim, Sungsoo Ahn

2025 EMNLP

The State of Multilingual LLM Safety Research: From Measuring The Language Gap To Mitigating It

Zheng Xin Yong, Beyza Ermis, Marzieh Fadaee et al.

2025 EMNLP

From Capabilities to Performance: Evaluating Key Functional Properties of LLM Architectures in Penetration Testing

Lanxiao Huang, Daksh Dave, Tyler Cody et al.

2025 EMNLP

The Staircase of Ethics: Probing LLM Value Priorities through Multi-Step Induction to Complex Moral Dilemmas

Ya Wu, Qiang Sheng, Danding Wang et al.

2025 EMNLP

Sparse Neurons Carry Strong Signals of Question Ambiguity in LLMs

Zhuoxuan Zhang, Jinhao Duan, Edward Kim et al.

2025 EMNLP

Comparing human and LLM politeness strategies in free production

Haoran Zhao, Robert D. Hawkins

2025 EMNLP

CARMA: Enhanced Compositionality in LLMs via Advanced Regularisation and Mutual Information Alignment

Nura Aljaafari, Danilo Carvalho, Andre Freitas

2025 EMNLP

Can LLMs simulate the same correct solutions to free-response math problems as real students?

Yuya Asano, Diane Litman, Erin Walker

2025 EMNLP

Evaluating Behavioral Alignment in Conflict Dialogue: A Multi-Dimensional Comparison of LLM Agents and Humans

Deuksin Kwon, Kaleen Shrestha, Bin Han et al.

2025 EMNLP

Attention Eclipse: Manipulating Attention to Bypass LLM Safety-Alignment

Pedram Zaree, Md Abdullah Al Mamun, Quazi Mishkatul Alam et al.

2025 EMNLP

Implicit Values Embedded in How Humans and LLMs Complete Subjective Everyday Tasks

Arjun Arunasalam, Madison Pickering, Z. Berkay Celik et al.

2025 EMNLP