Research Explorer

ContractEval: Benchmarking LLMs for Clause-Level Legal Risk Identification in Commercial Contracts

Shuang Liu, Zelong Li, Ruoyun Ma et al.

2025 EMNLP

Contemporary LLMs struggle with extracting formal legal arguments

Lena Held, Ivan Habernal

2025 EMNLP

Aligning LLMs for Thai Legal Question Answering with Efficient Semantic-Similarity Rewards

Pawitsapak Akarajaradwong, Chompakorn Chaksangchaichot, Pirat Pothavorn et al.

2025 EMNLP

Not ready for the bench: LLM legal interpretation is unstable and uncalibrated to human judgments

Abhishek Purushothama, Junghyun Min, Brandon Waldon et al.

2025 EMNLP

Are LLMs Court-Ready? Evaluating Frontier Models on Indian Legal Reasoning

Kush Juvekar, Arghya Bhattacharya, Sai Khadloya et al.

2025 EMNLP

Explanations explained. Influence of Free-text Explanations on LLMs and the Role of Implicit Knowledge

Andrea Zaninello, Roberto Dessi, Malvina Nissim et al.

2025 EMNLP

Latent Traits and Cross-Task Transfer: Deconstructing Dataset Interactions in LLM Fine-tuning

Shambhavi Krishna, Atharva Naik, Chaitali Agarwal et al.

2025 EMNLP

LLMs as annotators of argumentation

Anna Lindahl

2025 EMNLP

Beyond Human Judgment: A Bayesian Evaluation of LLMs’ Moral Values Understanding

Maciej Skorski, Alina Landowska

2025 EMNLP

Certain but not Probable? Differentiating Certainty from Probability in LLM Token Outputs for Probabilistic Scenarios

Autumn Toney, Ryan Wails

2025 EMNLP

On the Role of Unobserved Sequences on Sample-based Uncertainty Quantification for LLMs

Lucie Kunitomo-Jacquin, Edison Marrese-Taylor, Ken Fukuda

2025 EMNLP

Confidence-Based Response Abstinence: Improving LLM Trustworthiness via Activation-Based Uncertainty Estimation

Zhiqi Huang, Vivek Datla, Chenyang Zhu et al.

2025 EMNLP

Towards Trustworthy Summarization of Cardiovascular Articles: A Factuality-and-Uncertainty-Aware Biomedical LLM Approach

Eleni Partalidou, Tatiana Passali, Chrysoula Zerva et al.

2025 EMNLP

Causal Understanding by LLMs: The Role of Uncertainty

Oscar William Lithgow-Serrano, Vani Kanjirangat, Alessandro Antonucci

2025 EMNLP

Read Your Own Mind: Reasoning Helps Surface Self-Confidence Signals in LLMs

Jakub Podolak, Rajeev Verma

2025 EMNLP

Probing Gender Bias in Multilingual LLMs: A Case Study of Stereotypes in Persian

Ghazal Kalhor, Behnam Bahrak

2025 EMNLP

ValueCompass: A Framework for Measuring Contextual Value Alignment Between Human and LLMs

Hua Shen, Tiffany Knearem, Reshmi Ghosh et al.

2025 EMNLP

That Ain’t Right: Assessing LLM Performance on QA in African American and West African English Dialects

William Coggins, Jasmine McKenzie, Sangpil Youm et al.

2025 EMNLP

Findings of the WMT 2025 Shared Task on Model Compression: Early Insights on Compressing LLMs for Machine Translation

Marco Gaido, Roman Grundkiewicz, Thamme Gowda et al.

2025 EMNLP

Findings of the WMT 2025 Shared Task LLMs with Limited Resources for Slavic Languages: MT and QA

Shu Okabe, Daryna Dementieva, Marion Di Marco et al.

2025 EMNLP

Marco Large Translation Model at WMT2025: Transforming Translation Capability in LLMs via Quality-Aware Training and Decoding

Hao Wang, Linlong Xu, Heng Liu et al.

2025 EMNLP

A* Decoding for Machine Translation in LLMs - SRPOL Participation in WMT2025

Adam Dobrowolski, Paweł Przewłocki, Paweł Przybysz et al.

2025 EMNLP

IRB-MT at WMT25 Translation Task: A Simple Agentic System Using an Off-the-Shelf LLM

Ivan Grubišić, Damir Korencic

2025 EMNLP

Evaluation of LLM for English to Hindi Legal Domain Machine Translation Systems

Kshetrimayum Boynao Singh, Deepak Kumar, Asif Ekbal

2025 EMNLP

Tagged Span Annotation for Detecting Translation Errors in Reasoning LLMs

Taemin Yeom, Yonghyun Ryu, Yoonjung Choi et al.

2025 EMNLP
