Research Explorer

Translating Tax Law to Code with LLMs: A Benchmark and Evaluation Framework

Gabriele Lorenzo, Aldo Pietromatera, Nils Holzenberger

2025 EMNLP

Validate Your Authority: Benchmarking LLMs on Multi-Label Precedent Treatment Classification

M. Mikail Demir, M. Abdullah Canbaz

2025 EMNLP

ContractEval: Benchmarking LLMs for Clause-Level Legal Risk Identification in Commercial Contracts

Shuang Liu, Zelong Li, Ruoyun Ma et al.

2025 EMNLP

Contemporary LLMs struggle with extracting formal legal arguments

Lena Held, Ivan Habernal

2025 EMNLP

Aligning LLMs for Thai Legal Question Answering with Efficient Semantic-Similarity Rewards

Pawitsapak Akarajaradwong, Chompakorn Chaksangchaichot, Pirat Pothavorn et al.

2025 EMNLP

Are LLMs Court-Ready? Evaluating Frontier Models on Indian Legal Reasoning

Kush Juvekar, Arghya Bhattacharya, Sai Khadloya et al.

2025 EMNLP

Explanations explained. Influence of Free-text Explanations on LLMs and the Role of Implicit Knowledge

Andrea Zaninello, Roberto Dessi, Malvina Nissim et al.

2025 EMNLP

LLMs as annotators of argumentation

Anna Lindahl

2025 EMNLP

Beyond Human Judgment: A Bayesian Evaluation of LLMs’ Moral Values Understanding

Maciej Skorski, Alina Landowska

2025 EMNLP

On the Role of Unobserved Sequences on Sample-based Uncertainty Quantification for LLMs

Lucie Kunitomo-Jacquin, Edison Marrese-Taylor, Ken Fukuda

2025 EMNLP

Causal Understanding by LLMs: The Role of Uncertainty

Oscar William Lithgow-Serrano, Vani Kanjirangat, Alessandro Antonucci

2025 EMNLP

Read Your Own Mind: Reasoning Helps Surface Self-Confidence Signals in LLMs

Jakub Podolak, Rajeev Verma

2025 EMNLP

Probing Gender Bias in Multilingual LLMs: A Case Study of Stereotypes in Persian

Ghazal Kalhor, Behnam Bahrak

2025 EMNLP

ValueCompass: A Framework for Measuring Contextual Value Alignment Between Human and LLMs

Hua Shen, Tiffany Knearem, Reshmi Ghosh et al.

2025 EMNLP

Findings of the WMT 2025 Shared Task on Model Compression: Early Insights on Compressing LLMs for Machine Translation

Marco Gaido, Roman Grundkiewicz, Thamme Gowda et al.

2025 EMNLP

Findings of the WMT 2025 Shared Task LLMs with Limited Resources for Slavic Languages: MT and QA

Shu Okabe, Daryna Dementieva, Marion Di Marco et al.

2025 EMNLP

Marco Large Translation Model at WMT2025: Transforming Translation Capability in LLMs via Quality-Aware Training and Decoding

Hao Wang, Linlong Xu, Heng Liu et al.

2025 EMNLP

A* Decoding for Machine Translation in LLMs - SRPOL Participation in WMT2025

Adam Dobrowolski, Paweł Przewłocki, Paweł Przybysz et al.

2025 EMNLP

Tagged Span Annotation for Detecting Translation Errors in Reasoning LLMs

Taemin Yeom, Yonghyun Ryu, Yoonjung Choi et al.

2025 EMNLP

TartuNLP at WMT25 LLMs with Limited Resources for Slavic Languages Shared Task

Taido Purason, Mark Fishel

2025 EMNLP

JGU Mainz’s Submission to the WMT25 Shared Task on LLMs with Limited Resources for Slavic Languages: MT and QA

Hossain Shaikh Saadi, Minh Duc Bui, Mario Sanz-Guerrero et al.

2025 EMNLP

Fine-tuning NMT Models and LLMs for Specialised EN-ES Translation Using Aligned Corpora, Glossaries, and Synthetic Data: MULTITAN at WMT25 Terminology Shared Task

Lichao Zhu, Maria Zimina-Poirot, Cristian Valdez et al.

2025 EMNLP

Enhancing NeRF akin to Enhancing LLMs: Generalizable NeRF Transformer with Mixture-of-View-Experts

Wenyan Cong, Hanxue Liang, Peihao Wang et al.

2023 ICCV

Zeroth-Order Fine-Tuning of LLMs in Random Subspaces

Ziming Yu, Pan Zhou, Sike Wang et al.

2025 ICCV

Hints of Prompt: Enhancing Visual Representation for Multimodal LLMs in Autonomous Driving

Hao Zhou, Zhanning Gao, Zhili Chen et al.

2025 ICCV
