Papers
TReX: Tokenizer Regression for Optimal Data Mixture
Inho Won, Hangyeol Yoo, Minkyung Cho et al.
Trove: A Flexible Toolkit for Dense Retrieval
Reza Esfandiarpoor, Max Zuo, Stephen Bach
TruthTrap: A Bilingual Benchmark for Evaluating Factually Correct Yet Misleading Information in Question Answering
Mohammadamin Shafiei, Hamidreza Saffari, Mohammad Taher Pilehvar et al.
Tug-of-war between idioms’ figurative and literal interpretations in LLMs
Soyoung Oh, Xinting Huang, Mathis Pink et al.
TUNE: A Task For Turkish Machine Unlearning For Data Privacy
Doruk Benli, Ada Canoğlu, Nehir İlkim Gönençer et al.
TurkBench: A Benchmark for Evaluating Turkish Large Language Models
Cagri Toraman, Ahmet Kaan Sever, Ayşe Aysu Cengiz et al.
Turn-PPO: Turn-Level Advantage Estimation with PPO for Improved Multi-Turn RL in Agentic LLMs
Junbo Li, Peng Zhou, Rui Meng et al.
Two Birds with One Stone: Annotating Romanian Multiword Expressions with an Eye to the PARSEME 2.0 Guidelines Applicability
Verginica Mititelu, Mihaela Cristescu, Elena Irimia et al.
Ukrainian Multiword Expressions Corpus: Creation, Annotation, and Linguistic Analysis
Hanna Sytar, Maria Shvedova, Olha Kanishcheva
Ultra-Low-Dimensional Prompt Tuning via Random Projection
Zijun Wu, Yongchang Hao, Lili Mou
U-MIRAGE: Benchmarking Chain-of-Thought Reasoning for Urdu Medical QA
Ali Faheem, Faizad Ullah, Muhammad Hammad et al.
Uncertainty Quantification for Evaluating Gender Bias in Machine Translation
Ieva Staliunaite, Julius Cheng, Andreas Vlachos
Uncovering Hidden Correctness in LLM Causal Reasoning via Symbolic Verification
Paul He, Yinya Huang, Mrinmaya Sachan et al.
Under-resourced studies of under-resourced languages: lemmatization and POS-tagging with LLM annotators for historical Armenian, Georgian, Greek and Syriac
Chahan Vidal-Gorène, Bastien Kindt, Florian Cafiero
Understanding Jailbreak Success: A Study of Latent Space Dynamics in Large Language Models
Sarah Ball, Frauke Kreuter, Nina Panickssery
UniBO at MWE-2026 PARSEME 2.0 Subtask 2: A Cross-lingual Approach to Multiword Expression Paraphrasing
Debora Ciminari, Alberto Barrón-Cedeño
Unified Multimodal Interleaved Document Representation for Retrieval
Jaewoo Lee, Joonho Ko, Jinheon Baek et al.
Unintended Memorization of Sensitive Information in Fine-Tuned Language Models
Marton Szep, Jorge Marin Ruiz, Georgios Kaissis et al.
UniToolBench: A Benchmark for Tool-Augmented LLMs in Cross-Domain, Universal Task Automation
Xiaojie Guo, Yang Zhang, Bing Zhang et al.
Unleashing the Unseen: Harnessing Benign Datasets for Jailbreaking Large Language Models
Wei Zhao, Zhe Li, Yige Li et al.
Unlocking Large Audio-Language Models for Interactive Language Learning
Hongfu Liu, Zhouying Cui, Xiangming Gu et al.
Unlocking Latent Discourse Translation in LLMs Through Quality-Aware Decoding
Wafaa Mohammed, Vlad Niculae, Chrysoula Zerva
Unmasking the Factual-Conceptual Gap in Persian Language Models
Alireza Sakhaeirad, Ali Ma'manpoosh, Arshia Hemmat
Unraveling LLM Jailbreaks Through Safety Knowledge Neurons
Chongwen Zhao, Yutong Ke, Kaizhu Huang
UNSC-Bench: Evaluating LLM Diplomatic Role-Playing Through UN Security Council Vote Prediction
Ayush Nangia, Aman Gokrani, Ruggero Marino Lazzaroni