Papers
FinDebate: Multi-Agent Collaborative Intelligence for Financial Analysis
Tianshi Cai, Guanxu Li, Nijia Han et al.
Finding Diamonds in Conversation Haystacks: A Benchmark for Conversational Data Retrieval
Yohan Lee, Yongwoo Song, Sangyeop Kim
Findings of the BlackboxNLP 2025 Shared Task: Localizing Circuits and Causal Variables in Language Models
Dana Arad, Yonatan Belinkov, Hanjie Chen et al.
Findings of the First Shared Task for Creole Language Machine Translation at WMT25
Nathaniel Robinson, Claire Bizon Monroc, Rasul Dent et al.
Findings of the Fourth Shared Task on Multilingual Coreference Resolution: Can LLMs Dethrone Traditional Approaches?
Michal Novák, Miloslav Konopik, Anna Nedoluzhko et al.
Findings of the Third BabyLM Challenge: Accelerating Language Modeling Research with Cognitively Plausible Data
Lucas Charpentier, Leshem Choshen, Ryan Cotterell et al.
Findings of the TSAR 2025 Shared Task on Readability-Controlled Text Simplification
Fernando Alva-Manchego, Regina Stodden, Joseph Marvin Imperial et al.
Findings of the WMT 2025 Shared Task LLMs with Limited Resources for Slavic Languages: MT and QA
Shu Okabe, Daryna Dementieva, Marion Di Marco et al.
Findings of the WMT 2025 Shared Task of the Open Language Data Initiative
David Dale, Laurie Burchell, Jean Maillard et al.
Findings of the WMT 2025 Shared Task on Model Compression: Early Insights on Compressing LLMs for Machine Translation
Marco Gaido, Roman Grundkiewicz, Thamme Gowda et al.
Findings of the WMT25 General Machine Translation Shared Task: Time to Stop Evaluating on Easy Test Sets
Tom Kocmi, Ekaterina Artemova, Eleftherios Avramidis et al.
Findings of the WMT25 Multilingual Instruction Shared Task: Persistent Hurdles in Reasoning, Generation, and Evaluation
Tom Kocmi, Sweta Agrawal, Ekaterina Artemova et al.
Findings of the WMT25 Shared Task on Automated Translation Evaluation Systems: Linguistic Diversity is Challenging and References Still Help
Alon Lavie, Greg Hanneman, Sweta Agrawal et al.
Findings of the WMT25 Terminology Translation Task: Terminology is Useful Especially for Good MTs
Kirill Semenov, Xu Huang, Vilém Zouhar et al.
Findings of WMT 2025 Shared Task on Low-resource Indic Languages Translation
Partha Pakray, Reddi Krishna, Santanu Pal et al.
Finding your MUSE: Mining Unexpected Solutions Engine
Nir Sweed, Hanit Hakim, Ben Wolfson et al.
Fine-Grained Evaluation of English-Russian MT in 2025: Linguistic Challenges Mirroring Human Translator Training
Shushen Manakhimova, Maria Kunilovskaya, Ekaterina Lapshinova-Koltunski et al.
Fine-Grained Manipulation of Arithmetic Neurons
Wenyu Du, Rui Zheng, Tongxu Luo et al.
Fine-Tuned Llama for Multilingual Text-to-Text Coreference Resolution
Jakub Hejman, Ondrej Prazak, Miloslav Konopík
Fine-Tuned Thoughts: Leveraging Chain-of-Thought Reasoning for Industrial Asset Health Monitoring
Shuxin Lin, Dhaval C Patel, Christodoulos Constantinides
Fine-Tuning Encoder-Decoder Models with Contrastive Learning for In-Context Distractor Generation
Elaf Alhazmi, Quan Z. Sheng, Wei Emma Zhang et al.
Finetuning LLMs for Human Behavior Prediction in Social Science Experiments
Akaash Kolluri, Shengguang Wu, Joon Sung Park et al.
Fine-tuning LLMs with Cross-Attention-based Weight Decay for Bias Mitigation
Farsheed Haque, Zhe Fu, Depeng Xu et al.
Fine-tuning NMT Models and LLMs for Specialised EN-ES Translation Using Aligned Corpora, Glossaries, and Synthetic Data: MULTITAN at WMT25 Terminology Shared Task
Lichao Zhu, Maria Zimina-Poirot, Cristian Valdez et al.
Fine-tuning XLM-RoBERTa for Named Entity Recognition in Kurmanji Kurdish
Akam Nawzad, Hossein Hassani