Papers
Findings of the IWSLT 2025 Evaluation Campaign
Idris Abdulmumin, Victor Agostinelli, Tanel Alumäe et al.
Finding the Sweet Spot: Preference Data Construction for Scaling Preference Optimization
Yao Xiao, Hai Ye, Linyao Chen et al.
FineCite: A Novel Approach For Fine-Grained Citation Context Analysis
Lasse M. Jantsch, Dong-Jae Koh, Seonghwan Yoon et al.
Fine-Grained Constraint Generation-Verification for Improved Instruction-Following
Zhixiang Liang, Zhenyu Hou, Xiao Wang
Fine-grained Knowledge Enhancement for Retrieval-Augmented Generation
Jingxuan Han, Zhendong Mao, Yi Liu et al.
Fine-grained Video Dubbing Duration Alignment with Segment Supervised Preference Optimization
Chaoqun Cui, Liangbin Huang, Shijing Wang et al.
FineReason: Evaluating and Improving LLMs’ Deliberate Reasoning through Reflective Puzzle Solving
Guizhen Chen, Weiwen Xu, Hao Zhang et al.
Fine-Tuned Transformer-Based Weighted Soft Voting Ensemble for Persuasion Technique Classification in Slavic Languages
Mahshar Yahan, Sakib Sarker, Mohammad Islam
Fine-Tune on the Format: First Improving Multiple-Choice Evaluation for Intermediate LLM Checkpoints
Alec Bunn, Sarah Wiegreffe, Ben Bogin
Fine-Tuning Large Language Models for Relation Extraction within a Retrieval-Augmented Generation Framework
Sefika Efeoglu, Adrian Paschke
Fine-tuning LLMs to Extract Epilepsy Seizure Frequency Data from Health Records
Ben Holgate, Joe Davies, Shichao Fang et al.
Fine-Tuning Lowers Safety and Disrupts Evaluation Consistency
Kathleen C. Fraser, Hillary Dawkins, Isar Nejadgholi et al.
Fine-Tuning on Diverse Reasoning Chains Drives Within-Inference CoT Refinement in LLMs
Haritz Puerto, Tilek Chubakov, Xiaodan Zhu et al.
Fine-Tuning vs Prompting Techniques for Gender-Fair Rewriting of Machine Translations
Paolo Mainardi, Federico Garcea, Alberto Barrón-Cedeño
Fine-tuning Whisper Tiny for Swahili ASR: Challenges and Recommendations for Low-Resource Speech Recognition
Avinash Kumar Sharma, Manas Pandya, Arpit Shukla
Finite State Automata Inside Transformers with Chain-of-Thought: A Mechanistic Study on State Tracking
Yifan Zhang, Wenyu Du, Dongming Jin et al.
FINKRX: Establishing Best Practices for Korean Financial NLP
Guijin Son, Hyunwoo Ko, Hanearl Jung et al.
FinMME: Benchmark Dataset for Financial Multi-Modal Reasoning Evaluation
Junyu Luo, Zhizhuo Kou, Liming Yang et al.
FinRipple: Aligning Large Language Models with Financial Market for Event Ripple Effect Awareness
Yuanjian Xu, Jianing Hao, Kunsheng Tang et al.
FiRC-NLP at SemEval-2025 Task 11: To Prompt or to Fine-Tune? Approaches for Multilingual Emotion Classification
Wondimagegnhue Tufa, Fadi Hassan, Evgenii Migaev et al.
FiRC-NLP at SemEval-2025 Task 3: Exploring Prompting Approaches for Detecting Hallucinations in LLMs
Wondimagegnhue Tufa, Fadi Hassan, Guillem Collell et al.
Firefly Team at SemEval-2025 Task 8: Question-Answering over Tabular Data using SQL/Python generation with Closed-Source Large Language Models
Ho Thuy Nga, Ho Thi Thanh Tuyen, Le Minh Hung et al.
Firm or Fickle? Evaluating Large Language Models Consistency in Sequential Interactions
Yubo Li, Yidi Miao, Xueying Ding et al.
First-AID: the first Annotation Interface for grounded Dialogues
Stefano Menini, Daniel Russo, Alessio Palmero Aprosio et al.