Papers
6,952 papers found
FaithBench: A Diverse Hallucination Benchmark for Summarization by Modern LLMs
Forrest Sheng Bao, Miaoran Li, Renyi Qu et al.
FaithfulPersona: Balancing Faithfulness and Personalization in Code Explanations through Self-Critique
Zhuang Luo, Yichuan Li, Zexing Xu et al.
Faithful, Unfaithful or Ambiguous? Multi-Agent Debate with Initial Stance for Summary Evaluation
Mahnaz Koupaee, Jake W. Vincent, Saab Mansour et al.
Familiarity: Better Evaluation of Zero-Shot Named Entity Recognition by Quantifying Label Shifts in Synthetic Training Data
Jonas Golde, Patrick Haller, Max Ploner et al.
Faster Machine Translation Ensembling with Reinforcement Learning and Competitive Correction
Kritarth Prasad, Mohammadi Zaki, Pratik Rakesh Singh et al.
Faux Polyglot: A Study on Information Disparity in Multilingual Large Language Models
Nikhil Sharma, Kenton Murray, Ziang Xiao
Fearful Falcons and Angry Llamas: Emotion Category Annotations of Arguments by Humans and LLMs
Lynn Greschner, Roman Klinger
Features that Make a Difference: Leveraging Gradients for Improved Dictionary Learning
Jeffrey Olmo, Jared Wilson, Max Forsey et al.
FedSpaLLM: Federated Pruning of Large Language Models
Guangji Bai, Yijiang Li, Zilinghan Li et al.
Few-shot Personalization of LLMs with Mis-aligned Responses
Jaehyung Kim, Yiming Yang
FIDELITY: Fine-grained Interpretable Distillation for Effective Language Insights and Topic Yielding
Divyansh Singh, Brodie Mather, Demi Zhang et al.
Fighting Spurious Correlations in Text Classification via a Causal Learning Perspective
Yuqing Zhou, Ziwei Zhu
Finding-Centric Structuring of Japanese Radiology Reports and Analysis of Performance Gaps for Multiple Facilities
Yuki Tagawa, Yohei Momoki, Norihisa Nakano et al.
Finding Common Patterns in Domestic Violence Stories Posted on Reddit
Mohammad Shokri, Emily Klapper, Jason Shan et al.
Findings of the AmericasNLP 2025 Shared Tasks on Machine Translation, Creation of Educational Material, and Translation Metrics for Indigenous Languages of the Americas
Ona De Gibert, Robert Pugh, Ali Marashian et al.
Findings of the Shared Task on Abusive Tamil and Malayalam Text Targeting Women on Social Media: DravidianLangTech@NAACL 2025
Saranya Rajiakodi, Bharathi Raja Chakravarthi, Shunmuga Priya Muthusamy Chinnan et al.
Findings of the Shared Task on Misogyny Meme Detection: DravidianLangTech@NAACL 2025
Bharathi Raja Chakravarthi, Rahul Ponnusamy, Saranya Rajiakodi et al.
Find the Intention of Instruction: Comprehensive Evaluation of Instruction Understanding for Large Language Models
Hyeonseok Moon, Jaehyung Seo, Seungyoon Lee et al.
FiNE: Filtering and Improving Noisy Data Elaborately with Large Language Models
Junliang He, Ziyue Fan, Shaohui Kuang et al.
Fine-Grained and Multi-Dimensional Metrics for Document-Level Machine Translation
Yirong Sun, Dawei Zhu, Yanjun Chen et al.
Fine-grained Fallacy Detection with Human Label Variation
Alan Ramponi, Agnese Daffara, Sara Tonelli
Fine-Grained Transfer Learning for Harmful Content Detection through Label-Specific Soft Prompt Tuning
Faeze Ghorbanpour, Viktor Hangya, Alexander Fraser
Fine-Tuned LLMs are “Time Capsules” for Tracking Societal Bias Through Books
Sangmitra Madhusudan, Robert Morabito, Skye Reid et al.
Fine-Tuning Large Language Models with Sequential Instructions
Hanxu Hu, Simon Yu, Pinzhen Chen et al.