Papers
FB-Bench: A Fine-Grained Multi-Task Benchmark for Evaluating LLMs’ Responsiveness to Human Feedback
Youquan Li, Miao Zheng, Fan Yang et al.
FC-Attack: Jailbreaking Multimodal Large Language Models via Auto-Generated Flowcharts
Ziyi Zhang, Zhen Sun, Zongmin Zhang et al.
Feature Engineering is not Dead: A Step Towards State of the Art for Arabic Automated Essay Scoring
Marwan Sayed, Sohaila Eltanbouly, May Bashendy et al.
Feature Extraction and Steering for Enhanced Chain-of-Thought Reasoning in Language Models
Zihao Li, Xu Wang, Yuzhe Yang et al.
FedCoT: Federated Chain-of-Thought Distillation for Large Language Models
Tao Fan, Weijing Chen, Yan Kang et al.
Federated Retrieval-Augmented Generation: A Systematic Mapping Study
Abhijit Chakraborty, Chahana Dahal, Vivek Gupta
FedMABench: Benchmarking Mobile GUI Agents on Decentralized Heterogeneous User Data
WenHao Wang, Zijie Yu, Rui Ye et al.
Feeding Two Birds or Favoring One? Adequacy–Fluency Tradeoffs in Evaluation and Meta-Evaluation of Machine Translation
Behzad Shayegh, Jan-Thorsten Peter, David Vilar et al.
“Feels Feminine to Me”: Understanding Perceived Gendered Style through Human Annotations
Hongyu Chen, Neele Falk, Michael Roth et al.
Feel the Difference? A Comparative Analysis of Emotional Arcs in Real and LLM-Generated CBT Sessions
Xiaoyi Wang, Jiwei Zhang, Guangtao Zhang et al.
Ferret: Faster and Effective Automated Red Teaming with Reward-Based Scoring Technique
Tej Deep Pala, Vernon Toh, Rishabh Bhardwaj et al.
FESTA: Functionally Equivalent Sampling for Trust Assessment of Multimodal LLMs
Debarpan Bhattacharya, Apoorva Kulkarni, Sriram Ganapathy
Few-Shot Coreference Resolution with Semantic Difficulty Metrics and In-Context Learning
Nguyen Xuan Phuc, Dang Van Thin
Few-Shot Learning Translation from New Languages
Carlos Mullov, Alexander Waibel
Few-Shot Multilingual Coreference Resolution Using Long-Context Large Language Models
Moiz Sajid, Seemab Latif, Zuhair Zafar et al.
Few-Shot Open-Set Classification via Reasoning-Aware Decomposition
Avyav Kumar Singh, Helen Yannakoudakis
FG-PRM: Fine-grained Hallucination Detection and Mitigation in Language Model Mathematical Reasoning
Ruosen Li, Ziming Luo, Xinya Du
FicSim: A Dataset for Multi-Faceted Semantic Similarity in Long-Form Fiction
Natasha Johnson, Amanda Bertsch, Maria-Emil Deal et al.
FIER: Fine-Grained and Efficient KV Cache Retrieval for Long-context LLM Inference
Dongwei Wang, Zijie Liu, Song Wang et al.
FigEx: Aligned Extraction of Scientific Figures and Captions
Jifeng Song, Arun Das, Ge Cui et al.
FilBench: Can LLMs Understand and Generate Filipino?
Lester James Validad Miranda, Elyanah Aco, Conner G. Manuel et al.
FillerSpeech: Towards Human-Like Text-to-Speech Synthesis with Filler Insertion and Filler Style Control
Seung-Bin Kim, Jun-Hyeok Cha, Hyung-Seok Oh et al.
Filling the Gap for Uzbek: Creating Translation Resources for Southern Uzbek
Mukhammadsaid Mamasaidov, Azizullah Aral, Abror Shopulatov et al.
Financial Risk Relation Identification through Dual-view Adaptation
Wei-Ning Chiu, Yu-Hsiang Wang, Andy Hsiao et al.
FinCoT: Grounding Chain-of-Thought in Expert Financial Reasoning
Natapong Nitarach, Warit Sirichotedumrong, Panop Pitchayarthorn et al.