Papers
17,973 papers found
CAFE: Retrieval Head-based Coarse-to-Fine Information Seeking to Enhance Multi-Document QA Capability
Han Peng, Jinhao Jiang, Zican Dong et al.
CafGa: Customizing Feature Attributions to Explain Language Models
Alan David Boyle, Furui Cheng, Vilém Zouhar et al.
CAIR: Counterfactual-based Agent Influence Ranker for Agentic AI Workflows
Amit Giloni, Chiara Picardi, Roy Betser et al.
CaKE: Circuit-aware Editing Enables Generalizable Knowledge Learners
Yunzhi Yao, Jizhan Fang, Jia-Chen Gu et al.
Calibrating Language Models for Neural Ranking under Noisy Supervision with Relaxed Labels
Arnab Sharma, Daniel Vollmers, Axel-Cyrille Ngonga Ngomo
Calibrating LLM Confidence by Probing Perturbed Representation Stability
Reza Khanmohammadi, Erfan Miahi, Mehrsa Mardikoraem et al.
Calibrating LLMs for Text-to-SQL Parsing by Leveraging Sub-clause Frequencies
Terrance Liu, Shuyi Wang, Daniel Preotiuc-Pietro et al.
Calibrating Pseudo-Labeling with Class Distribution for Semi-supervised Text Classification
Weiyi Yang, Richong Zhang, Junfan Chen et al.
Calibrating Verbal Uncertainty as a Linear Feature to Reduce Hallucinations
Ziwei Ji, Lei Yu, Yeskendir Koishekenov et al.
Calibration Across Layers: Understanding Calibration Evolution in LLMs
Abhinav Joshi, Areeb Ahmad, Ashutosh Modi
Calibration as a Proxy for Fairness and Efficiency in a Perspectivist Ensemble Approach to Irony Detection
Samuel B. Jesus, Guilherme Dal Bianco, Wanderlei Junior et al.
CalligraphicOCR for Chinese Calligraphy Recognition
Xiaoyi Bao, Zhongqing Wang, Jinghang Gu et al.
CaMMT: Benchmarking Culturally Aware Multimodal Machine Translation
Emilio Villa-Cueva, Sholpan Bolatzhanova, Diana Turmakhan et al.
Can an Individual Manipulate the Collective Decisions of Multi-Agents?
Fengyuan Liu, Rui Zhao, Shuo Chen et al.
Can Code-Switched Texts Activate a Knowledge Switch in LLMs? A Case Study on English-Korean Code-Switching
Seoyeon Kim, Huiseo Kim, Chanjun Park et al.
CANDY: Benchmarking LLMs’ Limitations and Assistive Potential in Chinese Misinformation Fact-Checking
Ruiling Guo, Xinwei Yang, Chen Huang et al.
Can Federated Learning Safeguard Private Data in LLM Training? Vulnerabilities, Attacks, and Defense Evaluation
Wenkai Guo, Xuefeng Liu, Haolin Wang et al.
Can GRPO Boost Complex Multimodal Table Understanding?
Xiaoqiang Kang, Shengen Wu, Zimu Wang et al.
Can Language Models Follow Multiple Turns of Entangled Instructions?
Chi Han, Xin Liu, Haodong Wang et al.
Can Language Neuron Intervention Reduce Non-Target Language Output?
Suchun Xie, Hwichan Kim, Shota Sasaki et al.
Can Large Language Models Act as Ensembler for Multi-GNNs?
Hanqi Duan, Yao Cheng, Jianxiang Yu et al.
Can Large Language Models be Effective Online Opinion Miners?
Ryang Heo, Yongsik Seo, Junseong Lee et al.
Can Large Language Models Be Good Language Teachers?
LiQing Xu, Qiwei Li, Tianshuo Peng et al.
Can Large Language Models Identify Implicit Suicidal Ideation? An Empirical Evaluation
Tong Li, Shu Yang, Junchao Wu et al.
Can Large Language Models Outperform Non-Experts in Poetry Evaluation? A Comparative Study Using the Consensual Assessment Technique
Piotr Sawicki, Marek Grzes, Dan Brown et al.