Papers
17,973 papers found
Can Out-of-Distribution Evaluations Uncover Reliance on Prediction Shortcuts? A Case Study in Question Answering
Michal Štefánik, Timothee Mickus, Michal Spiegel et al.
Can Prompts Rewind Time for LLMs? Evaluating the Effectiveness of Prompted Knowledge Cutoffs
Xin Gao, Ruiyi Zhang, Daniel Du et al.
Can Role Vectors Affect LLM Behaviour?
Daniele Potertì, Andrea Seveso, Fabio Mercorio
Can Vision-Language Models Infer Speaker’s Ignorance? The Role of Visual and Linguistic Cues
Ye-eun Cho, Yunho Maeng
Can Vision-Language Models Solve Visual Math Equations?
Monjoy Narayan Choudhury, Junling Wang, Yifan Hou et al.
Can VLMs Recall Factual Associations From Visual References?
Dhananjay Ashok, Ashutosh Chaubey, Hirona Jacqueline Arai et al.
Can We Edit LLMs for Long-Tail Biomedical Knowledge?
Xinhao Yi, Jake Lever, Kevin Bryson et al.
Can We Steer Reasoning Direction by Thinking Intervention?
Xingsheng Zhang, Luxi Xing, Chen Zhang et al.
Can you SPLICE it together? A Human Curated Benchmark for Probing Visual Reasoning in VLMs
Mohamad Ballout, Okajevo Wilfred, Seyedalireza Yaghoubi et al.
Can You Trick the Grader? Adversarial Persuasion of LLM Judges
Yerin Hwang, Dongryeol Lee, Taegwan Kang et al.
CAPE: Context-Aware Personality Evaluation Framework for Large Language Models
Jivnesh Sandhan, Fei Cheng, Tushar Sandhan et al.
CAPSTONE: Composable Attribute‐Prompted Scene Translation for Zero‐Shot Vision–Language Reasoning
Md. Ismail Hossain, Shahriyar Zaman Ridoy, Moshiur Farazi et al.
Captioning for Text-Video Retrieval via Dual-Group Direct Preference Optimization
Ji Soo Lee, Byungoh Ko, Jaewon Cho et al.
Capturing Intra-Dialectal Variation in Qatari Arabic: A Corpus of Cultural and Gender Dimensions
Houda Bouamor, Sara Al-Emadi, Zeinab Ibrahim et al.
Capturing Latent Modal Association For Multimodal Entity Alignment
Yongquan Ji, Jingwei Cheng, Fu Zhang et al.
CARD: Cross-modal Agent Framework for Generative and Editable Residential Design
Pengyu Zeng, Jun Yin, Miao Zhang et al.
Cardiverse: Harnessing LLMs for Novel Card Game Prototyping
Danrui Li, Sen Zhang, Samuel S. Sohn et al.
CARE: A Disagreement Detection Framework with Concept Alignment and Reasoning Enhancement
Jiyuan Liu, Jielin Song, Yunhe Pang et al.
CARE: Multilingual Human Preference Learning for Cultural Awareness
Geyang Guo, Tarek Naous, Hiromi Wakaki et al.
CARFT: Boosting LLM Reasoning via Contrastive Learning with Annotated Chain-of-Thought-based Reinforced Fine-Tuning
Wenqiao Zhu, Ji Liu, Rongjunchen Zhang et al.
CARMA: Enhanced Compositionality in LLMs via Advanced Regularisation and Mutual Information Alignment
Nura Aljaafari, Danilo Carvalho, Andre Freitas
CARVQ: Corrective Adaptor with Group Residual Vector Quantization for LLM Embedding Compression
Dayin Gou, Sanghyun Byun, Nilesh Malpeddi et al.
Case-Based Decision-Theoretic Decoding with Quality Memories
Hiroyuki Deguchi, Masaaki Nagata
cAST: Enhancing Code Retrieval-Augmented Generation with Structural Chunking via Abstract Syntax Tree
Yilin Zhang, Xinran Zhao, Zora Zhiruo Wang et al.
Castle: Causal Cascade Updates in Relational Databases with Large Language Models
Yongye Su, Yucheng Zhang, Zeru Shi et al.