Papers
Can LLMs Reason Like Doctors? Exploring the Limits of Large Language Models in Complex Medical Reasoning
Flavio Merenda, Jose Manuel Gomez-Perez, German Rigau
Can LLMs reason over extended multilingual contexts? Towards long-context evaluation beyond retrieval over haystacks
Amey Hengle, Prasoon Bajpai, Soham Dan et al.
Can LLMs Translate Italy’s Language Varieties?
Edoardo Signoroni, Pavel Rychlý
Can LLMs Truly Embody Human Personality? Analyzing AI and Human Behavior Alignment in Dispute Resolution
Deuksin Kwon, Kaleen Shrestha, Bin Han et al.
Can MLLMs Find Their Way in a City? Exploring Emergent Navigation from Web-Scale Knowledge
Dwip Dalal, Utkarsh Mishra, Narendra Ahuja et al.
Can Models Help Us Create Better Models? Evaluating LLMs as Data Scientists
Michał Pietruszka, Łukasz Borchmann, Aleksander Jędrosz et al.
Can Molecular Evolution Mechanism Enhance Molecular Representation?
Kun Li, Longtao Hu, Jiameng Chen et al.
Can Protective Watermarking Safeguard the Copyright of 3D Gaussian Splatting?
Wenkai Huang, Yijia Guo, Gaolei Li et al.
Can Pseudo-Label Be More Reliable? A Simple yet Effective Topology-Aware Graph Self-Training Method
Gen Liu, Zhongying Zhao, Hui Zhou et al.
Can Reasoning Help Large Language Models Capture Human Annotator Disagreement?
Jingwei Ni, Yu Fan, Vilém Zouhar et al.
CANVAS: A Benchmark for Vision-Language Models on Tool-Based User Interface Design
Daeheon Jeong, Seoyeon Byun, Kihoon Son et al.
Can We Challenge Open-Vocabulary Object Detectors with Generated Content in Street Scenes?
Annika Mütze, Sadia Ilyas, Christian Dörpelkus et al.
Can you map it to English? The Role of Cross-Lingual Alignment in the Multilingual Performance of LLMs
Kartik Ravisankar, HyoJung Han, Sarah Wiegreffe et al.
Can You Tell the Difference? Contrastive Explanations for ABox Entailments
Patrick Koopmann, Yasir Mahmood, Axel-Cyrille Ngonga Ngomo et al.
Capacity Constraints Make Admissions Processes Less Predictable
Evan Dong, Nikhil Garg, Sarah Dean
CAPE: A CLIP-Aware Pointing Ensemble of Complementary Heatmap Cues for Embodied Reference Understanding
Fevziye Irem Eyiokur, Dogucan Yaman, Hazım Kemal Ekenel et al.
CapeNext: Rethinking and Refining Dynamic Support Information for Category-Agnostic Pose Estimation
Yu Zhu, Dan Zeng, Shuiwang Li et al.
CAPID: Context-Aware PII Detection for Question-Answering Systems
Mariia Ponomarenko, Sepideh Abedini, Masoumeh Shafieinejad et al.
CAPO: A Unified Policy Gradient Approach for Reward and Cost Optimization in Safe Reinforcement Learning (Student Abstract)
Xiaotao Liu, Prashant Mohit, Arvind Easwaran
CaPro: Curvilinear-aware Prompt Learning with Single Unlabeled Image for Cost-effective Curvilinear Structure Segmentation
Zhuangzhuang Chen, Qiangyu Chen, Chubin Ou et al.
Caption Anything in Video: Fine-grained Object-centric Captioning via Spatiotemporal Multimodal Prompting
Yolo Yunlong Tang, Jing Bi, Chao Huang et al.
Capturing Dynamic User Interests Under Modality Imbalance for Multimodal Sequential Recommendation
Zilong Li, Jia Zhu, Chenglei Huang et al.
Cards Against Contamination: TCG-Bench for Difficulty-Scalable Multilingual LLM Reasoning
Sultan AlRashed, Jianghui Wang, Francesco Orabona
CARE-Bench: A Benchmark of Diverse Client Simulations Guided by Expert Principles for Evaluating LLMs in Psychological Counseling
Bichen Wang, Yixin Sun, Junzhe Wang et al.