Papers
Can ChatGPT Assess Human Personalities? A General Evaluation Framework
Haocong Rao, Cyril Leung, Chunyan Miao
Can ChatGPT Defend its Belief in Truth? Evaluating LLM Reasoning via Debate
Boshi Wang, Xiang Yue, Huan Sun
Can ChatGPT Perform Reasoning Using the IRAC Method in Analyzing Legal Scenarios Like a Lawyer?
Xiaoxi Kang, Lizhen Qu, Lay-Ki Soon et al.
Can Foundation Models Watch, Talk and Guide You Step by Step to Make a Cake?
Yuwei Bao, Keunwoo Yu, Yichi Zhang et al.
Can Language Models Be Tricked by Language Illusions? Easier with Syntax, Harder with Semantics
Yuhan Zhang, Edward Gibson, Forrest Davis
Can Language Models Laugh at YouTube Short-form Videos?
Dayoon Ko, Sangho Lee, Gunhee Kim
Can language models learn analogical reasoning? Investigating training objectives and comparisons to human performance
Molly Petersen, Lonneke van der Plas
Can Language Models Understand Physical Concepts?
Lei Li, Jingjing Xu, Qingxiu Dong et al.
Can Large Language Models Capture Dissenting Human Voices?
Noah Lee, Na Min An, James Thorne
Can Large Language Models Fix Data Annotation Errors? An Empirical Study Using Debatepedia for Query-Focused Text Summarization
Md Tahmid Rahman Laskar, Mizanur Rahman, Israt Jahan et al.
Can LLMs Facilitate Interpretation of Pre-trained Language Models?
Basel Mousi, Nadir Durrani, Fahim Dalvi
Can LMs Generalize to Future Data? An Empirical Analysis on Text Summarization
Chi Cheang, Hou Chan, Derek Wong et al.
Can Pre-trained Vision and Language Models Answer Visual Information-Seeking Questions?
Yang Chen, Hexiang Hu, Yi Luan et al.
Can Retriever-Augmented Language Models Reason? The Blame Game Between the Retriever and the Language Model
Parishad BehnamGhader, Santiago Miret, Siva Reddy
Can training neural language models on a curriculum with developmentally plausible data improve alignment with human reading behavior?
Aryaman Chobey, Oliver Smith, Anzi Wang et al.
Can We Edit Factual Knowledge by In-Context Learning?
Ce Zheng, Lei Li, Qingxiu Dong et al.
Can We Edit Multimodal Large Language Models?
Siyuan Cheng, Bozhong Tian, Qingbin Liu et al.
Can Word Sense Distribution Detect Semantic Changes of Words?
Xiaohang Tang, Yi Zhou, Taichi Aida et al.
Can You Follow Me? Testing Situational Understanding for ChatGPT
Chenghao Yang, Allyson Ettinger
Can you Summarize my learnings? Towards Perspective-based Educational Dialogue Summarization
Raghav Jain, Tulika Saha, Jhagrut Lalwani et al.
CAPIVARA: Cost-Efficient Approach for Improving Multilingual CLIP Performance on Low-Resource Languages
Gabriel Oliveira dos Santos, Diego Alysson Braga Moreira, Alef Iury Ferreira et al.
CAPSTONE: Curriculum Sampling for Dense Retrieval with Document Expansion
Xingwei He, Yeyun Gong, A-Long Jin et al.
CAR: Conceptualization-Augmented Reasoner for Zero-Shot Commonsense Question Answering
Weiqi Wang, Tianqing Fang, Wenxuan Ding et al.
CarExpert: Leveraging Large Language Models for In-Car Conversational Question Answering
Md Rashad Al Hasan Rony, Christian Suess, Sinchana Ramakanth Bhat et al.
CASE: Commonsense-Augmented Score with an Expanded Answer Space
Wenkai Chen, Sahithya Ravi, Vered Shwartz