Papers
175 papers found
Can Large Language Models Safely Address Patient Questions Following Cataract Surgery?
Mohita Chowdhury, Ernest Lim, Aisling Higham et al.
Multi-Task Inference: Can Large Language Models Follow Multiple Instructions at Once?
Guijin Son, SangWon Baek, Sangdae Nam et al.
Can Large Language Models Interpret Noun-Noun Compounds? A Linguistically-Motivated Study on Lexicalized and Novel Compounds
Giulia Rambelli, Emmanuele Chersoni, Claudia Collacciani et al.
Living in the Moment: Can Large Language Models Grasp Co-Temporal Reasoning?
Zhaochen Su, Juntao Li, Jun Zhang et al.
Can Large Language Models be Good Emotional Supporter? Mitigating Preference Bias on Emotional Support Conversation
Dongjin Kang, Sunghwan Kim, Taeyoon Kwon et al.
Can Large Language Models Mine Interpretable Financial Factors More Effectively? A Neural-Symbolic Factor Mining Agent Model
Zhiwei Li, Ran Song, Caihong Sun et al.
Can Large Language Model Summarizers Adapt to Diverse Scientific Communication Goals?
Marcio Fonseca, Shay Cohen
How can large language models become more human?
Daphne Wang, Mehrnoosh Sadrzadeh, Miloš Stanojević et al.
Human Speech Perception in Noise: Can Large Language Models Paraphrase to Improve It?
Anupama Chingacham, Miaoran Zhang, Vera Demberg et al.
Can Large Language Models Understand Internet Buzzwords Through User-Generated Content
Chen Huang, Junkai Luo, Xinzuo Wang et al.
Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning?
Yancheng He, Shilong Li, Jiaheng Liu et al.
LLM Meets Scene Graph: Can Large Language Models Understand and Generate Scene Graphs? A Benchmark and Empirical Study
Dongil Yang, Minjin Kim, Sunghwan Kim et al.
Can Large Language Models Accurately Generate Answer Keys for Health-related Questions?
Davis Bartels, Deepak Gupta, Dina Demner-Fushman
Can Large Language Models Address Open-Target Stance Detection?
Abu Ubaida Akash, Ahmed Fahmy, Amine Trabelsi
Can Large Language Models Understand Argument Schemes?
Elfia Bezou-Vrakatseli, Oana Cocarascu, Sanjay Modgil
Can Large Language Models Classify and Generate Antimicrobial Resistance Genes?
Hyunwoo Yoo, Haebin Shin, Gail Rosen
Can Large Language Models Automatically Score Proficiency of Written Essays?
Watheq Ahmad Mansour, Salam Albatarni, Sohaila Eltanbouly et al.
Can Large Language Models Discern Evidence for Scientific Hypotheses? Case Studies in the Social Sciences
Sai Koneru, Jian Wu, Sarah Rajtmajer
Can Large Language Models Learn Translation Robustness from Noisy-Source In-context Demonstrations?
Leiyu Pan, Yongqi Leng, Deyi Xiong
CodeJudge-Eval: Can Large Language Models be Good Judges in Code Understanding?
Yuwei Zhao, Ziyang Luo, Yuchen Tian et al.
How Well Can Large Language Models Reflect? A Human Evaluation of LLM-generated Reflections for Motivational Interviewing Dialogues
Erkan Basar, Xin Sun, Iris Hendrickx et al.
Can Large Language Models Understand You Better? An MBTI Personality Detection Dataset Aligned with Population Traits
Bohan Li, Jiannan Guan, Longxu Dou et al.
Can Large Language Models Differentiate Harmful from Argumentative Essays? Steps Toward Ethical Essay Scoring
Hongjin Kim, Jeonghyun Kang, Harksoo Kim
Can Large Language Models perform Relation-based Argument Mining?
Deniz Gorur, Antonio Rago, Francesca Toni