Research Explorer

Can LLMs Rank the Harmfulness of Smaller LLMs? We are Not There Yet

Berk Atil, Vipul Gupta, Sarkar Snigdha Sarathi Das et al.

2025 ACL

Can LLMs Reason About Program Semantics? A Comprehensive Evaluation of LLMs on Formal Specification Inference

Thanh Le-Cong, Bach Le, Toby Murray

2025 ACL

Can LLMs Recognize Their Own Analogical Hallucinations? Evaluating Uncertainty Estimation for Analogical Reasoning

Zheng Chen, Zhaoxin Feng, Jianfei Ma et al.

2025 ACL

Can LLMs Reliably Simulate Real Students’ Abilities in Mathematics and Reading Comprehension?

KV Aditya Srivatsa, Kaushal Maurya, Ekaterina Kochmar

2025 ACL

Can LLMs Simulate L2-English Dialogue? An Information-Theoretic Analysis of L1-Dependent Biases

Rena Gao, Xuetong Wu, Tatsuki Kuribayashi et al.

2025 ACL

Can LLMs Understand Unvoiced Speech? Exploring EMG-to-Text Conversion with LLMs

Payal Mohapatra, Akash Pandey, Xiaoyuan Zhang et al.

2025 ACL

Can LLM Watermarks Robustly Prevent Unauthorized Knowledge Distillation?

Leyi Pan, Aiwei Liu, Shiyu Huang et al.

2025 ACL

Can Medical Vision-Language Pre-training Succeed with Purely Synthetic Data?

Che Liu, Zhongwei Wan, Haozhe Wang et al.

2025 ACL

Can MLLMs Understand the Deep Implication Behind Chinese Images?

Chenhao Zhang, Xi Feng, Yuelin Bai et al.

2025 ACL

Can Multimodal Foundation Models Understand Schematic Diagrams? An Empirical Study on Information-Seeking QA over Scientific Papers

Yilun Zhao, Chengye Wang, Chuhan Li et al.

2025 ACL

Can Multimodal Large Language Models Understand Spatial Relations?

Jingping Liu, Ziyan Liu, Zhedong Cen et al.

2025 ACL

Can Multi-turn Self-refined Single Agent LMs with Retrieval Solve Hard Coding Problems?

Md Tanzib Hosain, Md Kishor Morol

2025 ACL

Can Perplexity Predict Finetuning Performance? An Investigation of Tokenization Effects on Sequential Language Models for Nepali

Nishant Luitel, Nirajan Bekoju, Anand Kumar Sah et al.

2025 ACL

Can Prompting LLMs Unlock Hate Speech Detection across Languages? A Zero-shot and Few-shot Study

Faeze Ghorbanpour, Daryna Dementieva, Alexander Fraser

2025 ACL

Can Reasoning LLMs Synthesize Complex Climate Statements?

Yucheng Lu

2025 ACL

Can Stories Help LLMs Reason? Curating Information Space Through Narrative

Vahid Sadiri Javadi, Johanne Trippas, Yash Kumar Lal et al.

2025 ACL

Can Third Parties Read Our Emotions?

Jiayi Li, Yingfan Zhou, Pranav Narayanan Venkit et al.

2025 ACL

Can’t See the Forest for the Trees: Benchmarking Multimodal Safety Awareness for Multimodal LLMs

Wenxuan Wang, Xiaoyuan Liu, Kuiyi Gao et al.

2025 ACL

Can Uniform Meaning Representation Help GPT-4 Translate from Indigenous Languages?

Shira Wein

2025 ACL

Can Vision-Language Models Evaluate Handwritten Math?

Oikantik Nath, Hanani Bathina, Mohammed Safi Ur Rahman Khan et al.

2025 ACL

Can Vision Language Models Understand Mimed Actions?

Hyundong Justin Cho, Spencer Lin, Tejas Srinivasan et al.

2025 ACL

Can VLMs Actually See and Read? A Survey on Modality Collapse in Vision-Language Models

Mong Yuan Sim, Wei Emma Zhang, Xiang Dai et al.

2025 ACL

Can We Further Elicit Reasoning in LLMs? Critic-Guided Planning with Retrieval-Augmentation for Solving Challenging Tasks

Xingxuan Li, Weiwen Xu, Ruochen Zhao et al.

2025 ACL

Can we Retrieve Everything All at Once? ARM: An Alignment-Oriented LLM-based Retrieval Method

Peter Baile Chen, Yi Zhang, Mike Cafarella et al.

2025 ACL

Can We Trust AI Doctors? A Survey of Medical Hallucination in Large Language and Large Vision-Language Models

Zhihong Zhu, Yunyan Zhang, Xianwei Zhuang et al.

2025 ACL