Research Explorer

One Agent To Rule Them All: Towards Multi-agent Conversational AI

Christopher Clarke, Joseph Peper, Karthik Krishnamurthy et al.

2022 ACL

Pre-trained language models evaluating themselves - A comparative study

Philipp Koch, Matthias Aßenmacher, Christian Heumann

2022 ACL

Fantastic Expressions and Where to Find Them: Chinese Simile Generation with Multiple Constraints

Kexin Yang, Dayiheng Liu, Wenqiang Lei et al.

2023 ACL

We Understand Elliptical Sentences, and Language Models should Too: A New Dataset for Studying Ellipsis and its Interaction with Thematic Fit

Davide Testa, Emmanuele Chersoni, Alessandro Lenci

2023 ACL

“Knowledge is Power”: Constructing Knowledge Graph of Abdominal Organs and Using Them for Automatic Radiology Report Generation

Kaveri Kale, Pushpak Bhattacharyya, Aditya Shetty et al.

2023 ACL

Interactive Concept Learning for Uncovering Latent Themes in Large Text Collections

Maria Leonor Pacheco, Tunazzina Islam, Lyle Ungar et al.

2023 ACL

Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them

Mirac Suzgun, Nathan Scales, Nathanael Schärli et al.

2023 ACL

“Female Astronaut: Because sandwiches won’t make themselves up there”: Towards Multimodal misogyny detection in memes

Smriti Singh, Amritha Haridasan, Raymond Mooney

2023 ACL

ItD: Large Language Models Can Teach Themselves Induction through Deduction

Wangtao Sun, Haotian Xu, Xuanqing Yu et al.

2024 ACL

One Prompt To Rule Them All: LLMs for Opinion Summary Evaluation

Tejpalsingh Siledar, Swaroop Nath, Sankara Muddu et al.

2024 ACL

How Johnny Can Persuade LLMs to Jailbreak Them: Rethinking Persuasion to Challenge AI Safety by Humanizing LLMs

Yi Zeng, Hongpeng Lin, Jingwen Zhang et al.

2024 ACL

Disambiguate Words like Composing Them: A Morphology-Informed Approach to Enhance Chinese Word Sense Disambiguation

Yue Wang, Qiliang Liang, Yaqi Yin et al.

2024 ACL

Language Models can Evaluate Themselves via Probability Discrepancy

Tingyu Xia, Bowen Yu, Yuan Wu et al.

2024 ACL

Strong hallucinations from negation and how to fix them

Nicholas Asher, Swarnadeep Bhar

2024 ACL

LLMs cannot find reasoning errors, but can correct them given the error location

Gladys Tyen, Hassan Mansoor, Victor Carbune et al.

2024 ACL

Fantastic Semantics and Where to Find Them: Investigating Which Layers of Generative LLMs Reflect Lexical Semantics

Zhu Liu, Cunliang Kong, Ying Liu et al.

2024 ACL

HALoGEN: Fantastic LLM Hallucinations and Where to Find Them

Abhilasha Ravichander, Shrusti Ghela, David Wadden et al.

2025 ACL

Conspiracy Theories and Where to Find Them on TikTok

Francesco Corso, Francesco Pierri, Gianmarco De Francisci Morales

2025 ACL

ONEBench to Test Them All: Sample-Level Benchmarking Over Open-Ended Capabilities

Adhiraj Ghosh, Sebastian Dziadzio, Ameya Prabhu et al.

2025 ACL

Low-Perplexity LLM-Generated Sequences and Where To Find Them

Arthur Wuhrmann, Andrei Kucharavy, Anastasiia Kucherenko

2025 ACL

Building Japanese Creativity Benchmarks and Applying them to Enhance LLM Creativity

So Fukuda, Hayato Ogawa, Kaito Horio et al.

2025 ACL

Do Language Models Understand the Cognitive Tasks Given to Them? Investigations with the N-Back Paradigm

Xiaoyang Hu, Richard Lewis

2025 ACL

Explicit Bayesian Inference to Uncover the Latent Themes of Large Language Models

Raymond Li, Chuyuan Li, Gabriel Murray et al.

2025 ACL

Tell, Don’t Show: Leveraging Language Models’ Abstractive Retellings to Model Literary Themes

Li Lucy, Camilla Griffiths, Sarah Levine et al.

2025 ACL

Predicting Depression in Screening Interviews from Interactive Multi-Theme Collaboration

Xianbing Zhao, Yiqing Lyu, Di Wang et al.

2025 ACL

Papers