Papers

214 papers found
One Agent To Rule Them All: Towards Multi-agent Conversational AI
Christopher Clarke, Joseph Peper, Karthik Krishnamurthy et al.
2022 ACL
Pre-trained language models evaluating themselves - A comparative study
Philipp Koch, Matthias Aßenmacher, Christian Heumann
2022 ACL
Interactive Concept Learning for Uncovering Latent Themes in Large Text Collections
Maria Leonor Pacheco, Tunazzina Islam, Lyle Ungar et al.
2023 ACL
Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them
Mirac Suzgun, Nathan Scales, Nathanael Schärli et al.
2023 ACL
2024 ACL
One Prompt To Rule Them All: LLMs for Opinion Summary Evaluation
Tejpalsingh Siledar, Swaroop Nath, Sankara Muddu et al.
2024 ACL
2024 ACL
2024 ACL
2024 ACL
HALoGEN: Fantastic LLM Hallucinations and Where to Find Them
Abhilasha Ravichander, Shrusti Ghela, David Wadden et al.
2025 ACL
Conspiracy Theories and Where to Find Them on TikTok
Francesco Corso, Francesco Pierri, Gianmarco De Francisci Morales
2025 ACL
ONEBench to Test Them All: Sample-Level Benchmarking Over Open-Ended Capabilities
Adhiraj Ghosh, Sebastian Dziadzio, Ameya Prabhu et al.
2025 ACL
Low-Perplexity LLM-Generated Sequences and Where To Find Them
Arthur Wuhrmann, Andrei Kucharavy, Anastasiia Kucherenko
2025 ACL
2025 ACL