Papers
214 papers found
One Agent To Rule Them All: Towards Multi-agent Conversational AI
Christopher Clarke, Joseph Peper, Karthik Krishnamurthy et al.
Pre-trained language models evaluating themselves - A comparative study
Philipp Koch, Matthias Aßenmacher, Christian Heumann
Fantastic Expressions and Where to Find Them: Chinese Simile Generation with Multiple Constraints
Kexin Yang, Dayiheng Liu, Wenqiang Lei et al.
We Understand Elliptical Sentences, and Language Models should Too: A New Dataset for Studying Ellipsis and its Interaction with Thematic Fit
Davide Testa, Emmanuele Chersoni, Alessandro Lenci
“Knowledge is Power”: Constructing Knowledge Graph of Abdominal Organs and Using Them for Automatic Radiology Report Generation
Kaveri Kale, Pushpak Bhattacharyya, Aditya Shetty et al.
Interactive Concept Learning for Uncovering Latent Themes in Large Text Collections
Maria Leonor Pacheco, Tunazzina Islam, Lyle Ungar et al.
Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them
Mirac Suzgun, Nathan Scales, Nathanael Schärli et al.
“Female Astronaut: Because sandwiches won’t make themselves up there”: Towards Multimodal misogyny detection in memes
Smriti Singh, Amritha Haridasan, Raymond Mooney
ItD: Large Language Models Can Teach Themselves Induction through Deduction
Wangtao Sun, Haotian Xu, Xuanqing Yu et al.
One Prompt To Rule Them All: LLMs for Opinion Summary Evaluation
Tejpalsingh Siledar, Swaroop Nath, Sankara Muddu et al.
How Johnny Can Persuade LLMs to Jailbreak Them: Rethinking Persuasion to Challenge AI Safety by Humanizing LLMs
Yi Zeng, Hongpeng Lin, Jingwen Zhang et al.
Disambiguate Words like Composing Them: A Morphology-Informed Approach to Enhance Chinese Word Sense Disambiguation
Yue Wang, Qiliang Liang, Yaqi Yin et al.
Language Models can Evaluate Themselves via Probability Discrepancy
Tingyu Xia, Bowen Yu, Yuan Wu et al.
Strong hallucinations from negation and how to fix them
Nicholas Asher, Swarnadeep Bhar
LLMs cannot find reasoning errors, but can correct them given the error location
Gladys Tyen, Hassan Mansoor, Victor Carbune et al.
Fantastic Semantics and Where to Find Them: Investigating Which Layers of Generative LLMs Reflect Lexical Semantics
Zhu Liu, Cunliang Kong, Ying Liu et al.
HALoGEN: Fantastic LLM Hallucinations and Where to Find Them
Abhilasha Ravichander, Shrusti Ghela, David Wadden et al.
Conspiracy Theories and Where to Find Them on TikTok
Francesco Corso, Francesco Pierri, Gianmarco De Francisci Morales
ONEBench to Test Them All: Sample-Level Benchmarking Over Open-Ended Capabilities
Adhiraj Ghosh, Sebastian Dziadzio, Ameya Prabhu et al.
Low-Perplexity LLM-Generated Sequences and Where To Find Them
Arthur Wuhrmann, Andrei Kucharavy, Anastasiia Kucherenko
Building Japanese Creativity Benchmarks and Applying them to Enhance LLM Creativity
So Fukuda, Hayato Ogawa, Kaito Horio et al.
Do Language Models Understand the Cognitive Tasks Given to Them? Investigations with the N-Back Paradigm
Xiaoyang Hu, Richard Lewis
Explicit Bayesian Inference to Uncover the Latent Themes of Large Language Models
Raymond Li, Chuyuan Li, Gabriel Murray et al.
Tell, Don’t Show: Leveraging Language Models’ Abstractive Retellings to Model Literary Themes
Li Lucy, Camilla Griffiths, Sarah Levine et al.
Predicting Depression in Screening Interviews from Interactive Multi-Theme Collaboration
Xianbing Zhao, Yiqing Lyu, Di Wang et al.