Papers

2,781 papers found
Bias Runs Deep: Implicit Reasoning Biases in Persona-Assigned LLMs
Shashank Gupta, Vaishnavi Shrivastava, Ameet Deshpande et al.
2024 ICLR
The Reversal Curse: LLMs trained on “A is B” fail to learn “B is A”
Lukas Berglund, Meg Tong, Maximilian Kaufmann et al.
2024 ICLR
Catastrophic Jailbreak of Open-source LLMs via Exploiting Generation
Yangsibo Huang, Samyak Gupta, Mengzhou Xia et al.
2024 ICLR
AgentBench: Evaluating LLMs as Agents
Xiao Liu, Hao Yu, Hanchen Zhang et al.
2024 ICLR
2024 ICLR
2024 ICLR
2024 ICLR
LLM Augmented LLMs: Expanding Capabilities through Composition
Rachit Bansal, Bidisha Samanta, Siddharth Dalmia et al.
2024 ICLR
Chain-of-Experts: When LLMs Meet Complex Operations Research Problems
Ziyang Xiao, Dongxiang Zhang, Yangjun Wu et al.
2024 ICLR
How to Catch an AI Liar: Lie Detection in Black-Box LLMs by Asking Unrelated Questions
Lorenzo Pacchiardi, Alex James Chan, Sören Mindermann et al.
2024 ICLR
Teach LLMs to Phish: Stealing Private Information from Language Models
Ashwinee Panda, Christopher A. Choquette-Choo, Zhengming Zhang et al.
2024 ICLR
SmartPlay : A Benchmark for LLMs as Intelligent Agents
Yue Wu, Xuan Tang, Tom Mitchell et al.
2024 ICLR
Compressing LLMs: The Truth is Rarely Pure and Never Simple
AJAY KUMAR JAISWAL, Zhe Gan, Xianzhi Du et al.
2024 ICLR
2024 ICLR
2024 ICLR
2024 ICLR