Papers

5,479 papers found
RepLiQA: A Question-Answering Dataset for Benchmarking LLMs on Unseen Reference Content
João Monteiro, Pierre-André Noël, Étienne Marcotte et al.
2024 NIPS
Transcoders find interpretable LLM feature circuits
Jacob Dunefsky, Philippe Chlenski, Neel Nanda
2024 NIPS
2024 NIPS
2024 NIPS
2024 NIPS
Enhancing LLM Reasoning via Vision-Augmented Prompting
Ziyang Xiao, Dongxiang Zhang, Xiongwei Han et al.
2024 NIPS
2024 NIPS
MediQ: Question-Asking LLMs and a Benchmark for Reliable Interactive Clinical Reasoning
Shuyue Stella Li, Vidhisha Balachandran, Shangbin Feng et al.
2024 NIPS
Protecting Your LLMs with Information Bottleneck
Zichuan Liu, Zefan Wang, Linjie Xu et al.
2024 NIPS
Time-Reversal Provides Unsupervised Feedback to LLMs
Varun Yerram, Rahul Madhavan, Sravanti Addepalli et al.
2024 NIPS
Wings: Learning Multimodal LLMs without Text-only Forgetting
Yi-Kai Zhang, Shiyin Lu, Yang Li et al.
2024 NIPS
2024 NIPS
2024 NIPS
LeDex: Training LLMs to Better Self-Debug and Explain Code
Nan Jiang, Xiaopeng Li, Shiqi Wang et al.
2024 NIPS
Dataset Decomposition: Faster LLM Training with Variable Sequence Length Curriculum
Hadi Pouransari, Chun-Liang Li, Jen-Hao Rick Chang et al.
2024 NIPS
Dataset and Lessons Learned from the 2024 SaTML LLM Capture-the-Flag Competition
Edoardo Debenedetti, Javier Rando, Daniel Paleka et al.
2024 NIPS
StackEval: Benchmarking LLMs in Coding Assistance
Nidhish Shah, Zulkuf Genc, Dogu Araci
2024 NIPS
2024 NIPS