Papers
TGDPO: Harnessing Token-Level Reward Guidance for Enhancing Direct Preference Optimization
Mingkang Zhu, Xi Chen, Zhongdao Wang et al.
The Batch Complexity of Bandit Pure Exploration
Adrienne Tuynman, Rémy Degenne
The Berkeley Function Calling Leaderboard (BFCL): From Tool Use to Agentic Evaluation of Large Language Models
Shishir G Patil, Huanzhi Mao, Fanjia Yan et al.
The Best of Both Worlds: Bridging Quality and Diversity in Data Selection with Bipartite Graph
Minghao Wu, Thuy-Trang Vu, Lizhen Qu et al.
The Brain’s Bitter Lesson: Scaling Speech Decoding With Self-Supervised Learning
Dulhan Jayalath, Gilad Landau, Brendan Shillingford et al.
The Butterfly Effect: Neural Network Training Trajectories Are Highly Sensitive to Initial Conditions
Gül Sena Altıntaş, Devin Kwok, Colin Raffel et al.
The Canary’s Echo: Auditing Privacy Risks of LLM-Generated Synthetic Text
Matthieu Meeus, Lukas Wutschitz, Santiago Zanella-Beguelin et al.
The Case for Learned Provenance-based System Behavior Baseline
Yao Zhu, Zhenyuan Li, Yangyang Wei et al.
The Courage to Stop: Overcoming Sunk Cost Fallacy in Deep Reinforcement Learning
Jiashun Liu, Johan Obando-Ceron, Pablo Samuel Castro et al.
The dark side of the forces: assessing non-conservative force models for atomistic machine learning
Filippo Bigi, Marcel F. Langer, Michele Ceriotti
The Devil Is in the Details: Tackling Unimodal Spurious Correlations for Generalizable Multimodal Reward Models
Zichao Li, Xueru Wen, Jie Lou et al.
The Diffusion Duality
Subham Sekhar Sahoo, Justin Deschenaux, Aaron Gokaslan et al.
The Disparate Benefits of Deep Ensembles
Kajetan Schweighofer, Adrian Arnaiz-Rodriguez, Sepp Hochreiter et al.
The Double-Ellipsoid Geometry of CLIP
Meir Yossef Levi, Guy Gilboa
The Elicitation Game: Evaluating Capability Elicitation Techniques
Felix Hofstätter, Teun Van Der Weij, Jayden Teoh et al.
The Emperor’s New Clothes in Benchmarking? A Rigorous Examination of Mitigation Strategies for LLM Benchmark Data Contamination
Yifan Sun, Han Wang, Dongbai Li et al.
The Empirical Mean is Minimax Optimal for Local Glivenko-Cantelli
Doron Cohen, Aryeh Kontorovich, Roi Weiss
The Energy Loss Phenomenon in RLHF: A New Perspective on Mitigating Reward Hacking
Yuchun Miao, Sen Zhang, Liang Ding et al.
The Four Color Theorem for Cell Instance Segmentation
Ye Zhang, Yu Zhou, Yifeng Wang et al.
The Generalized Skew Spectrum of Graphs
Armando Bellante, Martin Plávala, Alessandro Luongo
The Geometry of Refusal in Large Language Models: Concept Cones and Representational Independence
Tom Wollschläger, Jannes Elstner, Simon Geisler et al.
The Global Convergence Time of Stochastic Gradient Descent in Non-Convex Landscapes: Sharp Estimates via Large Deviations
Waïss Azizian, Franck Iutzeler, Jerome Malick et al.
The Harder Path: Last Iterate Convergence for Uncoupled Learning in Zero-Sum Games with Bandit Feedback
Côme Fiegel, Pierre Menard, Tadashi Kozuno et al.
The Hidden Dimensions of LLM Alignment: A Multi-Dimensional Analysis of Orthogonal Safety Directions
Wenbo Pan, Zhichao Liu, Qiguang Chen et al.