Papers - Conftrace

TGDPO: Harnessing Token-Level Reward Guidance for Enhancing Direct Preference Optimization

Mingkang Zhu, Xi Chen, Zhongdao Wang et al.

2025 ICML

The Batch Complexity of Bandit Pure Exploration

Adrienne Tuynman, Rémy Degenne

2025 ICML

The Berkeley Function Calling Leaderboard (BFCL): From Tool Use to Agentic Evaluation of Large Language Models

Shishir G Patil, Huanzhi Mao, Fanjia Yan et al.

2025 ICML

The Best of Both Worlds: Bridging Quality and Diversity in Data Selection with Bipartite Graph

Minghao Wu, Thuy-Trang Vu, Lizhen Qu et al.

2025 ICML

The Brain’s Bitter Lesson: Scaling Speech Decoding With Self-Supervised Learning

Dulhan Jayalath, Gilad Landau, Brendan Shillingford et al.

2025 ICML

The Butterfly Effect: Neural Network Training Trajectories Are Highly Sensitive to Initial Conditions

Gül Sena Altıntaş, Devin Kwok, Colin Raffel et al.

2025 ICML

The Canary’s Echo: Auditing Privacy Risks of LLM-Generated Synthetic Text

Matthieu Meeus, Lukas Wutschitz, Santiago Zanella-Beguelin et al.

2025 ICML

The Case for Learned Provenance-based System Behavior Baseline

Yao Zhu, Zhenyuan Li, Yangyang Wei et al.

2025 ICML

The Complexity of Learning Sparse Superposed Features with Feedback

Akash Kumar

2025 ICML

The Courage to Stop: Overcoming Sunk Cost Fallacy in Deep Reinforcement Learning

Jiashun Liu, Johan Obando-Ceron, Pablo Samuel Castro et al.

2025 ICML

The dark side of the forces: assessing non-conservative force models for atomistic machine learning

Filippo Bigi, Marcel F. Langer, Michele Ceriotti

2025 ICML

The Devil Is in the Details: Tackling Unimodal Spurious Correlations for Generalizable Multimodal Reward Models

Zichao Li, Xueru Wen, Jie Lou et al.

2025 ICML

The Diffusion Duality

Subham Sekhar Sahoo, Justin Deschenaux, Aaron Gokaslan et al.

2025 ICML

The Disparate Benefits of Deep Ensembles

Kajetan Schweighofer, Adrian Arnaiz-Rodriguez, Sepp Hochreiter et al.

2025 ICML

The Double-Ellipsoid Geometry of CLIP

Meir Yossef Levi, Guy Gilboa

2025 ICML

The Elicitation Game: Evaluating Capability Elicitation Techniques

Felix Hofstätter, Teun Van Der Weij, Jayden Teoh et al.

2025 ICML

The Emperor’s New Clothes in Benchmarking? A Rigorous Examination of Mitigation Strategies for LLM Benchmark Data Contamination

Yifan Sun, Han Wang, Dongbai Li et al.

2025 ICML

The Empirical Mean is Minimax Optimal for Local Glivenko-Cantelli

Doron Cohen, Aryeh Kontorovich, Roi Weiss

2025 ICML

The Energy Loss Phenomenon in RLHF: A New Perspective on Mitigating Reward Hacking

Yuchun Miao, Sen Zhang, Liang Ding et al.

2025 ICML

The Four Color Theorem for Cell Instance Segmentation

Ye Zhang, Yu Zhou, Yifeng Wang et al.

2025 ICML

The Generalized Skew Spectrum of Graphs

Armando Bellante, Martin Plávala, Alessandro Luongo

2025 ICML

The Geometry of Refusal in Large Language Models: Concept Cones and Representational Independence

Tom Wollschläger, Jannes Elstner, Simon Geisler et al.

2025 ICML

The Global Convergence Time of Stochastic Gradient Descent in Non-Convex Landscapes: Sharp Estimates via Large Deviations

Waïss Azizian, Franck Iutzeler, Jerome Malick et al.

2025 ICML

The Harder Path: Last Iterate Convergence for Uncoupled Learning in Zero-Sum Games with Bandit Feedback

Côme Fiegel, Pierre Menard, Tadashi Kozuno et al.

2025 ICML

The Hidden Dimensions of LLM Alignment: A Multi-Dimensional Analysis of Orthogonal Safety Directions

Wenbo Pan, Zhichao Liu, Qiguang Chen et al.

2025 ICML