Papers
ODIN: Disentangled Reward Mitigates Hacking in RLHF
Lichang Chen, Chen Zhu, Jiuhai Chen et al.
Offline Actor-Critic Reinforcement Learning Scales to Large Models
Jost Tobias Springenberg, Abbas Abdolmaleki, Jingwei Zhang et al.
Offline-Boosted Actor-Critic: Adaptively Blending Optimal Historical Behaviors in Deep Off-Policy RL
Yu Luo, Tianying Ji, Fuchun Sun et al.
Offline Imitation from Observation via Primal Wasserstein State Occupancy Matching
Kai Yan, Alex Schwing, Yu-Xiong Wang
Offline Inverse RL: New Solution Concepts and Provably Efficient Algorithms
Filippo Lazzati, Mirco Mutti, Alberto Maria Metelli
Offline Multi-Objective Optimization
Ke Xue, Rongxi Tan, Xiaobin Huang et al.
Offline Training of Language Model Agents with Functions as Learnable Weights
Shaokun Zhang, Jieyu Zhang, Jiale Liu et al.
Offline Transition Modeling via Contrastive Energy Learning
Ruifeng Chen, Chengxing Jia, Zefang Huang et al.
Off-policy Evaluation Beyond Overlap: Sharp Partial Identification Under Smoothness
Samir Khan, Martin Saveski, Johan Ugander
OLLIE: Imitation Learning from Offline Pretraining to Online Finetuning
Sheng Yue, Xingyuan Hua, Ju Ren et al.
OMPO: A Unified Framework for RL under Policy and Dynamics Shifts
Yu Luo, Tianying Ji, Fuchun Sun et al.
On a Combinatorial Problem Arising in Machine Teaching
Joakim Sunde, Brigt Håvardstun, Jan Kratochvı́l et al.
On a Neural Implementation of Brenier’s Polar Factorization
Nina Vesseron, Marco Cuturi
On Computational Limits of Modern Hopfield Models: A Fine-Grained Complexity Analysis
Jerry Yao-Chieh Hu, Thomas Lin, Zhao Song et al.
On Convergence of Incremental Gradient for Non-convex Smooth Functions
Anastasia Koloskova, Nikita Doikov, Sebastian U Stich et al.
On dimensionality of feature vectors in MPNNs
César Bravo, Alexander Kozachinskiy, Cristobal Rojas
On Discrete Prompt Optimization for Diffusion Models
Ruochen Wang, Ting Liu, Cho-Jui Hsieh et al.
One for All: A Universal Generator for Concept Unlearnability via Multi-Modal Alignment
Chaochao Chen, Jiaming Zhang, Yuyuan Li et al.
One Meta-tuned Transformer is What You Need for Few-shot Learning
Xu Yang, Huaxiu Yao, Ying Wei
One Prompt is not Enough: Automated Construction of a Mixture-of-Expert Prompts
Ruochen Wang, Sohyun An, Minhao Cheng et al.
One-Shot Strategic Classification Under Unknown Costs
Elan Rosenfeld, Nir Rosenfeld
One Size Fits All for Semantic Shifts: Adaptive Prompt Tuning for Continual Learning
Doyoung Kim, Susik Yoon, Dongmin Park et al.
On Hypothesis Transfer Learning of Functional Linear Models
Haotian Lin, Matthew Reimherr
On Interpolating Experts and Multi-Armed Bandits
Houshuang Chen, Yuchen He, Chihao Zhang