Papers

261 papers found
Geometric-Averaged Preference Optimization for Soft Preference Labels
Hiroki Furuta, Kuang-Huei Lee, Shixiang Shane Gu et al.
2024 NIPS
Teaching an Old LLM Secure Coding: Localized Preference Optimization on Distilled Preferences
Mohammad Saqib Hasan, Saikat Chakraborty, Santu Karmaker et al.
2025 ACL
Neural Dueling Bandits: Preference-Based Optimization with Human Feedback
Arun Verma, Zhongxiang Dai, Xiaoqiang Lin et al.
2025 ICLR
2025 IJCNLP
2025 AACL
Direct Preference-based Policy Optimization without Reward Modeling
Gaon An, Junhyeok Lee, Xingdong Zuo et al.
2023 NIPS
Beyond Reward: Offline Preference-guided Policy Optimization
Yachen Kang, Diyuan Shi, Jinxin Liu et al.
2023 ICML
No Preference Left Behind: Group Distributional Preference Optimization
Binwei Yao, Zefan Cai, Yun-Shiuan Chuang et al.
2025 ICLR
Preference-Aware Constrained Multi-Objective Bayesian Optimization (Student Abstract)
Alaleh Ahmadianshalchi, Syrine Belakaria, Janardhan Rao Doppa
2024 AAAI
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
Rafael Rafailov, Archit Sharma, Eric Mitchell et al.
2023 NIPS