Papers

261 papers found
On Softmax Direct Preference Optimization for Recommendation
Yuxin Chen, Junfei Tan, An Zhang et al.
2024 NIPS
Group Robust Preference Optimization in Reward-free RLHF
Shyam Sundhar Ramesh, Yifan Hu, Iason Chaimalas et al.
2024 NIPS
Iterative Reasoning Preference Optimization
Richard Yuanzhe Pang, Weizhe Yuan, Kyunghyun Cho et al.
2024 NIPS
$\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$
Junkang Wu, Yuexiang Xie, Zhengyi Yang et al.
2024 NIPS
Multi-Reference Preference Optimization for Large Language Models
Hung Le, Quan Hung Tran, Dung Nguyen et al.
2025 AAAI
Atomic Consistency Preference Optimization for Long-Form Question Answering
Jingfeng Chen, Raghuveer Thirukovalluru, Junlin Wang et al.
2025 AACL