Papers

261 papers found
2025 CVPR
Curriculum Direct Preference Optimization for Diffusion and Consistency Models
Florinel-Alin Croitoru, Vlad Hondru, Radu Tudor Ionescu et al.
2025 CVPR
2025 CVPR
2025 CVPR
Direct Multi-Turn Preference Optimization for Language Agents
Wentao Shi, Mengqi Yuan, Junkang Wu et al.
2024 EMNLP
2024 EMNLP
2024 EMNLP
WPO: Enhancing RLHF with Weighted Preference Optimization
Wenxuan Zhou, Ravi Agrawal, Shujian Zhang et al.
2024 EMNLP
2024 EMNLP
2024 EMNLP
2024 EMNLP
Filtered Direct Preference Optimization
Tetsuro Morimura, Mitsuki Sakamoto, Yuu Jinnai et al.
2024 EMNLP
2024 EMNLP
Step-level Value Preference Optimization for Mathematical Reasoning
Guoxin Chen, Minpeng Liao, Chengxi Li et al.
2024 EMNLP