Papers

261 papers found
2025 ICLR
2025 ICLR
Preference Optimization for Reasoning with Pseudo Feedback
Fangkai Jiao, Geyang Guo, Xingxing Zhang et al.
2025 ICLR
2025 ICLR
2025 ICLR
2025 ICLR
Modality-Fair Preference Optimization for Trustworthy MLLM Alignment
Songtao Jiang, Yan Zhang, Ruizhe Chen et al.
2025 IJCAI
2025 IJCAI
Atomic Consistency Preference Optimization for Long-Form Question Answering
Jingfeng Chen, Raghuveer Thirukovalluru, Junlin Wang et al.
2025 IJCNLP
2025 NAACL
2025 NAACL
PORT: Preference Optimization on Reasoning Traces
Salem Lahlou, Abdalgader Abubaker, Hakim Hacid
2025 NAACL
2025 NAACL