Papers
Regularized Best-of-N Sampling with Minimum Bayes Risk Objective for Language Model Alignment
NAACL 2025
When Personalization Meets Reality: A Multi-Faceted Analysis of Personalized Preference Learning
EMNLP 2025
World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning
ACL 2025
Rating-Based Reinforcement Learning
AAAI 2024
Embedding Learning for Preference-based Speech Quality Assessment
INTERSPEECH 2024