Co-occurring keywords
Papers
Direct Large Language Model Alignment Through Self-Rewarding Contrastive Prompt Distillation
ACL 2024
CURATRON: Complete and Robust Preference Data for Rigorous Alignment of Large Language Models
NAACL 2024