Papers
Direct Large Language Model Alignment Through Self-Rewarding Contrastive Prompt Distillation
ACL 2024
Preference-based Pure Exploration
NeurIPS 2024
Are U a Joke Master? Pun Generation via Multi-Stage Curriculum Learning towards a Humor LLM
ACL 2024
Enhancing Multimodal Emotion Recognition through ASR Error Compensation and LLM Fine-Tuning
INTERSPEECH 2024
Think Before You Duel: Understanding Complexities of Preference Learning under Constrained Resources
AISTATS 2024
The Paradox of Preference: A Study on LLM Alignment Algorithms and Data Acquisition Methods
NAACL 2024