Yifu Huo
6 papers · 2024–2026 · 3 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+1 more ↓ Show less ↑
π Conference Polyglot (3) π Renaissance Researcher (5) π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (16) π§ Keyword Pioneer
π
Cross-Pollinator
(15)
Conferences
AAAI (4)
EMNLP (1)
ICML (1)
Top co-authors
Keywords
reward model
(3)
preference optimization
(2)
language model alignment
(2)
reinforcement learning from human feedback
(2)
machine translation
(1)
preference alignment
(1)
sampling efficiency
(1)
ranking accuracy
(1)
generative model
(1)
foundation model
(1)
hallucination mitigation
(1)
proximal policy optimization
(1)
human preference alignment
(1)
efficient sampling
(1)
abstractive summarization
(1)
hypothesis space
(1)
multi-dimensional evaluation
(1)
probing method
(1)
visual reward model
(1)
preference representation
(1)
Papers
GRAM-RΒ²: Self-Training Generative Foundation Reward Models for Reward Reasoning
AAAI 2026
Probing Preference Representations: A Multi-Dimensional Evaluation and Analysis Method for Reward Models
AAAI 2026
RoVRM: A Robust Visual Reward Model Optimized via Auxiliary Textual Preference Data
AAAI 2025
HEAL: A Hypothesis-Based Preference-Aware Analysis Framework
EMNLP 2025
GRAM: A Generative Foundation Reward Model for Reward Generalization
ICML 2025
ESRL: Efficient Sampling-Based Reinforcement Learning for Sequence Generation
AAAI 2024