Binghai Wang
4 papers · 2024–2026 · 3 top CS/AI conferences
Achievements
Conference Polyglot (2)
Interdisciplinary Bridge
Taxonomy Completionist (10)
Keyword Pioneer
Cross-Pollinator (15)
Conferences
EMNLP (2)
ACL (1)
ICLR (1)
Top co-authors
Keywords
reward modeling (2)
reward model (2)
language model alignment (2)
reinforcement learning from human feedback (2)
data quality (1)
preference datum (1)
human preference datum (1)
deceptive alignment (1)
outcome accuracy (1)
contrastive learning (1)
rationale consistency (1)
reinforcement learning (1)
preference alignment (1)
preference modeling (1)
Papers
Outcome Accuracy is Not Enough: Aligning the Reasoning Process of Reward Models
ACL 2026
RMB: Comprehensively benchmarking reward models in LLM alignment
ICLR 2025
Improving Discriminative Capability of Reward Models in RLHF Using Contrastive Learning
EMNLP 2024
Reward Modeling Requires Automatic Adjustment Based on Data Quality
EMNLP 2024