conftrace_

Binghai Wang

4 papers · 2024–2026 · 3 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

🌍 Conference Polyglot (2) 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (10) 🧭 Keyword Pioneer 🐝 Cross-Pollinator (15)

Conferences

EMNLP (2) ACL (1) ICLR (1)

Top co-authors

Tao Gui (4) Qi Zhang (4) Xuanjing Huang (4) Zhiheng Xi (3) Rui Zheng (3) Lu Chen (2) Wei Shen (2) Yuhao Zhou (2) Bowen Yu (1) Junyang Lin (1)

Keywords

reward modeling (2) reward model (2) language model alignment (2) reinforcement learning from human feedback (2) data quality (1) preference datum (1) human preference datum (1) deceptive alignment (1) outcome accuracy (1) contrastive learning (1) rationale consistency (1) reinforcement learning (1) preference alignment (1) preference modeling (1)

Papers

Outcome Accuracy is Not Enough: Aligning the Reasoning Process of Reward Models ACL 2026 RMB: Comprehensively benchmarking reward models in LLM alignment ICLR 2025 Improving Discriminative Capability of Reward Models in RLHF Using Contrastive Learning EMNLP 2024 Reward Modeling Requires Automatic Adjustment Based on Data Quality EMNLP 2024