Yuhang Lai
5 papers · 2023–2025 · 3 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+2 more ↓ Show less ↑
π Conference Polyglot (3) π Interdisciplinary Bridge π§ Keyword Pioneer πΊοΈ Taxonomy Completionist (20) π Cross-Pollinator (3)
π
Renaissance Researcher
(6)
β
The Questioner
Conferences
ACL (2)
EMNLP (2)
ICML (1)
Top co-authors
Keywords
code generation
(2)
large language model
(2)
benchmark evaluation
(1)
adversarial robustness
(1)
reward modeling
(1)
preference alignment
(1)
language model alignment
(1)
model safety
(1)
knowledge base
(1)
reinforcement learning from human feedback
(1)
preference modeling
(1)
human feedback
(1)
ensemble method
(1)
reward model
(1)
retrieval-augmented generation
(1)
large vision-language model
(1)
human preference
(1)
jailbreak defense
(1)
token-level prediction
(1)
functional correctness
(1)
Papers
HAF-RM: A Hybrid Alignment Framework for Reward Model Training
ACL 2025
How Jailbreak Defenses Work and Ensemble? A Mechanistic Investigation
EMNLP 2025
ALaRM: Align Language Models via Hierarchical Rewards Modeling
ACL 2024
EvoR: Evolving Retrieval for Code Generation
EMNLP 2024
DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation
ICML 2023