conftrace_

Yuhang Lai

5 papers · 2023–2025 · 3 conferences · across top CS/AI conferences

Achievements

🌍 Conference Polyglot (3) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🗺️ Taxonomy Completionist (20) 🐝 Cross-Pollinator (3)

🌈 Renaissance Researcher (6) ❓ The Questioner

Conferences

ACL (2) EMNLP (2) ICML (1)

Top co-authors

Shujun Liu (3) Siyuan Wang (3) Tao Yu (2) Zhongyu Wei (2) Xuanjing Huang (2) Chengxi Li (1) Haoyuan Wu (1) Qian Liu (1) Che Liu (1) Wen-tau Yih (1)

Keywords

code generation (2) large language model (2) benchmark evaluation (1) adversarial robustness (1) reward modeling (1) preference alignment (1) language model alignment (1) model safety (1) knowledge base (1) reinforcement learning from human feedback (1) preference modeling (1) human feedback (1) ensemble method (1) reward model (1) retrieval-augmented generation (1) large vision-language model (1) human preference (1) jailbreak defense (1) token-level prediction (1) functional correctness (1)

Papers

HAF-RM: A Hybrid Alignment Framework for Reward Model Training · ACL 2025
How Jailbreak Defenses Work and Ensemble? A Mechanistic Investigation · EMNLP 2025
ALaRM: Align Language Models via Hierarchical Rewards Modeling · ACL 2024
EvoR: Evolving Retrieval for Code Generation · EMNLP 2024
DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation · ICML 2023