conftrace_

Borong Zhang

6 papers · 2023–2026 · 5 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+1 more ↓

🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (4) 🐝 Cross-Pollinator (13) 🌈 Renaissance Researcher (5)

🗺️ Taxonomy Completionist (13)

Conferences

NIPS (2) AAAI (1) ACL (1) ICLR (1) JMLR (1)

Top co-authors

Jiaming Ji (5) Yaodong Yang (5) Xuehai Pan (3) Weidong Huang (3) Jiayi Zhou (3) Yiran Geng (2) Donghai Hong (2) Josef Dai (2) Ruiyang Sun (2) Boyuan Chen (2)

Keywords

reinforcement learning from human feedback (2) safe reinforcement learning (2) large language model (2) policy optimization (1) transfer learning (1) preference learning (1) constraint optimization (1) knowledge distillation (1) policy learning (1) ai safety (1) safety alignment (1) risk minimization (1) continuous control (1) constraint satisfaction (1) state representation learning (1) human feedback (1) diffusion model (1) hallucination reduction (1) safety benchmark (1) alignment method (1)

Papers

Latent State-Predictive Exploration for Deep Reinforcement Learning AAAI 2026 PKU-SafeRLHF: Towards Multi-Level Safety Alignment for LLMs with Human Preference ACL 2025 Aligner: Efficient Alignment by Learning to Correct NIPS 2024 SafeDreamer: Safe Reinforcement Learning with World Models ICLR 2024 OmniSafe: An Infrastructure for Accelerating Safe Reinforcement Learning Research JMLR 2024 Safety Gymnasium: A Unified Safe Reinforcement Learning Benchmark NIPS 2023