Borong Zhang
6 papers · 2023–2026 · 5 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+1 more ↓ Show less ↑
π Interdisciplinary Bridge π§ Keyword Pioneer π Conference Polyglot (4) π Cross-Pollinator (13) π Renaissance Researcher (5)
πΊοΈ
Taxonomy Completionist
(13)
Conferences
NIPS (2)
AAAI (1)
ACL (1)
ICLR (1)
JMLR (1)
Top co-authors
Keywords
reinforcement learning from human feedback
(2)
safe reinforcement learning
(2)
large language model
(2)
policy optimization
(1)
transfer learning
(1)
preference learning
(1)
constraint optimization
(1)
knowledge distillation
(1)
policy learning
(1)
ai safety
(1)
safety alignment
(1)
risk minimization
(1)
continuous control
(1)
constraint satisfaction
(1)
state representation learning
(1)
human feedback
(1)
diffusion model
(1)
hallucination reduction
(1)
safety benchmark
(1)
alignment method
(1)
Papers
Latent State-Predictive Exploration for Deep Reinforcement Learning
AAAI 2026
PKU-SafeRLHF: Towards Multi-Level Safety Alignment for LLMs with Human Preference
ACL 2025
Aligner: Efficient Alignment by Learning to Correct
NIPS 2024
SafeDreamer: Safe Reinforcement Learning with World Models
ICLR 2024
OmniSafe: An Infrastructure for Accelerating Safe Reinforcement Learning Research
JMLR 2024
Safety Gymnasium: A Unified Safe Reinforcement Learning Benchmark
NIPS 2023