conftrace_
2025 L4DC L4DC 2025

Safe Exploration in Reinforcement Learning: Training Backup Control Barrier Functions with Zero Training-Time Safety Violations