Papers
546 papers found
Policy Evaluation in Distributional LQR
Zifan Wang, Yulong Gao, Siyi Wang et al.
Policy Gradient Play with Networked Agents in Markov Potential Games
Sarper Aydin, Ceyhun Eksin
Policy Learning for Active Target Tracking over Continuous $SE(3)$ Trajectories
Pengzhi Yang, Shumon Koga, Arash Asgharivaskasi et al.
Practical Critic Gradient based Actor Critic for On-Policy Reinforcement Learning
Swaminathan Gurumurthy, Zachary Manchester, J Zico Kolter
Predictive safety filter using system level synthesis
Antoine Leeman, Johannes Köhler, Samir Bennani et al.
Probabilistic Invariance for Gaussian Process State Space Models
Paul Griffioen, Alex Devonport, Murat Arcak
Probabilistic Safeguard for Reinforcement Learning Using Safety Index Guided Gaussian Process Models
Weiye Zhao, Tairan He, Changliu Liu
Probabilistic Symmetry for Multi-Agent Dynamics
Sophia Huiwen Sun, Robin Walters, Jinxi Li et al.
Probabilistic Verification of ReLU Neural Networks via Characteristic Functions
Joshua Pilipovsky, Vignesh Sivaramakrishnan, Meeko Oishi et al.
Provably Efficient Generalized Lagrangian Policy Optimization for Safe Multi-Agent Reinforcement Learning
Dongsheng Ding, Xiaohan Wei, Zhuoran Yang et al.
Reachability Analysis-based Safety-Critical Control using Online Fixed-Time Reinforcement Learning
Nick-Marios Kokolakis, Kyriakos G Vamvoudakis, Wassim Haddad
Rectified Pessimistic-Optimistic Learning for Stochastic Continuum-armed Bandit with Constraints
Hengquan Guo, Zhu Qi, Xin Liu
Regret Analysis of Online LQR Control via Trajectory Prediction and Tracking
Yitian Chen, Timothy L Molloy, Tyler Summers et al.
Regret Guarantees for Online Deep Control
Xinyi Chen, Edgar Minasyan, Jason D. Lee et al.
Roll-Drop: accounting for observation noise with a single parameter
Luigi Campanaro, Daniele De Martini, Siddhant Gangapurwala et al.
Safe and Efficient Reinforcement Learning using Disturbance-Observer-Based Control Barrier Functions
Yikun Cheng, Pan Zhao, Naira Hovakimyan
Sample Complexity Bound for Evaluating the Robust Observer’s Performance under Coprime Factors Uncertainty
Serban Sabau, Yifei Zhang, Sourav Kumar Ukil
Satellite Navigation and Coordination with Limited Information Sharing
Sydney Dolan, Siddharth Nayak, Hamsa Balakrishnan
Targeted Adversarial Attacks against Neural Network Trajectory Predictors
Kaiyuan Tan, Jun Wang, Yiannis Kantaros
Template-Based Piecewise Affine Regression
Guillaume O Berger, Sriram Sankaranarayanan
The Impact of the Geometric Properties of the Constraint Set in Safe Optimization with Bandit Feedback
Spencer Hutchinson, Berkay Turan, Mahnoosh Alizadeh
Time Dependent Inverse Optimal Control using Trigonometric Basis Functions
Rahel Rickenbach, Elena Arcari, Melanie Zeilinger
Time-Incremental Learning of Temporal Logic Classifiers Using Decision Trees
Erfan Aasi, Mingyu Cai, Cristian Ioan Vasile et al.
Top-k data selection via distributed sample quantile inference
Xu Zhang, Marcos M. Vasconcelos