AI Safety

2972 directly classified papers

Papers

Towards Class-Oriented Poisoning Attacks Against Neural Networks WACV 2022

On the Effectiveness of Small Input Noise for Defending Against Query-Based Black-Box Attacks WACV 2022

Can You Spot the Chameleon? Adversarially Camouflaging Images From Co-Salient Object Detection CVPR 2022

Segment and Complete: Defending Object Detectors Against Adversarial Patch Attacks With Robust Patch Detection CVPR 2022

Demystifying the Adversarial Robustness of Random Transformation Defenses ICML 2022

Self-Healing Robust Neural Networks via Closed-Loop Control JMLR 2022

OVERT: An Algorithm for Safety Verification of Neural Network Control Policies for Nonlinear Systems JMLR 2022

Predicting the Influence of Fake and Real News Spreaders (Student Abstract) AAAI 2022

On Global-view Based Defense via Adversarial Attack and Defense Risk Guaranteed Bounds AISTATS 2022

A Complete Criterion for Value of Information in Soluble Influence Diagrams AAAI 2022

The King Is Naked: On the Notion of Robustness for Natural Language Processing AAAI 2022

On Collective Robustness of Bagging Against Data Poisoning ICML 2022

Consent as a Foundation for Responsible Autonomy AAAI 2022

Gaussian Process Uniform Error Bounds with Unknown Hyperparameters for Safety-Critical Applications ICML 2022

Joint Synthesis of Safety Certificate and Safe Control Policy Using Constrained Reinforcement Learning L4DC 2022

Safe Reinforcement Learning with Chance-constrained Model Predictive Control L4DC 2022

Safety-Aware Preference-Based Learning for Safety-Critical Control L4DC 2022

Protecting Intellectual Property of Language Generation APIs with Lexical Watermark AAAI 2022

Query-Efficient and Scalable Black-Box Adversarial Attacks on Discrete Sequential Data via Bayesian Optimization ICML 2022

Intriguing Properties of Input-Dependent Randomized Smoothing ICML 2022

FedInv: Byzantine-Robust Federated Learning by Inversing Local Model Updates AAAI 2022

GalilAI: Out-of-Task Distribution Detection using Causal Active Experimentation for Safe Transfer RL AISTATS 2022

Certified Robustness via Randomized Smoothing over Multiplicative Parameters of Input Transformations IJCAI 2022

Automatic Reliability Testing For Cluster Management Controllers OSDI 2022

Calibrated Learning to Defer with One-vs-All Classifiers ICML 2022