Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Core AI
Artificial Intelligence
›
Core AI
›
AI Safety
2972 directly classified papers
Papers per year
2002: 1
2006: 1
2007: 1
2012: 4
2013: 1
2015: 5
2016: 1
2017: 13
2018: 40
2019: 91
2020: 111
2021: 181
2022: 204
2023: 333
2024: 642
2025: 1031
2026: 312
Papers
Towards Class-Oriented Poisoning Attacks Against Neural Networks
WACV 2022
On the Effectiveness of Small Input Noise for Defending Against Query-Based Black-Box Attacks
WACV 2022
Can You Spot the Chameleon? Adversarially Camouflaging Images From Co-Salient Object Detection
CVPR 2022
Segment and Complete: Defending Object Detectors Against Adversarial Patch Attacks With Robust Patch Detection
CVPR 2022
Demystifying the Adversarial Robustness of Random Transformation Defenses
ICML 2022
Self-Healing Robust Neural Networks via Closed-Loop Control
JMLR 2022
OVERT: An Algorithm for Safety Verification of Neural Network Control Policies for Nonlinear Systems
JMLR 2022
Predicting the Influence of Fake and Real News Spreaders (Student Abstract)
AAAI 2022
On Global-view Based Defense via Adversarial Attack and Defense Risk Guaranteed Bounds
AISTATS 2022
A Complete Criterion for Value of Information in Soluble Influence Diagrams
AAAI 2022
The King Is Naked: On the Notion of Robustness for Natural Language Processing
AAAI 2022
On Collective Robustness of Bagging Against Data Poisoning
ICML 2022
Consent as a Foundation for Responsible Autonomy
AAAI 2022
Gaussian Process Uniform Error Bounds with Unknown Hyperparameters for Safety-Critical Applications
ICML 2022
Joint Synthesis of Safety Certificate and Safe Control Policy Using Constrained Reinforcement Learning
L4DC 2022
Safe Reinforcement Learning with Chance-constrained Model Predictive Control
L4DC 2022
Safety-Aware Preference-Based Learning for Safety-Critical Control
L4DC 2022
Protecting Intellectual Property of Language Generation APIs with Lexical Watermark
AAAI 2022
Query-Efficient and Scalable Black-Box Adversarial Attacks on Discrete Sequential Data via Bayesian Optimization
ICML 2022
Intriguing Properties of Input-Dependent Randomized Smoothing
ICML 2022
FedInv: Byzantine-Robust Federated Learning by Inversing Local Model Updates
AAAI 2022
GalilAI: Out-of-Task Distribution Detection using Causal Active Experimentation for Safe Transfer RL
AISTATS 2022
Certified Robustness via Randomized Smoothing over Multiplicative Parameters of Input Transformations
IJCAI 2022
Automatic Reliability Testing For Cluster Management Controllers
OSDI 2022
Calibrated Learning to Defer with One-vs-All Classifiers
ICML 2022
<
1
…
96
97
98
…
119
>