Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Core AI
Artificial Intelligence
›
Core AI
›
AI Safety
2972 directly classified papers
Papers per year
2002: 1
2006: 1
2007: 1
2012: 4
2013: 1
2015: 5
2016: 1
2017: 13
2018: 40
2019: 91
2020: 111
2021: 181
2022: 204
2023: 333
2024: 642
2025: 1031
2026: 312
Papers
Exploiting Connections between Lipschitz Structures for Certifiably Robust Deep Equilibrium Models
NIPS 2023
RS-Del: Edit Distance Robustness Certificates for Sequence Classifiers via Randomized Deletion
NIPS 2023
Safety Verification of Decision-Tree Policies in Continuous Time
NIPS 2023
Provable Adversarial Robustness for Group Equivariant Tasks: Graphs, Point Clouds, Molecules, and More
NIPS 2023
Single Image Backdoor Inversion via Robust Smoothed Classifiers
CVPR 2023
Dynamic Generative Targeted Attacks With Pattern Injection
CVPR 2023
The Best Defense Is a Good Offense: Adversarial Augmentation Against Adversarial Attacks
CVPR 2023
Adversarial Robustness via Random Projection Filters
CVPR 2023
Feature Separation and Recalibration for Adversarial Robustness
CVPR 2023
Can't Steal? Cont-Steal! Contrastive Stealing Attacks Against Image Encoders
CVPR 2023
Progressive Backdoor Erasing via Connecting Backdoor and Adversarial Attacks
CVPR 2023
Black-Box Sparse Adversarial Attack via Multi-Objective Optimisation
CVPR 2023
Detecting Backdoors During the Inference Stage Based on Corruption Robustness Consistency
CVPR 2023
Backdoor Defense via Adaptively Splitting Poisoned Dataset
CVPR 2023
The Dark Side of Dynamic Routing Neural Networks: Towards Efficiency Backdoor Injection
CVPR 2023
Efficient Verification of Neural Networks Against LVM-Based Specifications
CVPR 2023
Randomized Adversarial Training via Taylor Expansion
CVPR 2023
Improving the Transferability of Adversarial Samples by Path-Augmented Method
CVPR 2023
Adversarially Robust Neural Architecture Search for Graph Neural Networks
CVPR 2023
Exploring the Relationship Between Architectural Design and Adversarially Robust Generalization
CVPR 2023
Spoq: Scaling Machine-Checkable Systems Verification in Coq
OSDI 2023
MEDIC: Remove Model Backdoors via Importance Driven Cloning
CVPR 2023
CUDA: Convolution-Based Unlearnable Datasets
CVPR 2023
Delving into the Adversarial Robustness of Federated Learning
AAAI 2023
Robust Safety under Stochastic Uncertainty with Discrete-Time Control Barrier Functions
RSS 2023
<
1
…
82
83
84
…
119
>