AI Safety
3,026 papers
Papers per year
1
1
1
4
1
5
1
13
40
91
111
181
204
333
642
1031
366
'15
'20
'25
Papers
Adaptive Experiment Design with Synthetic Controls
AISTATS 2024
Optimal Zero-Shot Detector for Multi-Armed Attacks
AISTATS 2024
Backdoor NLP Models via AI-Generated Text
COLING 2024
How Susceptible Are LLMs to Logical Fallacies?
COLING 2024