Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Core AI
Artificial Intelligence
›
Core AI
›
Responsible AI
1991 directly classified papers
Papers per year
2011: 1
2016: 1
2017: 7
2018: 10
2019: 22
2020: 51
2021: 91
2022: 145
2023: 207
2024: 526
2025: 760
2026: 170
Papers
ABCFair: an Adaptable Benchmark approach for Comparing Fairness Methods
NIPS 2024
A Unified Debiasing Approach for Vision-Language Models across Modalities and Tasks
NIPS 2024
ProgressGym: Alignment with a Millennium of Moral Progress
NIPS 2024
Cross-Care: Assessing the Healthcare Implications of Pre-training Data on Language Model Bias
NIPS 2024
Dataset and Lessons Learned from the 2024 SaTML LLM Capture-the-Flag Competition
NIPS 2024
Auditing Local Explanations is Hard
NIPS 2024
LexEval: A Comprehensive Chinese Legal Benchmark for Evaluating Large Language Models
NIPS 2024
WAGLE: Strategic Weight Attribution for Effective and Modular Unlearning in Large Language Models
NIPS 2024
Direct Unlearning Optimization for Robust and Safe Text-to-Image Models
NIPS 2024
What Makes and Breaks Safety Fine-tuning? A Mechanistic Study
NIPS 2024
Quantifying and Analyzing Entity-Level Memorization in Large Language Models
AAAI 2024
Fairness-Aware Structured Pruning in Transformers
AAAI 2024
Artificial Intelligence in the CS2023 Undergraduate Computer Science Curriculum: Rationale and Challenges
AAAI 2024
Co-designing AI Education Curriculum with Cross-Disciplinary High School Teachers
AAAI 2024
AI, Ethics, and Education: The Pioneering Path of Sidekick Academy
AAAI 2024
Automated Assessment of Fidelity and Interpretability: An Evaluation Framework for Large Language Models’ Explanations (Student Abstract)
AAAI 2024
Biases Mitigation and Expressiveness Preservation in Language Models: A Comprehensive Pipeline (Student Abstract)
AAAI 2024
Evaluating AI Red Teaming’s Readiness to Address Environmental Harms: A Thematic Analysis of LLM Discourse
AAAI 2024
Transforming Healthcare: A Comprehensive Approach to Mitigating Bias and Fostering Empathy through AI-Driven Augmented Reality
AAAI 2024
LLMGuard: Guarding against Unsafe LLM Behavior
AAAI 2024
AI Evaluation Authorities: A Case Study Mapping Model Audits to Persistent Standards
AAAI 2024
A Framework for Approaching AI Education in Educator Preparation Programs
AAAI 2024
Supporting Upper Elementary Students in Learning AI Concepts with Story-Driven Game-Based Learning
AAAI 2024
Thesis Summary: Operationalizing User-Inclusive Transparency in Artificial Intelligence Systems
AAAI 2024
Data Efficient Paradigms for Personalized Assessment of Black-Box Taskable AI Systems
AAAI 2024
<
1
…
54
55
56
…
80
>