Artificial Intelligence › Core AI ›

Responsible AI

1991 directly classified papers

Papers per year

Papers

ABCFair: an Adaptable Benchmark approach for Comparing Fairness Methods NIPS 2024

A Unified Debiasing Approach for Vision-Language Models across Modalities and Tasks NIPS 2024

ProgressGym: Alignment with a Millennium of Moral Progress NIPS 2024

Cross-Care: Assessing the Healthcare Implications of Pre-training Data on Language Model Bias NIPS 2024

Dataset and Lessons Learned from the 2024 SaTML LLM Capture-the-Flag Competition NIPS 2024

Auditing Local Explanations is Hard NIPS 2024

LexEval: A Comprehensive Chinese Legal Benchmark for Evaluating Large Language Models NIPS 2024

WAGLE: Strategic Weight Attribution for Effective and Modular Unlearning in Large Language Models NIPS 2024

Direct Unlearning Optimization for Robust and Safe Text-to-Image Models NIPS 2024

What Makes and Breaks Safety Fine-tuning? A Mechanistic Study NIPS 2024

Quantifying and Analyzing Entity-Level Memorization in Large Language Models AAAI 2024

Fairness-Aware Structured Pruning in Transformers AAAI 2024

Artificial Intelligence in the CS2023 Undergraduate Computer Science Curriculum: Rationale and Challenges AAAI 2024

Co-designing AI Education Curriculum with Cross-Disciplinary High School Teachers AAAI 2024

AI, Ethics, and Education: The Pioneering Path of Sidekick Academy AAAI 2024

Automated Assessment of Fidelity and Interpretability: An Evaluation Framework for Large Language Models’ Explanations (Student Abstract) AAAI 2024

Biases Mitigation and Expressiveness Preservation in Language Models: A Comprehensive Pipeline (Student Abstract) AAAI 2024

Evaluating AI Red Teaming’s Readiness to Address Environmental Harms: A Thematic Analysis of LLM Discourse AAAI 2024

Transforming Healthcare: A Comprehensive Approach to Mitigating Bias and Fostering Empathy through AI-Driven Augmented Reality AAAI 2024

LLMGuard: Guarding against Unsafe LLM Behavior AAAI 2024

AI Evaluation Authorities: A Case Study Mapping Model Audits to Persistent Standards AAAI 2024

A Framework for Approaching AI Education in Educator Preparation Programs AAAI 2024

Supporting Upper Elementary Students in Learning AI Concepts with Story-Driven Game-Based Learning AAAI 2024

Thesis Summary: Operationalizing User-Inclusive Transparency in Artificial Intelligence Systems AAAI 2024

Data Efficient Paradigms for Personalized Assessment of Black-Box Taskable AI Systems AAAI 2024