Artificial Intelligence › Core AI ›

Responsible AI

1991 directly classified papers

Papers per year

Papers

Probing Gender Bias in Multilingual LLMs: A Case Study of Stereotypes in Persian EMNLP 2025

Human-AI Moral Judgment Congruence on Real-World Scenarios: A Cross-Lingual Analysis EMNLP 2025

Debiasing Large Language Models in Thai Political Stance Detection via Counterfactual Calibration EMNLP 2025

Brown Like Chocolate: How Vision-Language Models Associate Skin Tone with Food Colors EMNLP 2025

Insights from a Disaggregated Analysis of Kinds of Biases in a Multicultural Dataset EMNLP 2025

On Effects of Steering Latent Representation for Large Language Model Unlearning AAAI 2025

CrAM: Credibility-Aware Attention Modification in LLMs for Combating Misinformation in RAG AAAI 2025

Task-Agnostic Language Model Watermarking via High Entropy Passthrough Layers AAAI 2025

Training on the Benchmark Is Not All You Need AAAI 2025

Can Watermarking Large Language Models Prevent Copyrighted Text Generation and Hide Training Data? AAAI 2025

Look Before You Leap: Enhance Attention and Vigilance Regarding Harmful Content with GuidelineLLM AAAI 2025

Utilize the Flow Before Stepping into the Same River Twice: Certainty Represented Knowledge Flow for Refusal-Aware Instruction Tuning AAAI 2025

Measuring Human and AI Values Based on Generative Psychometrics with Large Language Models AAAI 2025

Bridging the Knowledge Gap: Understanding User Expectations for Trustworthy LLM Standards AAAI 2025

MMJ-Bench: A Comprehensive Study on Jailbreak Attacks and Defenses for Vision Language Models AAAI 2025

RTP-LX: Can LLMs Evaluate Toxicity in Multilingual Scenarios? AAAI 2025

Multi-OphthaLingua: A Multilingual Benchmark for Assessing and Debiasing LLM Ophthalmological QA in LMICs AAAI 2025

Trustworthy and Practical AI for Healthcare: A Guided Deferral System with Large Language Models AAAI 2025

Certified Trustworthiness in the Era of Large Language Models AAAI 2025

Scalable, Sustainable, Generalizable, and Responsible AI for Public Sector AAAI 2025

To Err Is AI: A Case Study Informing LLM Flaw Reporting Practices AAAI 2025

We Are AI: Taking Control of Technology AAAI 2025

Fostering Epistemic Insights into AI Ethics through a Constructionist Pedagogy: An Interdisciplinary Approach to AI Literacy AAAI 2025

Investigating and Mitigating Undesirable Biases in Large Language Models AAAI 2025

Six-CD: Benchmarking Concept Removals for Text-to-image Diffusion Models CVPR 2025