Artificial Intelligence › Core AI ›

Fairness

1139 directly classified papers

Papers per year

Papers

A Comprehensive Evaluation of Cognitive Biases in LLMs NAACL 2025

A Bias-Free Training Paradigm for More General AI-generated Image Detection CVPR 2025

T2ISafety: Benchmark for Assessing Fairness, Toxicity, and Privacy in Image Generation CVPR 2025

Let Samples Speak: Mitigating Spurious Correlation by Exploiting the Clusterness of Samples CVPR 2025

Dissecting and Mitigating Diffusion Bias via Mechanistic Interpretability CVPR 2025

Multi-Group Proportional Representations for Text-to-Image Models CVPR 2025

Attention IoU: Examining Biases in CelebA using Attention Maps CVPR 2025

Plug-and-Play Interpretable Responsible Text-to-Image Generation via Dual-Space Multi-facet Concept Control CVPR 2025

Mitigating Bias in Machine Learning: A Comprehensive Review and Novel Approaches AAAI 2025

Evaluating AI for Finance: Is AI Credible at Assessing Investment Risk Appetite? EMNLP 2025

Privacy, Utility and Fairness: Navigating Trade-offs in Differentially Private Machine Learning AAAI 2025

Robust Bias Detection in MLMs and its Application to Human Trait Ratings NAACL 2025

All You Need Is S P A C E: When Jailbreaking Meets Bias Audit and Reveals What Lies Beneath the Guardrails (Student Abstract) AAAI 2025

CURE: Controlled Unlearning for Robust Embeddings — Mitigating Conceptual Shortcuts in Pre-Trained Language Models EMNLP 2025

Unmasking Style Sensitivity: A Causal Analysis of Bias Evaluation Instability in Large Language Models ACL 2025

How Inclusively do LMs Perceive Social and Moral Norms? NAACL 2025

Value Portrait: Assessing Language Models’ Values through Psychometrically and Ecologically Valid Items ACL 2025

Fine-tuning LLMs with Cross-Attention-based Weight Decay for Bias Mitigation EMNLP 2025

Which Demographics do LLMs Default to During Annotation? ACL 2025

Echoes of Discord: Forecasting Hater Reactions to Counterspeech NAACL 2025

AUTALIC: A Dataset for Anti-AUTistic Ableist Language In Context ACL 2025

GeNRe: A French Gender-Neutral Rewriting System Using Collective Nouns ACL 2025

The Invisible Hand: Unveiling Provider Bias in Large Language Models for Code Generation ACL 2025

Causally Testing Gender Bias in LLMs: A Case Study on Occupational Bias NAACL 2025

Your Mileage May Vary: How Empathy and Demographics Shape Human Preferences in LLM Responses EMNLP 2025