Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Core AI
Artificial Intelligence
›
Core AI
›
Fairness
1139 directly classified papers
Papers per year
2013: 1
2017: 7
2018: 15
2019: 33
2020: 64
2021: 96
2022: 166
2023: 167
2024: 221
2025: 364
2026: 5
Papers
A Comprehensive Evaluation of Cognitive Biases in LLMs
NAACL 2025
A Bias-Free Training Paradigm for More General AI-generated Image Detection
CVPR 2025
T2ISafety: Benchmark for Assessing Fairness, Toxicity, and Privacy in Image Generation
CVPR 2025
Let Samples Speak: Mitigating Spurious Correlation by Exploiting the Clusterness of Samples
CVPR 2025
Dissecting and Mitigating Diffusion Bias via Mechanistic Interpretability
CVPR 2025
Multi-Group Proportional Representations for Text-to-Image Models
CVPR 2025
Attention IoU: Examining Biases in CelebA using Attention Maps
CVPR 2025
Plug-and-Play Interpretable Responsible Text-to-Image Generation via Dual-Space Multi-facet Concept Control
CVPR 2025
Mitigating Bias in Machine Learning: A Comprehensive Review and Novel Approaches
AAAI 2025
Evaluating AI for Finance: Is AI Credible at Assessing Investment Risk Appetite?
EMNLP 2025
Privacy, Utility and Fairness: Navigating Trade-offs in Differentially Private Machine Learning
AAAI 2025
Robust Bias Detection in MLMs and its Application to Human Trait Ratings
NAACL 2025
All You Need Is S P A C E: When Jailbreaking Meets Bias Audit and Reveals What Lies Beneath the Guardrails (Student Abstract)
AAAI 2025
CURE: Controlled Unlearning for Robust Embeddings — Mitigating Conceptual Shortcuts in Pre-Trained Language Models
EMNLP 2025
Unmasking Style Sensitivity: A Causal Analysis of Bias Evaluation Instability in Large Language Models
ACL 2025
How Inclusively do LMs Perceive Social and Moral Norms?
NAACL 2025
Value Portrait: Assessing Language Models’ Values through Psychometrically and Ecologically Valid Items
ACL 2025
Fine-tuning LLMs with Cross-Attention-based Weight Decay for Bias Mitigation
EMNLP 2025
Which Demographics do LLMs Default to During Annotation?
ACL 2025
Echoes of Discord: Forecasting Hater Reactions to Counterspeech
NAACL 2025
AUTALIC: A Dataset for Anti-AUTistic Ableist Language In Context
ACL 2025
GeNRe: A French Gender-Neutral Rewriting System Using Collective Nouns
ACL 2025
The Invisible Hand: Unveiling Provider Bias in Large Language Models for Code Generation
ACL 2025
Causally Testing Gender Bias in LLMs: A Case Study on Occupational Bias
NAACL 2025
Your Mileage May Vary: How Empathy and Demographics Shape Human Preferences in LLM Responses
EMNLP 2025
<
1
…
12
13
14
…
46
>