Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Core AI
Artificial Intelligence
›
Core AI
›
Fairness
1139 directly classified papers
Papers per year
2013: 1
2017: 7
2018: 15
2019: 33
2020: 64
2021: 96
2022: 166
2023: 167
2024: 221
2025: 364
2026: 5
Papers
Joint Vision-Language Social Bias Removal for CLIP
CVPR 2025
Mind the Gesture: Evaluating AI Sensitivity to Culturally Offensive Non-Verbal Gestures
ACL 2025
Gender Bias in Instruction-Guided Speech Synthesis Models
NAACL 2025
Watching the Watchers: Exposing Gender Disparities in Machine Translation Quality Estimation
ACL 2025
Mitigating Social Bias in Large Language Models: A Multi-Objective Approach Within a Multi-Agent Framework
AAAI 2025
GeNRe: A French Gender-Neutral Rewriting System Using Collective Nouns
ACL 2025
LLMs are Biased Teachers: Evaluating LLM Bias in Personalized Education
NAACL 2025
PRISM: A Framework for Producing Interpretable Political Bias Embeddings with Political-Aware Cross-Encoder
ACL 2025
Bias Analysis and Mitigation through Protected Attribute Detection and Regard Classification
EMNLP 2025
LLMs Trust Humans More, That’s a Problem! Unveiling and Mitigating the Authority Bias in Retrieval-Augmented Generation
ACL 2025
Tackling Social Bias against the Poor: a Dataset and a Taxonomy on Aporophobia
NAACL 2025
Only a Little to the Left: A Theory-grounded Measure of Political Bias in Large Language Models
ACL 2025
Native Design Bias: Studying the Impact of English Nativeness on Language Model Performance
IJCNLP 2025
LOTUS: A Leaderboard for Detailed Image Captioning from Quality to Societal Bias and User Preferences
ACL 2025
“Women do not have heart attacks!” Gender Biases in Automatically Generated Clinical Cases in French
NAACL 2025
STATE ToxiCN: A Benchmark for Span-level Target-Aware Toxicity Extraction in Chinese Hate Speech Detection
ACL 2025
DaKultur: Evaluating the Cultural Awareness of Language Models for Danish with Native Speakers
NAACL 2025
MDIT-Bench: Evaluating the Dual-Implicit Toxicity in Large Multimodal Models
ACL 2025
Rejected Dialects: Biases Against African American Language in Reward Models
NAACL 2025
Blinded by Context: Unveiling the Halo Effect of MLLM in AI Hiring
ACL 2025
Fairer Analysis and Demographically Balanced Face Generation for Fairer Face Verification
WACV 2025
Large Language Models Still Exhibit Bias in Long Text
ACL 2025
Aligning to What? Limits to RLHF Based Alignment
NAACL 2025
7 Points to Tsinghua but 10 Points to ? Assessing Large Language Models in Agentic Multilingual National Bias
ACL 2025
No for Some, Yes for Others: Persona Prompts and Other Sources of False Refusal in Language Models
EMNLP 2025
<
1
…
13
14
15
…
46
>