Artificial Intelligence › Core AI ›

Fairness

1139 directly classified papers

Papers per year

Papers

Joint Vision-Language Social Bias Removal for CLIP CVPR 2025

Mind the Gesture: Evaluating AI Sensitivity to Culturally Offensive Non-Verbal Gestures ACL 2025

Gender Bias in Instruction-Guided Speech Synthesis Models NAACL 2025

Watching the Watchers: Exposing Gender Disparities in Machine Translation Quality Estimation ACL 2025

Mitigating Social Bias in Large Language Models: A Multi-Objective Approach Within a Multi-Agent Framework AAAI 2025

GeNRe: A French Gender-Neutral Rewriting System Using Collective Nouns ACL 2025

LLMs are Biased Teachers: Evaluating LLM Bias in Personalized Education NAACL 2025

PRISM: A Framework for Producing Interpretable Political Bias Embeddings with Political-Aware Cross-Encoder ACL 2025

Bias Analysis and Mitigation through Protected Attribute Detection and Regard Classification EMNLP 2025

LLMs Trust Humans More, That’s a Problem! Unveiling and Mitigating the Authority Bias in Retrieval-Augmented Generation ACL 2025

Tackling Social Bias against the Poor: a Dataset and a Taxonomy on Aporophobia NAACL 2025

Only a Little to the Left: A Theory-grounded Measure of Political Bias in Large Language Models ACL 2025

Native Design Bias: Studying the Impact of English Nativeness on Language Model Performance IJCNLP 2025

LOTUS: A Leaderboard for Detailed Image Captioning from Quality to Societal Bias and User Preferences ACL 2025

“Women do not have heart attacks!” Gender Biases in Automatically Generated Clinical Cases in French NAACL 2025

STATE ToxiCN: A Benchmark for Span-level Target-Aware Toxicity Extraction in Chinese Hate Speech Detection ACL 2025

DaKultur: Evaluating the Cultural Awareness of Language Models for Danish with Native Speakers NAACL 2025

MDIT-Bench: Evaluating the Dual-Implicit Toxicity in Large Multimodal Models ACL 2025

Rejected Dialects: Biases Against African American Language in Reward Models NAACL 2025

Blinded by Context: Unveiling the Halo Effect of MLLM in AI Hiring ACL 2025

Fairer Analysis and Demographically Balanced Face Generation for Fairer Face Verification WACV 2025

Large Language Models Still Exhibit Bias in Long Text ACL 2025

Aligning to What? Limits to RLHF Based Alignment NAACL 2025

7 Points to Tsinghua but 10 Points to ? Assessing Large Language Models in Agentic Multilingual National Bias ACL 2025

No for Some, Yes for Others: Persona Prompts and Other Sources of False Refusal in Language Models EMNLP 2025