Fairness

3337 directly classified papers

Papers

An Empirical Study of LLM-as-a-Judge for LLM Evaluation: Fine-tuned Judge Model is not a General Substitute for GPT-4 ACL 2025

CSEval: Towards Automated, Multi-Dimensional, and Reference-Free Counterspeech Evaluation using Auto-Calibrated LLMs NAACL 2025

Self-Pluralising Culture Alignment for Large Language Models NAACL 2025

Is this Chatbot Trying to Sell Something? Towards Oversight of Chatbot Sales Tactics EMNLP 2025

Brown Like Chocolate: How Vision-Language Models Associate Skin Tone with Food Colors EMNLP 2025

No for Some, Yes for Others: Persona Prompts and Other Sources of False Refusal in Language Models EMNLP 2025

Insights from a Disaggregated Analysis of Kinds of Biases in a Multicultural Dataset EMNLP 2025

Assessing Modality Bias in Video Question Answering Benchmarks with Multimodal Large Language Models AAAI 2025

Are you sure? Measuring models bias in content moderation through uncertainty EMNLP 2025

Controlling Large Language Models Through Concept Activation Vectors AAAI 2025

Bias Unveiled: Investigating Social Bias in LLM-Generated Code AAAI 2025

ScamNet: Toward Explainable Large Language Model-Based Fraudulent Shopping Website Detection AAAI 2025

Towards Robust ESG Analysis Against Greenwashing Risks: Aspect-Action Analysis with Cross-Category Generalization ACL 2025

Investigating and Mitigating Undesirable Biases in Large Language Models AAAI 2025

Efficient Counterexample-Guided Fairness Verification and Repair of Neural Networks Using Satisfiability Modulo Convex Programming IJCAI 2025

Exploring and Mitigating Implicit Bias in Large Language Models: A Cross-Domain Evaluation Framework AAAI 2025

Debiasing Static Embeddings for Hate Speech Detection ACL 2025

PolBiX: Detecting LLMs’ Political Bias in Fact-Checking through X-phemisms EMNLP 2025

My LLM might Mimic AAE - But When Should It? NAACL 2025

Mitigating Biases of Large Language Models in Stance Detection with Counterfactual Augmented Calibration NAACL 2025

Inference-Time Selective Debiasing to Enhance Fairness in Text Classification Models NAACL 2025

RobustX: Robust Counterfactual Explanations Made Easy IJCAI 2025

Right Answer, Wrong Score: Uncovering the Inconsistencies of LLM Evaluation in Multiple-Choice Question Answering ACL 2025

Why AI Is WEIRD and Shouldn't Be This Way: Towards AI for Everyone, with Everyone, by Everyone AAAI 2025

Calibration as a Proxy for Fairness and Efficiency in a Perspectivist Ensemble Approach to Irony Detection EMNLP 2025