Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Application Areas
Machine Learning
›
Application Areas
›
Fairness
3337 directly classified papers
Papers per year
2011: 1
2013: 3
2014: 2
2016: 6
2017: 30
2018: 65
2019: 182
2020: 239
2021: 373
2022: 456
2023: 533
2024: 648
2025: 644
2026: 155
Papers
An Empirical Study of LLM-as-a-Judge for LLM Evaluation: Fine-tuned Judge Model is not a General Substitute for GPT-4
ACL 2025
CSEval: Towards Automated, Multi-Dimensional, and Reference-Free Counterspeech Evaluation using Auto-Calibrated LLMs
NAACL 2025
Self-Pluralising Culture Alignment for Large Language Models
NAACL 2025
Is this Chatbot Trying to Sell Something? Towards Oversight of Chatbot Sales Tactics
EMNLP 2025
Brown Like Chocolate: How Vision-Language Models Associate Skin Tone with Food Colors
EMNLP 2025
No for Some, Yes for Others: Persona Prompts and Other Sources of False Refusal in Language Models
EMNLP 2025
Insights from a Disaggregated Analysis of Kinds of Biases in a Multicultural Dataset
EMNLP 2025
Assessing Modality Bias in Video Question Answering Benchmarks with Multimodal Large Language Models
AAAI 2025
Are you sure? Measuring models bias in content moderation through uncertainty
EMNLP 2025
Controlling Large Language Models Through Concept Activation Vectors
AAAI 2025
Bias Unveiled: Investigating Social Bias in LLM-Generated Code
AAAI 2025
ScamNet: Toward Explainable Large Language Model-Based Fraudulent Shopping Website Detection
AAAI 2025
Towards Robust ESG Analysis Against Greenwashing Risks: Aspect-Action Analysis with Cross-Category Generalization
ACL 2025
Investigating and Mitigating Undesirable Biases in Large Language Models
AAAI 2025
Efficient Counterexample-Guided Fairness Verification and Repair of Neural Networks Using Satisfiability Modulo Convex Programming
IJCAI 2025
Exploring and Mitigating Implicit Bias in Large Language Models: A Cross-Domain Evaluation Framework
AAAI 2025
Debiasing Static Embeddings for Hate Speech Detection
ACL 2025
PolBiX: Detecting LLMs’ Political Bias in Fact-Checking through X-phemisms
EMNLP 2025
My LLM might Mimic AAE - But When Should It?
NAACL 2025
Mitigating Biases of Large Language Models in Stance Detection with Counterfactual Augmented Calibration
NAACL 2025
Inference-Time Selective Debiasing to Enhance Fairness in Text Classification Models
NAACL 2025
RobustX: Robust Counterfactual Explanations Made Easy
IJCAI 2025
Right Answer, Wrong Score: Uncovering the Inconsistencies of LLM Evaluation in Multiple-Choice Question Answering
ACL 2025
Why AI Is WEIRD and Shouldn't Be This Way: Towards AI for Everyone, with Everyone, by Everyone
AAAI 2025
Calibration as a Proxy for Fairness and Efficiency in a Perspectivist Ensemble Approach to Irony Detection
EMNLP 2025
<
1
…
30
31
32
…
134
>