Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Keywords
bias detection
419 papers
Explore in graph
Co-occurring keywords
large language model
(12755)
gender bia
(433)
text classification
(6776)
language model
(4573)
fairness evaluation
(112)
social bia
(206)
sentiment analysis
(2079)
natural language processing
(2027)
bias mitigation
(492)
responsible ai
(181)
Papers
Detecting and Mitigating LGBTQIA+ Bias in Large Norwegian Language Models
ACL 2024
”So, are you a different person today?” Analyzing Bias in Questions during Parole Hearings
EMNLP 2024
ChatGPT Doesn’t Trust Chargers Fans: Guardrail Sensitivity in Context
EMNLP 2024
Intersectional Stereotypes in Large Language Models: Dataset and Analysis
EMNLP 2023
Trade-Offs Between Fairness and Privacy in Language Modeling
ACL 2023
Race, Gender, and Age Biases in Biomedical Masked Language Models
ACL 2023
Automated Ableism: An Exploration of Explicit Disability Biases in Sentiment and Toxicity Analysis Models
ACL 2023
Can a Prediction’s Rank Offer a More Accurate Quantification of Bias? A Case Study Measuring Sexism in Debiased Language Models
IJCNLP 2023
Pre-trained Speech Processing Models Contain Human-Like Biases that Propagate to Speech Emotion Recognition
EMNLP 2023
LLMs – the Good, the Bad or the Indispensable?: A Use Case on Legal Statute Prediction and Legal Judgment Prediction on Indian Court Cases
EMNLP 2023
Language-Agnostic Bias Detection in Language Models with Bias Probing
EMNLP 2023
Fair Enough: Standardizing Evaluation and Model Selection for Fairness Research in NLP
EACL 2023
Detecting intersectionality in NER models: A data-driven approach
EACL 2023
HRS-Bench: Holistic, Reliable and Scalable Benchmark for Text-to-Image Models
ICCV 2023
Interpreting Unfairness in Graph Neural Networks via Training Node Attribution
AAAI 2023
Inseq: An Interpretability Toolkit for Sequence Generation Models
ACL 2023
Moral Mimicry: Large Language Models Produce Moral Rationalizations Tailored to Political Identity
ACL 2023
Towards Stable Natural Language Understanding via Information Entropy Guided Debiasing
ACL 2023
Everything you need to know about Multilingual LLMs: Towards fair, performant and reliable models for languages of the world
ACL 2023
A Multi-dimensional study on Bias in Vision-Language models
ACL 2023
Characterization of Stigmatizing Language in Medical Records
ACL 2023
Unraveling Downstream Gender Bias from Large Language Models: A Study on AI Educational Writing Assistance
EMNLP 2023
Entity-Based Evaluation of Political Bias in Automatic Summarization
EMNLP 2023
Assessing Political Inclination of Bangla Language Models
EMNLP 2023
Fast Model DeBias with Machine Unlearning
NIPS 2023
<
1
…
9
10
11
…
17
>