bias detection

419 papers

Co-occurring keywords

large language model (12755), gender bias (433), text classification (6776), language model (4573), fairness evaluation (112), social bias (206), sentiment analysis (2079), natural language processing (2027), bias mitigation (492), responsible ai (181)

Papers

Detecting and Mitigating LGBTQIA+ Bias in Large Norwegian Language Models ACL 2024

“So, are you a different person today?” Analyzing Bias in Questions during Parole Hearings EMNLP 2024

ChatGPT Doesn’t Trust Chargers Fans: Guardrail Sensitivity in Context EMNLP 2024

Intersectional Stereotypes in Large Language Models: Dataset and Analysis EMNLP 2023

Trade-Offs Between Fairness and Privacy in Language Modeling ACL 2023

Race, Gender, and Age Biases in Biomedical Masked Language Models ACL 2023

Automated Ableism: An Exploration of Explicit Disability Biases in Sentiment and Toxicity Analysis Models ACL 2023

Can a Prediction’s Rank Offer a More Accurate Quantification of Bias? A Case Study Measuring Sexism in Debiased Language Models IJCNLP 2023

Pre-trained Speech Processing Models Contain Human-Like Biases that Propagate to Speech Emotion Recognition EMNLP 2023

LLMs – the Good, the Bad or the Indispensable?: A Use Case on Legal Statute Prediction and Legal Judgment Prediction on Indian Court Cases EMNLP 2023

Language-Agnostic Bias Detection in Language Models with Bias Probing EMNLP 2023

Fair Enough: Standardizing Evaluation and Model Selection for Fairness Research in NLP EACL 2023

Detecting intersectionality in NER models: A data-driven approach EACL 2023

HRS-Bench: Holistic, Reliable and Scalable Benchmark for Text-to-Image Models ICCV 2023

Interpreting Unfairness in Graph Neural Networks via Training Node Attribution AAAI 2023

Inseq: An Interpretability Toolkit for Sequence Generation Models ACL 2023

Moral Mimicry: Large Language Models Produce Moral Rationalizations Tailored to Political Identity ACL 2023

Towards Stable Natural Language Understanding via Information Entropy Guided Debiasing ACL 2023

Everything you need to know about Multilingual LLMs: Towards fair, performant and reliable models for languages of the world ACL 2023

A Multi-dimensional study on Bias in Vision-Language models ACL 2023

Characterization of Stigmatizing Language in Medical Records ACL 2023

Unraveling Downstream Gender Bias from Large Language Models: A Study on AI Educational Writing Assistance EMNLP 2023

Entity-Based Evaluation of Political Bias in Automatic Summarization EMNLP 2023

Assessing Political Inclination of Bangla Language Models EMNLP 2023

Fast Model DeBias with Machine Unlearning NeurIPS 2023