Research Explorer
Responsible AI
1,991 directly classified papers
Papers per year
2011: 1
2016: 1
2017: 7
2018: 10
2019: 22
2020: 51
2021: 91
2022: 145
2023: 207
2024: 526
2025: 760
2026: 170
Papers
Memorization: A Close Look at Books
ACL 2025
Auditing and Enforcing Conditional Fairness via Optimal Transport
AAAI 2025
Equal Merit Does Not Imply Equality: Discrimination at Equilibrium in a Hiring Market with Symmetric Agents
AAAI 2025
Navigating Towards Fairness with Data Selection
AAAI 2025
Understanding PII Leakage in Large Language Models: A Systematic Survey
IJCAI 2025
Exploring Gender Bias in Large Language Models: An In-depth Dive into the German Language
ACL 2025
GenWriter: Reducing Gender Cues in Biographies through Text Rewriting
ACL 2025
Language Models Resist Alignment: Evidence From Data Compression
ACL 2025
From Measurement to Mitigation: Exploring the Transferability of Debiasing Approaches to Gender Bias in Maltese Language Models
ACL 2025
Surface Fairness, Deep Bias: A Comparative Study of Bias in Language Models
ACL 2025
VITAL: A New Dataset for Benchmarking Pluralistic Alignment in Healthcare
ACL 2025
DECASTE: Unveiling Caste Stereotypes in Large Language Models Through Multi-Dimensional Bias Analysis
IJCAI 2025
Brown Like Chocolate: How Vision-Language Models Associate Skin Tone with Food Colors
EMNLP 2025
CrAM: Credibility-Aware Attention Modification in LLMs for Combating Misinformation in RAG
AAAI 2025
Colombian Waitresses y Jueces canadienses: Gender and Country Biases in Occupation Recommendations from LLMs
ACL 2025
GG-BBQ: German Gender Bias Benchmark for Question Answering
ACL 2025
A Statistical and Multi-Perspective Revisiting of the Membership Inference Attack in Large Language Models
ACL 2025
Wanted: Personalised Bias Warnings for Gender Bias in Language Models
ACL 2025
Mind the Gap: Gender-based Differences in Occupational Embeddings
ACL 2025
CulFiT: A Fine-grained Cultural-aware LLM Training Paradigm via Multilingual Critique Data Synthesis
ACL 2025
What is Behind Homelessness Bias? Using LLMs and NLP to Mitigate Homelessness by Acting on Social Stigma
IJCAI 2025
Gender Bias in Nepali-English Machine Translation: A Comparison of LLMs and Existing MT Systems
ACL 2025
Intersectional Bias in Japanese Large Language Models from a Contextualized Perspective
ACL 2025
LED-Merging: Mitigating Safety-Utility Conflicts in Model Merging with Location-Election-Disjoint
ACL 2025
JBBQ: Japanese Bias Benchmark for Analyzing Social Biases in Large Language Models
ACL 2025