Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Core AI
Artificial Intelligence
›
Core AI
›
Responsible AI
1991 directly classified papers
Papers per year
2011: 1
2016: 1
2017: 7
2018: 10
2019: 22
2020: 51
2021: 91
2022: 145
2023: 207
2024: 526
2025: 760
2026: 170
Papers
Probing Gender Bias in Multilingual LLMs: A Case Study of Stereotypes in Persian
EMNLP 2025
Human-AI Moral Judgment Congruence on Real-World Scenarios: A Cross-Lingual Analysis
EMNLP 2025
Debiasing Large Language Models in Thai Political Stance Detection via Counterfactual Calibration
EMNLP 2025
Brown Like Chocolate: How Vision-Language Models Associate Skin Tone with Food Colors
EMNLP 2025
Insights from a Disaggregated Analysis of Kinds of Biases in a Multicultural Dataset
EMNLP 2025
On Effects of Steering Latent Representation for Large Language Model Unlearning
AAAI 2025
CrAM: Credibility-Aware Attention Modification in LLMs for Combating Misinformation in RAG
AAAI 2025
Task-Agnostic Language Model Watermarking via High Entropy Passthrough Layers
AAAI 2025
Training on the Benchmark Is Not All You Need
AAAI 2025
Can Watermarking Large Language Models Prevent Copyrighted Text Generation and Hide Training Data?
AAAI 2025
Look Before You Leap: Enhance Attention and Vigilance Regarding Harmful Content with GuidelineLLM
AAAI 2025
Utilize the Flow Before Stepping into the Same River Twice: Certainty Represented Knowledge Flow for Refusal-Aware Instruction Tuning
AAAI 2025
Measuring Human and AI Values Based on Generative Psychometrics with Large Language Models
AAAI 2025
Bridging the Knowledge Gap: Understanding User Expectations for Trustworthy LLM Standards
AAAI 2025
MMJ-Bench: A Comprehensive Study on Jailbreak Attacks and Defenses for Vision Language Models
AAAI 2025
RTP-LX: Can LLMs Evaluate Toxicity in Multilingual Scenarios?
AAAI 2025
Multi-OphthaLingua: A Multilingual Benchmark for Assessing and Debiasing LLM Ophthalmological QA in LMICs
AAAI 2025
Trustworthy and Practical AI for Healthcare: A Guided Deferral System with Large Language Models
AAAI 2025
Certified Trustworthiness in the Era of Large Language Models
AAAI 2025
Scalable, Sustainable, Generalizable, and Responsible AI for Public Sector
AAAI 2025
To Err Is AI: A Case Study Informing LLM Flaw Reporting Practices
AAAI 2025
We Are AI: Taking Control of Technology
AAAI 2025
Fostering Epistemic Insights into AI Ethics through a Constructionist Pedagogy: An Interdisciplinary Approach to AI Literacy
AAAI 2025
Investigating and Mitigating Undesirable Biases in Large Language Models
AAAI 2025
Six-CD: Benchmarking Concept Removals for Text-to-image Diffusion Models
CVPR 2025
<
1
…
17
18
19
…
80
>