Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Core AI
Artificial Intelligence
›
Core AI
›
Responsible AI
1991 directly classified papers
Papers per year
2011: 1
2016: 1
2017: 7
2018: 10
2019: 22
2020: 51
2021: 91
2022: 145
2023: 207
2024: 526
2025: 760
2026: 170
Papers
WikiBias as an Extrapolation Corpus for Bias Detection
EMNLP 2024
HOAXPEDIA: A Unified Wikipedia Hoax Articles Dataset
EMNLP 2024
The Rise of AI-Generated Content in Wikipedia
EMNLP 2024
Improving Adversarial Robustness in Vision-Language Models with Architecture and Prompt Design
EMNLP 2024
Comparative Study of Explainability Methods for Legal Outcome Prediction
EMNLP 2024
Revisiting Who’s Harry Potter: Towards Targeted Unlearning from a Causal Intervention Perspective
EMNLP 2024
From Insights to Actions: The Impact of Interpretability and Analysis Research on NLP
EMNLP 2024
Understanding “Democratization” in NLP and ML Research
EMNLP 2024
PANDA: Persona Attributes Navigation for Detecting and Alleviating Overuse Problem in Large Language Models
EMNLP 2024
WalledEval: A Comprehensive Safety Evaluation Toolkit for Large Language Models
EMNLP 2024
Prompt Leakage effect and mitigation strategies for multi-turn LLM Applications
EMNLP 2024
Rater Cohesion and Quality from a Vicarious Perspective
EMNLP 2024
Implicit Personalization in Language Models: A Systematic Study
EMNLP 2024
Pedagogical Alignment of Large Language Models
EMNLP 2024
LLM generated responses to mitigate the impact of hate speech
EMNLP 2024
Exploiting Cultural Biases via Homoglyphs inText-to-Image Synthesis (Abstract Reprint)
IJCAI 2024
When Fairness Meets Privacy: Exploring Privacy Threats in Fair Binary Classifiers via Membership Inference Attacks
IJCAI 2024
Towards Responsible Speech Processing
INTERSPEECH 2024
Beyond Binary: Towards Embracing Complexities in Cyberbullying Detection and Intervention - a Position Paper
COLING 2024
HealthFC: Verifying Health Claims with Evidence-Based Medical Fact-Checking
COLING 2024
Addressing Bias and Hallucination in Large Language Models
COLING 2024
A Legal Framework for Natural Language Model Training in Portugal
COLING 2024
Intellectual property rights at the training, development and generation stages of Large Language Models
COLING 2024
Enhancing Ethical Explanations of Large Language Models through Iterative Symbolic Refinement
EACL 2024
Leak, Cheat, Repeat: Data Contamination and Evaluation Malpractices in Closed-Source LLMs
EACL 2024
<
1
…
56
57
58
…
80
>