Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Core AI
Artificial Intelligence
›
Core AI
›
Responsible AI
1991 directly classified papers
Papers per year
2011: 1
2016: 1
2017: 7
2018: 10
2019: 22
2020: 51
2021: 91
2022: 145
2023: 207
2024: 526
2025: 760
2026: 170
Papers
Ten Words Only Still Help: Improving Black-Box AI-Generated Text Detection via Proxy-Guided Efficient Re-Sampling
IJCAI 2024
Towards a Principle-based Framework for Assessing the Contribution of Formulas on the Conflicts of Knowledge Bases
IJCAI 2024
Negative Human Rights as a Basis for Long-term AI Safety and Regulation (Abstract Reprint)
IJCAI 2024
Ensuring Fairness Stability for Disentangling Social Inequality in Access to Education: the FAiRDAS General Method
IJCAI 2024
Reassessing Evaluation Functions in Algorithmic Recourse: An Empirical Study from a Human-Centered Perspective
IJCAI 2024
Just Because We Camp, Doesn't Mean We Should: The Ethics of Modelling Queer Voices.
INTERSPEECH 2024
All Should Be Equal in the Eyes of LMs: Counterfactually Aware Fair Text Generation
AAAI 2024
Mitigating Large Language Model Hallucinations via Autonomous Knowledge Graph-Based Retrofitting
AAAI 2024
Detecting and Preventing Hallucinations in Large Vision Language Models
AAAI 2024
Small Language Model Can Self-Correct
AAAI 2024
Separate the Wheat from the Chaff: Model Deficiency Unlearning via Parameter-Efficient Module Operation
AAAI 2024
Robust Evaluation Measures for Evaluating Social Biases in Masked Language Models
AAAI 2024
AI Risk Profiles: A Standards Proposal for Pre-deployment AI Risk Disclosures
AAAI 2024
Build Your Own Robot Friend: An Open-Source Learning Module for Accessible and Engaging AI Education
AAAI 2024
Fostering Trustworthiness in Machine Learning Algorithms
AAAI 2024
Skin-in-the-Game: Decision Making via Multi-Stakeholder Alignment in LLMs
ACL 2024
How Johnny Can Persuade LLMs to Jailbreak Them: Rethinking Persuasion to Challenge AI Safety by Humanizing LLMs
ACL 2024
SafetyBench: Evaluating the Safety of Large Language Models
ACL 2024
Emulated Disalignment: Safety Alignment for Large Language Models May Backfire!
ACL 2024
The Earth is Flat because...: Investigating LLMs’ Belief towards Misinformation via Persuasive Conversation
ACL 2024
Can LLMs substitute SQL? Comparing Resource Utilization of Querying LLMs versus Traditional Relational Databases
ACL 2024
Watermarking for Large Language Models
ACL 2024
Benchmarking Cognitive Biases in Large Language Models as Evaluators
ACL 2024
Realistic Evaluation of Toxicity in Large Language Models
ACL 2024
UNIWIZ: A Unified Large Language Model Orchestrated Wizard for Safe Knowledge Grounded Conversations
ACL 2024
<
1
…
48
49
50
…
80
>