Artificial Intelligence › Core AI ›

Responsible AI

1991 directly classified papers

Papers per year

Papers

Ten Words Only Still Help: Improving Black-Box AI-Generated Text Detection via Proxy-Guided Efficient Re-Sampling IJCAI 2024

Towards a Principle-based Framework for Assessing the Contribution of Formulas on the Conflicts of Knowledge Bases IJCAI 2024

Negative Human Rights as a Basis for Long-term AI Safety and Regulation (Abstract Reprint) IJCAI 2024

Ensuring Fairness Stability for Disentangling Social Inequality in Access to Education: the FAiRDAS General Method IJCAI 2024

Reassessing Evaluation Functions in Algorithmic Recourse: An Empirical Study from a Human-Centered Perspective IJCAI 2024

Just Because We Camp, Doesn't Mean We Should: The Ethics of Modelling Queer Voices. INTERSPEECH 2024

All Should Be Equal in the Eyes of LMs: Counterfactually Aware Fair Text Generation AAAI 2024

Mitigating Large Language Model Hallucinations via Autonomous Knowledge Graph-Based Retrofitting AAAI 2024

Detecting and Preventing Hallucinations in Large Vision Language Models AAAI 2024

Small Language Model Can Self-Correct AAAI 2024

Separate the Wheat from the Chaff: Model Deficiency Unlearning via Parameter-Efficient Module Operation AAAI 2024

Robust Evaluation Measures for Evaluating Social Biases in Masked Language Models AAAI 2024

AI Risk Profiles: A Standards Proposal for Pre-deployment AI Risk Disclosures AAAI 2024

Build Your Own Robot Friend: An Open-Source Learning Module for Accessible and Engaging AI Education AAAI 2024

Fostering Trustworthiness in Machine Learning Algorithms AAAI 2024

Skin-in-the-Game: Decision Making via Multi-Stakeholder Alignment in LLMs ACL 2024

How Johnny Can Persuade LLMs to Jailbreak Them: Rethinking Persuasion to Challenge AI Safety by Humanizing LLMs ACL 2024

SafetyBench: Evaluating the Safety of Large Language Models ACL 2024

Emulated Disalignment: Safety Alignment for Large Language Models May Backfire! ACL 2024

The Earth is Flat because...: Investigating LLMs’ Belief towards Misinformation via Persuasive Conversation ACL 2024

Can LLMs substitute SQL? Comparing Resource Utilization of Querying LLMs versus Traditional Relational Databases ACL 2024

Watermarking for Large Language Models ACL 2024

Benchmarking Cognitive Biases in Large Language Models as Evaluators ACL 2024

Realistic Evaluation of Toxicity in Large Language Models ACL 2024

UNIWIZ: A Unified Large Language Model Orchestrated Wizard for Safe Knowledge Grounded Conversations ACL 2024