conftrace_

Moninder Singh

10 papers · 2019–2026 · 5 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+6 more ↓

🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (5) 🏃 Academic Marathon (6) 🌈 Renaissance Researcher (5)

🐝 Cross-Pollinator (8) 🗺️ Taxonomy Completionist (24) 🐣 Hot Topic Early Bird 👥 Mega-Team (20) 🧬 Topic Evolution 🗃️ Keyword Collector (50)

Conferences

ACL (4) AAAI (3) IJCAI (1) JMLR (1) NIPS (1)

Top co-authors

Amit Dhurandhar (5) Dennis Wei (5) Karthikeyan Natesan Ramamurthy (4) Kush R. Varshney (4) Q. Vera Liao (2) Pin-Yu Chen (2) Rahul Nair (2) Vijay Arya (2) Ramya Raghavendra (2) Prasanna Sattigeri (2)

Keywords

large language model (3) model interpretability (2) explainable ai (2) text classification (1) reward function (1) model safety (1) interpretable machine learning (1) inverse reinforcement learning (1) evaluation metric (1) constraint satisfaction (1) decision tree (1) data summarization (1) knowledge graph (1) regret bound (1) contextual bandit (1) feature attribution (1) value alignment (1) model ranking (1) model evaluation (1) anomaly detection (1)

Papers

AI Steerability 360: A Toolkit for Steering Large Language Models ACL 2026 Conceptual Diagnostics for Knowledge Graphs and Large Language Models ACL 2025 Ranking Large Language Models without Ground Truth ACL 2024 SocialStigmaQA: A Benchmark to Uncover Stigma Amplification in Generative Language Models AAAI 2024 Your fairness may vary: Pretrained language model fairness in toxic text classification ACL 2022 On the Safety of Interpretable Machine Learning: A Maximum Deviation Approach NIPS 2022 AI Explainability 360: Impact and Design AAAI 2022 Anomaly Attribution with Likelihood Compensation AAAI 2021 AI Explainability 360: An Extensible Toolkit for Understanding Data and Machine Learning Models JMLR 2020 Teaching AI Agents Ethical Values Using Reinforcement Learning and Policy Orchestration IJCAI 2019