Moninder Singh
10 papers · 2019–2026 · 5 conferences · across top CS/AI conferences
Achievements
Interdisciplinary Bridge
Keyword Pioneer
Conference Polyglot (5)
Academic Marathon (6)
Renaissance Researcher (5)
Cross-Pollinator (8)
Taxonomy Completionist (24)
Hot Topic Early Bird
Mega-Team (20)
Topic Evolution
Keyword Collector (50)
Conferences
ACL (4)
AAAI (3)
IJCAI (1)
JMLR (1)
NIPS (1)
Top co-authors
Keywords
large language model (3)
model interpretability (2)
explainable ai (2)
text classification (1)
reward function (1)
model safety (1)
interpretable machine learning (1)
inverse reinforcement learning (1)
evaluation metric (1)
constraint satisfaction (1)
decision tree (1)
data summarization (1)
knowledge graph (1)
regret bound (1)
contextual bandit (1)
feature attribution (1)
value alignment (1)
model ranking (1)
model evaluation (1)
anomaly detection (1)
Papers
AI Steerability 360: A Toolkit for Steering Large Language Models
ACL 2026
Conceptual Diagnostics for Knowledge Graphs and Large Language Models
ACL 2025
Ranking Large Language Models without Ground Truth
ACL 2024
SocialStigmaQA: A Benchmark to Uncover Stigma Amplification in Generative Language Models
AAAI 2024
Your fairness may vary: Pretrained language model fairness in toxic text classification
ACL 2022
On the Safety of Interpretable Machine Learning: A Maximum Deviation Approach
NIPS 2022
AI Explainability 360: Impact and Design
AAAI 2022
Anomaly Attribution with Likelihood Compensation
AAAI 2021
AI Explainability 360: An Extensible Toolkit for Understanding Data and Machine Learning Models
JMLR 2020
Teaching AI Agents Ethical Values Using Reinforcement Learning and Policy Orchestration
IJCAI 2019