Satinder Singh
48 papers · 2000–2025 · 11 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+16 more ↓ Show less ↑
π§ Keyword Pioneer π£ Hot Topic Early Bird π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (16) π Conference Polyglot (11)
πΊοΈ
Taxonomy Completionist
(16)
π§
Keyword Pioneer
π£
Hot Topic Early Bird
π
Keyword Trendsetter Combo
(6)
π
Grand Slam
π₯
Mega-Team
(27)
π±
Topic Pioneer
π¬
Deep Specialist
(13)
π
Keyword Champion
(2)
π
Conference Pioneer
β
The Questioner
(2)
β‘
Prolific Year
(6)
π
Century Club
(48)
π
Trend Setter
π₯
Unstoppable
(13)
ποΈ
Keyword Collector
(187)
Conferences
NIPS (10)
ICML (9)
ICLR (7)
IJCAI (7)
AAAI (6)
AISTATS (4)
ACL (1)
ALT (1)
COLING (1)
EACL (1)
EMNLP (1)
Top co-authors
Research topics
Keywords
reinforcement learning
(10)
multi-agent system
(4)
neural network
(3)
markov decision process
(3)
model-based reinforcement learning
(3)
intrinsic reward
(3)
reward function
(3)
spectral learning
(3)
reward function learning
(2)
model-free reinforcement learning
(2)
predictive state representation
(2)
query optimization
(2)
deep reinforcement learning
(2)
text classification
(2)
attention mechanism
(2)
recurrent neural network
(2)
singular value decomposition
(2)
supervised learning
(2)
active learning
(2)
low-rank approximation
(2)
Papers
Mastering Board Games by External and Internal Planning with Language Models
ICML 2025
Genie: Generative Interactive Environments
ICML 2024
Human-Timescale Adaptation in an Open-Ended Task Space
ICML 2023
Discovering Evolution Strategies via Meta-Black-Box Optimization
ICLR 2023
ReLOAD: Reinforcement Learning with Optimistic Ascent-Descent for Last-Iterate Convergence in Constrained MDPs
ICML 2023
Discovering Policies with DOMiNO: Diversity Optimization Maintaining Near Optimality
ICLR 2023
Composing Task Knowledge With Modular Successor Feature Approximators
ICLR 2023
In-context Reinforcement Learning with Algorithm Distillation
ICLR 2023
Adaptive Pairwise Weights for Temporal Credit Assignment
AAAI 2022
On the Expressivity of Markov Reward (Extended Abstract)
IJCAI 2022
Bootstrapped Meta-Learning
ICLR 2022
Discovering a set of policies for the worst case reward
ICLR 2021
Efficient Querying for Cooperative Probabilistic Commitments
AAAI 2021
Reinforcement Learning of Implicit and Explicit Control Flow Instructions
ICML 2021
Reinforcement Learning for Sparse-Reward Object-Interaction Tasks in a First-person Simulated 3D Environment
IJCAI 2021
How Should an Agent Practice?
AAAI 2020
Querying to Find a Safe Policy under Uncertain Safety Constraints in Markov Decision Processes
AAAI 2020
Behaviour Suite for Reinforcement Learning
ICLR 2020
Modeling Probabilistic Commitments for Maintenance Is Inherently Harder than for Achievement
AAAI 2020
What Can Learned Intrinsic Rewards Capture?
ICML 2020
Sample Complexity of Reinforcement Learning using Linearly Combined Model Ensembles
AISTATS 2020
Learning to Communicate and Solve Visual Blocks-World Tasks
AAAI 2019
No-Press Diplomacy: Modeling Multi-Agent Gameplay
NIPS 2019
Hindsight Credit Assignment
NIPS 2019
Discovery of Useful Questions as Auxiliary Tasks
NIPS 2019
Learning End-to-End Goal-Oriented Dialog with Multiple Answers
EMNLP 2018
Completing State Representations using Spectral Learning
NIPS 2018
On Learning Intrinsic Rewards for Policy Gradient Methods
NIPS 2018
Markov Decision Processes with Continuous Side Information
ALT 2018
Self-Imitation Learning
ICML 2018
Minimax-Regret Querying on Side Effects for Safe Optimality in Factored Markov Decision Processes
IJCAI 2018
Predicting Counselor Behaviors in Motivational Interviewing Encounters
EACL 2017
Zero-Shot Task Generalization with Multi-Task Deep Reinforcement Learning
ICML 2017
Understanding and Predicting Empathic Behavior in Counseling Therapy
ACL 2017
Value Prediction Network
NIPS 2017
Repeated Inverse Reinforcement Learning
NIPS 2017
Deep Learning for Reward Design to Improve Monte Carlo Tree Search in ATARI Games
IJCAI 2016
The Dependence of Effective Planning Horizon on Model Accuracy
IJCAI 2016
Commitment Semantics for Sequential Decision Making under Reward Uncertainty
IJCAI 2016
On Structural Properties of MDPs that Bound Loss Due to Shallow Planning
IJCAI 2016
Low-Rank Spectral Learning with Weighted Loss Functions
AISTATS 2015
Abstraction Selection in Model-based Reinforcement Learning
ICML 2015
Action-Conditional Video Prediction using Deep Networks in Atari Games
NIPS 2015
Deep Learning for Real-Time Atari Game Play Using Offline Monte-Carlo Tree Search Planning
NIPS 2014
Low-Rank Spectral Learning
AISTATS 2014
Characterizing EVOI-Sufficient k-Response Query Sets in Decision Problems
AISTATS 2014
Reward Mapping for Transfer in Long-Lived Agents
NIPS 2013
Automatic Optimization of Dialogue Management
COLING 2000