Yash Chandak
23 papers · 2019–2024 · 7 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+8 more ↓ Show less ↑
π Academic Marathon (5) π§ Keyword Pioneer π Interdisciplinary Bridge π Conference Polyglot (7) π Cross-Pollinator (7)
π
Cross-Pollinator
(7)
πΊοΈ
Taxonomy Completionist
(43)
π
Grand Slam
π
Keyword Champion
(2)
π₯
Unstoppable
(6)
ποΈ
Keyword Collector
(96)
π
Century Club
(23)
β‘
Prolific Year
(5)
Conferences
NIPS (8)
ICML (6)
AAAI (4)
AISTATS (2)
EMNLP (1)
ICLR (1)
JMLR (1)
Top co-authors
Keywords
reinforcement learning
(9)
off-policy evaluation
(5)
sequential decision making
(3)
policy gradient
(3)
sequential decision-making
(2)
action space
(2)
contextual bandit
(2)
markov decision process
(2)
non-stationary mdp
(2)
policy optimization
(2)
representation learning
(2)
doubly robust estimator
(2)
seldonian algorithm
(2)
counterfactual estimation
(2)
in-context learning
(1)
bayesian inference
(1)
catastrophic forgetting
(1)
domain adaptation
(1)
dynamic regret
(1)
evaluation methodology
(1)
Papers
Contrastive Policy Gradient: Aligning LLMs on sequence-level scores in a supervised-friendly fashion
EMNLP 2024
A/B testing under Interference with Partial Network Information
AISTATS 2024
Data-Efficient Policy Evaluation Through Behavior Policy Search
JMLR 2024
OPERA: Automatic Offline Policy Evaluation with Re-weighted Aggregates of Multiple Estimators
NIPS 2024
Adaptive Instrument Design for Indirect Experiments
ICLR 2024
Supervised Pretraining Can Learn In-Context Reinforcement Learning
NIPS 2023
Behavior Alignment via Reward Function Optimization
NIPS 2023
Asymptotically Unbiased Off-Policy Policy Evaluation when Reusing Old Data in Nonstationary Environments
AISTATS 2023
Representations and Exploration for Deep Reinforcement Learning using Singular Value Decomposition
ICML 2023
Understanding Self-Predictive Learning for Reinforcement Learning
ICML 2023
On Optimizing Interventions in Shared Autonomy
AAAI 2022
Factored DRO: Factored Distributionally Robust Policies for Contextual Bandits
NIPS 2022
Off-Policy Evaluation for Action-Dependent Non-stationary Environments
NIPS 2022
SOPE: Spectrum of Off-Policy Estimators
NIPS 2021
High Confidence Generalization for Reinforcement Learning
ICML 2021
High-Confidence Off-Policy (or Counterfactual) Variance Estimation
AAAI 2021
Universal Off-Policy Evaluation
NIPS 2021
Lifelong Learning with a Changing Action Set
AAAI 2020
Evaluating the Performance of Reinforcement Learning Algorithms
ICML 2020
Reinforcement Learning When All Actions Are Not Always Available
AAAI 2020
Optimizing for the Future in Non-Stationary MDPs
ICML 2020
Towards Safe Policy Improvement for Non-Stationary MDPs
NIPS 2020
Learning Action Representations for Reinforcement Learning
ICML 2019