Yash Chandak

23 papers · 2019–2024 · 7 conferences · across top CS/AI conferences

Achievements

+8 more ↓

🏃 Academic Marathon (5) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (7) 🐝 Cross-Pollinator (7)

🐝 Cross-Pollinator (7) 🗺️ Taxonomy Completionist (43) 🏆 Grand Slam 🏆 Keyword Champion (2) 🔥 Unstoppable (6) 🗃️ Keyword Collector (96) 💎 Century Club (23) ⚡ Prolific Year (5)

Conferences

NIPS (8) ICML (6) AAAI (4) AISTATS (2) EMNLP (1) ICLR (1) JMLR (1)

Top co-authors

Philip Thomas (7) Philip S. Thomas (7) Emma Brunskill (6) Georgios Theocharous (6) Shiv Shankar (5) Scott Jordan (4) Martha White (4) Scott Niekum (3) Zhaohan Daniel Guo (2) Rémi Munos (2)

Keywords

reinforcement learning (9) off-policy evaluation (5) sequential decision making (3) policy gradient (3) sequential decision-making (2) action space (2) contextual bandit (2) markov decision process (2) non-stationary mdp (2) policy optimization (2) representation learning (2) doubly robust estimator (2) seldonian algorithm (2) counterfactual estimation (2) in-context learning (1) bayesian inference (1) catastrophic forgetting (1) domain adaptation (1) dynamic regret (1) evaluation methodology (1)

Papers

Contrastive Policy Gradient: Aligning LLMs on sequence-level scores in a supervised-friendly fashion EMNLP 2024 A/B testing under Interference with Partial Network Information AISTATS 2024 Data-Efficient Policy Evaluation Through Behavior Policy Search JMLR 2024 OPERA: Automatic Offline Policy Evaluation with Re-weighted Aggregates of Multiple Estimators NIPS 2024 Adaptive Instrument Design for Indirect Experiments ICLR 2024 Supervised Pretraining Can Learn In-Context Reinforcement Learning NIPS 2023 Behavior Alignment via Reward Function Optimization NIPS 2023 Asymptotically Unbiased Off-Policy Policy Evaluation when Reusing Old Data in Nonstationary Environments AISTATS 2023 Representations and Exploration for Deep Reinforcement Learning using Singular Value Decomposition ICML 2023 Understanding Self-Predictive Learning for Reinforcement Learning ICML 2023 On Optimizing Interventions in Shared Autonomy AAAI 2022 Factored DRO: Factored Distributionally Robust Policies for Contextual Bandits NIPS 2022 Off-Policy Evaluation for Action-Dependent Non-stationary Environments NIPS 2022 SOPE: Spectrum of Off-Policy Estimators NIPS 2021 High Confidence Generalization for Reinforcement Learning ICML 2021 High-Confidence Off-Policy (or Counterfactual) Variance Estimation AAAI 2021 Universal Off-Policy Evaluation NIPS 2021 Lifelong Learning with a Changing Action Set AAAI 2020 Evaluating the Performance of Reinforcement Learning Algorithms ICML 2020 Reinforcement Learning When All Actions Are Not Always Available AAAI 2020 Optimizing for the Future in Non-Stationary MDPs ICML 2020 Towards Safe Policy Improvement for Non-Stationary MDPs NIPS 2020 Learning Action Representations for Reinforcement Learning ICML 2019