Pradeep Varakantham
35 papers · 2013–2026 · 6 top CS/AI conferences
Achievements
Hot Topic Early Bird
Interdisciplinary Bridge
Keyword Pioneer
Taxonomy Completionist (10)
Conference Polyglot (6)
Topic Evolution
Keyword Champion (3)
Keyword Collector (150)
Conference Pioneer
Trend Setter
Century Club (34)
Unstoppable (7)
Prolific Year (5)
Conferences
AAAI (13)
IJCAI (9)
NIPS (5)
ICLR (4)
UAI (3)
NAACL (1)
Keywords
reinforcement learning (6)
curriculum learning (4)
cost constraint (3)
restless multi-armed bandit (3)
unsupervised environment design (3)
constraint satisfaction (3)
safe reinforcement learning (3)
sample efficiency (2)
deep reinforcement learning (2)
regret minimization (2)
resource allocation (2)
inverse reinforcement learning (2)
constrained reinforcement learning (2)
environment design (2)
combinatorial optimization (2)
policy learning (2)
constrained optimization (2)
hierarchical reinforcement learning (2)
Markov decision process (2)
conditional value at risk (2)
Papers
Optimizing Ride-Pooling Operations with Extended Pickup and Drop-Off Flexibility
AAAI 2026
Semantic Loss Guided Data Efficient Supervised Fine Tuning for Safe Responses in LLMs
ICLR 2025
Offline Safe Reinforcement Learning Using Trajectory Classification
AAAI 2025
Marginal Benefit Driven RL Teacher for Unsupervised Environment Design
AAAI 2025
On Minimizing Adversarial Counterfactual Error in Adversarial Reinforcement Learning
ICLR 2025
Bootstrapping Language Models with DPO Implicit Rewards
ICLR 2025
On Generalization Across Environments In Multi-Objective Reinforcement Learning
ICLR 2025
Unlocking the Planning Capabilities of Large Language Models with Maximum Diversity Fine-tuning
NAACL 2025
Improving Environment Novelty Quantification for Effective Unsupervised Environment Design
NIPS 2024
Unsupervised Training Sequence Design: Efficient and Generalizable Agent Training
AAAI 2024
Reward Penalties on Augmented States for Solving Richly Constrained RL Effectively
AAAI 2024
Handling Long and Richly Constrained Tasks through Constrained Hierarchical Reinforcement Learning
AAAI 2024
Safety through feedback in Constrained RL
NIPS 2024
SPRINQL: Sub-optimal Demonstrations driven Offline Imitation Learning
NIPS 2024
Imitate the Good and Avoid the Bad: An Incremental Approach to Safe Reinforcement Learning
AAAI 2024
Transferable Curricula through Difficulty Conditioned Generators
IJCAI 2023
Constrained Reinforcement Learning in Hard Exploration Problems
AAAI 2023
Future Aware Pricing and Matching for Sustainable On-Demand Ride Pooling
AAAI 2023
Generalization through Diversity: Improving Unsupervised Environment Design
IJCAI 2023
Generative Modelling of Stochastic Actions with Arbitrary Constraints in Reinforcement Learning
NIPS 2023
Facilitating Human-Wildlife Cohabitation through Conflict Prediction
AAAI 2022
Efficient resource allocation with fairness constraints in restless multi-armed bandits
UAI 2022
Field Study in Deploying Restless Multi-Armed Bandits: Assisting Non-profits in Improving Maternal and Child Health
AAAI 2022
CLAIM: curriculum learning policy for influence maximization in unknown social networks
UAI 2021
Learn to Intervene: An Adaptive Learning Policy for Restless Bandits in Application to Preventive Healthcare
IJCAI 2021
Solving Online Threat Screening Games using Constrained Action Space Reinforcement Learning
AAAI 2020
Neural Approximate Dynamic Programming for On-Demand Ride-Pooling
AAAI 2020
Correlated Learning for Aggregation Systems
UAI 2019
Mechanism Design for Strategic Project Scheduling
IJCAI 2017
Proactive and Reactive Coordination of Non-dedicated Agent Teams Operating in Uncertain Environments
IJCAI 2017
Scalable Greedy Algorithms for Task/Resource Constrained Multi-Agent Stochastic Planning
IJCAI 2016
Sequential Decision Making for Improving Efficiency in Urban Environments
IJCAI 2016
Robust Repositioning to Counter Unpredictable Demand in Bike Sharing Systems
IJCAI 2016
Probabilistic Inference Based Message-Passing for Resource Constrained DCOPs
IJCAI 2015
Regret based Robust Solutions for Uncertain Markov Decision Processes
NIPS 2013