Reinforcement Learning › Methods ›

Deep RL

3861 directly classified papers

Papers per year

Papers

Partially Observable Hierarchical Reinforcement Learning with AI Planning (Student Abstract) AAAI 2024

Spectral-Risk Safe Reinforcement Learning with Convergence Guarantees NIPS 2024

Regret Bounds for Risk-sensitive Reinforcement Learning with Lipschitz Dynamic Risk Measures AISTATS 2024

StepCoder: Improving Code Generation with Reinforcement Learning from Compiler Feedback ACL 2024

A Visual Active Search Framework for Geospatial Exploration WACV 2024

Scale Optimization Using Evolutionary Reinforcement Learning for Object Detection on Drone Imagery AAAI 2024

Ethics in Action: Training Reinforcement Learning Agents for Moral Decision-making In Text-based Adventure Games AISTATS 2024

Tuning Large Multimodal Models for Videos using Reinforcement Learning from AI Feedback ACL 2024

LogoStyleFool: Vitiating Video Recognition Systems via Logo Style Transfer AAAI 2024

What Effects the Generalization in Visual Reinforcement Learning: Policy Consistency with Truncated Return Prediction AAAI 2024

MIM-Reasoner: Learning with Theoretical Guarantees for Multiplex Influence Maximization AISTATS 2024

Virtual Action Actor-Critic Framework for Exploration (Student Abstract) AAAI 2024

Not All Tasks Are Equally Difficult: Multi-Task Deep Reinforcement Learning with Dynamic Depth Routing AAAI 2024

Natural Language-based State Representation in Deep Reinforcement Learning NAACL 2024

Reflect-RL: Two-Player Online RL Fine-Tuning for LMs ACL 2024

Bit_numeval at SemEval-2024 Task 7: Enhance Numerical Sensitivity and Reasoning Completeness for Quantitative Understanding NAACL 2024

Planning Like Human: A Dual-process Framework for Dialogue Planning ACL 2024

Cluster-Based Sampling in Hindsight Experience Replay for Robotic Tasks (Student Abstract) AAAI 2024

Increasing the Difficulty of Automatically Generated Questions via Reinforcement Learning with Synthetic Preference for Cost-Effective Cultural Heritage Dataset Generation EMNLP 2024

BadRL: Sparse Targeted Backdoor Attack against Reinforcement Learning AAAI 2024

Diffusion-ES: Gradient-free Planning with Diffusion for Autonomous and Instruction-guided Driving CVPR 2024

RLfOLD: Reinforcement Learning from Online Demonstrations in Urban Autonomous Driving AAAI 2024

On learning history-based policies for controlling Markov decision processes AISTATS 2024

Abstract and Explore: A Novel Behavioral Metric with Cyclic Dynamics in Reinforcement Learning AAAI 2024

Amortized Active Causal Induction with Deep Reinforcement Learning NIPS 2024