conftrace_

Artificial Intelligence › Core AI ›

Agent Systems

3,885 papers

Papers per year

Papers

Learning Action-Effect Dynamics for Hypothetical Vision-Language Reasoning Task EMNLP 2022

A POMDP Dialogue Policy with 3-way Grounding and Adaptive Sensing for Learning through Communication EMNLP 2022

Oh My Mistake!: Toward Realistic Dialogue State Tracking including Turnback Utterances EMNLP 2022

A Generative User Simulator with GPT-based Architecture and Goal State Tracking for Reinforced Multi-Domain Dialog Systems EMNLP 2022

Imitation Learning by Estimating Expertise of Demonstrators ICML 2022

Safe Learning in Tree-Form Sequential Decision Making: Handling Hard and Soft Constraints ICML 2022

Lagrangian Method for Q-Function Learning (with Applications to Machine Translation) ICML 2022

Adaptive Model Design for Markov Decision Process ICML 2022

Offline RL Policies Should Be Trained to be Adaptive ICML 2022

Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents ICML 2022

A data-driven approach for learning to control computers ICML 2022

Learning-based Optimisation of Particle Accelerators Under Partial Observability Without Real-World Training ICML 2022

Lyapunov Density Models: Constraining Distribution Shift in Learning-Based Control ICML 2022

Tell me why! Explanations support learning relational and causal structure ICML 2022

How to Stay Curious while avoiding Noisy TVs using Aleatoric Uncertainty Estimation ICML 2022

A Simple Reward-free Approach to Constrained Reinforcement Learning ICML 2022

Discovering Generalizable Spatial Goal Representations via Graph-based Active Reward Learning ICML 2022

Recurrent Model-Free RL Can Be a Strong Baseline for Many POMDPs ICML 2022

Evolving Curricula with Regret-Based Environment Design ICML 2022

A Minimax Learning Approach to Off-Policy Evaluation in Confounded Partially Observable Markov Decision Processes ICML 2022

Communicating via Markov Decision Processes ICML 2022

Influence-Augmented Local Simulators: a Scalable Solution for Fast Deep RL in Large Networked Systems ICML 2022

Reward-Free RL is No Harder Than Reward-Aware RL in Linear Markov Decision Processes ICML 2022

Langevin Monte Carlo for Contextual Bandits ICML 2022

Expression might be enough: representing pressure and demand for reinforcement learning based traffic signal control ICML 2022