conftrace
_
Papers
Trends
Conferences
Explore
Authors
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
← Core AI
Artificial Intelligence
›
Core AI
›
Agent Systems
3,885 papers
Papers per year
2002: 1
2003: 1
2006: 12
2007: 5
2008: 11
2009: 8
2010: 17
2011: 14
2012: 21
2013: 17
2014: 17
2015: 16
2016: 32
2017: 102
2018: 123
2019: 198
2020: 210
2021: 269
2022: 291
2023: 415
2024: 663
2025: 1077
2026: 365
Papers
Learning Action-Effect Dynamics for Hypothetical Vision-Language Reasoning Task
EMNLP 2022
A POMDP Dialogue Policy with 3-way Grounding and Adaptive Sensing for Learning through Communication
EMNLP 2022
Oh My Mistake!: Toward Realistic Dialogue State Tracking including Turnback Utterances
EMNLP 2022
A Generative User Simulator with GPT-based Architecture and Goal State Tracking for Reinforced Multi-Domain Dialog Systems
EMNLP 2022
Imitation Learning by Estimating Expertise of Demonstrators
ICML 2022
Safe Learning in Tree-Form Sequential Decision Making: Handling Hard and Soft Constraints
ICML 2022
Lagrangian Method for Q-Function Learning (with Applications to Machine Translation)
ICML 2022
Adaptive Model Design for Markov Decision Process
ICML 2022
Offline RL Policies Should Be Trained to be Adaptive
ICML 2022
Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents
ICML 2022
A data-driven approach for learning to control computers
ICML 2022
Learning-based Optimisation of Particle Accelerators Under Partial Observability Without Real-World Training
ICML 2022
Lyapunov Density Models: Constraining Distribution Shift in Learning-Based Control
ICML 2022
Tell me why! Explanations support learning relational and causal structure
ICML 2022
How to Stay Curious while avoiding Noisy TVs using Aleatoric Uncertainty Estimation
ICML 2022
A Simple Reward-free Approach to Constrained Reinforcement Learning
ICML 2022
Discovering Generalizable Spatial Goal Representations via Graph-based Active Reward Learning
ICML 2022
Recurrent Model-Free RL Can Be a Strong Baseline for Many POMDPs
ICML 2022
Evolving Curricula with Regret-Based Environment Design
ICML 2022
A Minimax Learning Approach to Off-Policy Evaluation in Confounded Partially Observable Markov Decision Processes
ICML 2022
Communicating via Markov Decision Processes
ICML 2022
Influence-Augmented Local Simulators: a Scalable Solution for Fast Deep RL in Large Networked Systems
ICML 2022
Reward-Free RL is No Harder Than Reward-Aware RL in Linear Markov Decision Processes
ICML 2022
Langevin Monte Carlo for Contextual Bandits
ICML 2022
Expression might be enough: representing pressure and demand for reinforcement learning based traffic signal control
ICML 2022
<
1
…
108
109
110
…
156
>