Research Explorer
Agent Systems
3885 directly classified papers
Papers per year
2002: 1
2003: 1
2006: 12
2007: 5
2008: 11
2009: 8
2010: 17
2011: 14
2012: 21
2013: 17
2014: 17
2015: 16
2016: 32
2017: 102
2018: 123
2019: 198
2020: 210
2021: 269
2022: 291
2023: 415
2024: 663
2025: 1077
2026: 365
Papers
When Your AIs Deceive You: Challenges of Partial Observability in Reinforcement Learning from Human Feedback
NIPS 2024
Disentangling Linear Quadratic Control with Untrusted ML Predictions
NIPS 2024
Synatra: Turning Indirect Knowledge into Direct Demonstrations for Digital Agents at Scale
NIPS 2024
WebVLN: Vision-and-Language Navigation on Websites
AAAI 2024
RoleAgent: Building, Interacting, and Benchmarking High-quality Role-Playing Agents from Scripts
NIPS 2024
Integrated Systems for Computational Scientific Discovery
AAAI 2024
Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks
NIPS 2024
Structurally Guided Task Decomposition in Spatial Navigation Tasks (Student Abstract)
AAAI 2024
Enabling Adaptive Agent Training in Open-Ended Simulators by Targeting Diversity
NIPS 2024
Few-shot Algorithms for Consistent Neural Decoding (FALCON) Benchmark
NIPS 2024
Disentangled Unsupervised Skill Discovery for Efficient Hierarchical Reinforcement Learning
NIPS 2024
Compositional Automata Embeddings for Goal-Conditioned Reinforcement Learning
NIPS 2024
Learning to Assist Humans without Inferring Rewards
NIPS 2024
iVideoGPT: Interactive VideoGPTs are Scalable World Models
NIPS 2024
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search
NIPS 2024
MACM: Utilizing a Multi-Agent System for Condition Mining in Solving Complex Mathematical Problems
NIPS 2024
SPO: Sequential Monte Carlo Policy Optimisation
NIPS 2024
OmniSafe: An Infrastructure for Accelerating Safe Reinforcement Learning Research
JMLR 2024
Model-based Policy Optimization under Approximate Bayesian Inference
AISTATS 2024
Catastrophic Goodhart: regularizing RLHF with KL divergence does not mitigate heavy-tailed reward misspecification
NIPS 2024
Decision Mamba: A Multi-Grained State Space Model with Self-Evolution Regularization for Offline RL
NIPS 2024
Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement Learning
NIPS 2024
Policy Mirror Descent with Lookahead
NIPS 2024
Deep Equilibrium Algorithmic Reasoning
NIPS 2024
Everyday Object Meets Vision-and-Language Navigation Agent via Backdoor
NIPS 2024