Dongbin Zhao
16 papers · 2024–2026 · 9 top CS/AI conferences
Achievements
Renaissance Researcher (5)
Interdisciplinary Bridge
Taxonomy Completionist (20)
Keyword Pioneer
Conference Polyglot (8)
Cross-Pollinator (8)
Keyword Champion (2)
Grand Slam
Century Club (14)
Prolific Year (12)
Conferences
ICLR (4)
ACL (2)
CORL (2)
ICML (2)
NIPS (2)
AAAI (1)
EMNLP (1)
ICCV (1)
RSS (1)
Keywords
offline reinforcement learning (3)
decision transformer (2)
world model (2)
multi-agent reinforcement learning (1)
zero-shot learning (1)
sample efficiency (1)
reinforcement learning (1)
task generalization (1)
policy optimization (1)
robotic manipulation (1)
self-supervised learning (1)
chain-of-thought reasoning (1)
autonomous driving (1)
trajectory prediction (1)
visual reinforcement learning (1)
trajectory modeling (1)
reward modeling (1)
out-of-distribution generalization (1)
behavior cloning (1)
sequence modeling (1)
Papers
Spec-o3: A Tool-Augmented Vision-Language Agent for Rare Celestial Object Candidate Vetting via Automated Spectral Inspection
ACL 2026
Beyond Query Memorization: Large Language Model Routing with Query Decomposition and Historical Matching
ACL 2026
ReasonPlan: Unified Scene Prediction and Decision Reasoning for Closed-loop Autonomous Driving
CORL 2025
In-Dataset Trajectory Return Regularization for Offline Preference-based Reinforcement Learning
AAAI 2025
RLAE: Reinforcement Learning-Assisted Ensemble for LLMs
EMNLP 2025
World4Drive: End-to-End Autonomous Driving via Intention-aware Physical Latent World Model
ICCV 2025
Divergence-Regularized Discounted Aggregation: Equilibrium Finding in Multiplayer Partially Observable Stochastic Games
ICLR 2025
Unsupervised Zero-Shot Reinforcement Learning via Dual-Value Forward-Backward Representation
ICLR 2025
INS: Interaction-aware Synthesis to Enhance Offline Multi-agent Reinforcement Learning
ICLR 2025
Constrained Exploitability Descent: An Offline Reinforcement Learning Method for Finding Mixed-Strategy Nash Equilibrium
ICML 2025
DipLLM: Fine-Tuning LLM for Strategic Decision-making in Diplomacy
ICML 2025
ConRFT: A Reinforced Fine-tuning Method for VLA Models via Consistency Policy
RSS 2025
Empowering LLM Agents with Zero-Shot Optimal Decision-Making through Q-learning
ICLR 2025
FetchBot: Learning Generalizable Object Fetching in Cluttered Scenes via Zero-Shot Sim2Real
CORL 2025
Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World Model Disentanglement
NIPS 2024
Generalizing Consistency Policy to Visual RL with Prioritized Proximal Experience Regularization
NIPS 2024