Dongbin Zhao

16 papers · 2024–2026 · 9 conferences · across top CS/AI conferences

Achievements

+5 more ↓

🌈 Renaissance Researcher (5) 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (20) 🧭 Keyword Pioneer 🌍 Conference Polyglot (8)

🐝 Cross-Pollinator (8) 🏆 Keyword Champion (2) 🏆 Grand Slam 💎 Century Club (14) ⚡ Prolific Year (12)

Conferences

ICLR (4) ACL (2) CORL (2) ICML (2) NIPS (2) AAAI (1) EMNLP (1) ICCV (1) RSS (1)

Top co-authors

Qichao Zhang (7) Yuanheng Zhu (7) Yuqian Fu (4) Haoran Li (4) Jiajun Chai (4) Jingbo Sun (3) Yupeng Zheng (2) Runyu Lu (2) Sicheng Li (2) Songjun Tu (2)

Keywords

offline reinforcement learning (3) decision transformer (2) world model (2) multi-agent reinforcement learning (1) zero-shot learning (1) sample efficiency (1) reinforcement learning (1) task generalization (1) policy optimization (1) robotic manipulation (1) self-supervised learning (1) chain-of-thought reasoning (1) autonomous driving (1) trajectory prediction (1) visual reinforcement learning (1) trajectory modeling (1) reward modeling (1) out-of-distribution generalization (1) behavior cloning (1) sequence modeling (1)

Papers

Spec-o3: A Tool-Augmented Vision-Language Agent for Rare Celestial Object Candidate Vetting via Automated Spectral Inspection ACL 2026 Beyond Query Memorization: Large Language Model Routing with Query Decomposition and Historical Matching ACL 2026 ReasonPlan: Unified Scene Prediction and Decision Reasoning for Closed-loop Autonomous Driving CORL 2025 In-Dataset Trajectory Return Regularization for Offline Preference-based Reinforcement Learning AAAI 2025 RLAE: Reinforcement Learning-Assisted Ensemble for LLMs EMNLP 2025 World4Drive: End-to-End Autonomous Driving via Intention-aware Physical Latent World Model ICCV 2025 Divergence-Regularized Discounted Aggregation: Equilibrium Finding in Multiplayer Partially Observable Stochastic Games ICLR 2025 Unsupervised Zero-Shot Reinforcement Learning via Dual-Value Forward-Backward Representation ICLR 2025 INS: Interaction-aware Synthesis to Enhance Offline Multi-agent Reinforcement Learning ICLR 2025 Constrained Exploitability Descent: An Offline Reinforcement Learning Method for Finding Mixed-Strategy Nash Equilibrium ICML 2025 DipLLM: Fine-Tuning LLM for Strategic Decision-making in Diplomacy ICML 2025 ConRFT: A Reinforced Fine-tuning Method for VLA Models via Consistency Policy RSS 2025 Empowering LLM Agents with Zero-Shot Optimal Decision-Making through Q-learning ICLR 2025 FetchBot: Learning Generalizable Object Fetching in Cluttered Scenes via Zero-Shot Sim2Real CORL 2025 Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World Model Disentanglement NIPS 2024 Generalizing Consistency Policy to Visual RL with Prioritized Proximal Experience Regularization NIPS 2024