Yuke Zhu

78 papers · 2015–2026 · 13 conferences · across top CS/AI conferences

Achievements

+18 more ↓

🗺️ Taxonomy Completionist (17) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (5) 🐣 Hot Topic Early Bird

🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (17) 🐣 Hot Topic Early Bird 🌟 Keyword Trendsetter Combo (4) 🏠 Conference Loyalist (21) 🏆 Keyword Champion (2) 🤝 Dynamic Duo (20) 🏆 Grand Slam 🌱 Topic Pioneer 👥 Mega-Team (98) 👑 Triple Crown 🔬 Deep Specialist (16) 🧬 Topic Evolution 🗃️ Keyword Collector (261) ⚡ Prolific Year (12) 🔥 Unstoppable (11) 📈 Trend Setter 💎 Century Club (76)

Conferences

CORL (21) RSS (12) CVPR (10) ICLR (7) ICML (7) NIPS (7) ICCV (5) AAAI (2) ACL (2) ECCV (2) EMNLP (1) IJCAI (1) UAI (1)

Top co-authors

Linxi Fan (20) Li Fei-fei (18) Anima Anandkumar (15) Silvio Savarese (13) Ajay Mandlekar (9) Animesh Garg (8) Soroush Nasiriany (8) Zhenyu Jiang (7) Danfei Xu (7) Guanzhi Wang (7)

Research topics

Models (1)

Keywords

imitation learning (11) robot manipulation (10) domain generalization (4) large language model (4) few-shot learning (4) reinforcement learning (4) policy learning (4) multimodal learning (4) domain randomization (4) sample efficiency (3) transformer architecture (3) visual reasoning (3) 3d reconstruction (3) transfer learning (3) sim-to-real transfer (3) robotic manipulation (3) variational inference (3) robot learning (3) causal inference (2) zero-shot learning (2)

Papers

Towards Interpretable Tabular Reasoning: Enhancing LLM Reasoning on Tabular Data with Pre-Constructed Logic Graph ACL 2026 Learning from the Irrecoverable: Error-Localized Policy Optimization for Tool-Integrated LLM Reasoning ACL 2026 Constraint-Preserving Data Generation for One-Shot Visuomotor Policy Generalization CORL 2025 GraspMolmo: Generalizable Task-Oriented Grasping via Large-Scale Synthetic Data Generation CORL 2025 Enhancing Document Understanding with Group Position Embedding: A Novel Approach to Incorporate Layout Information ICLR 2025 LightCity: An Urban Dataset for Outdoor Inverse Rendering and Reconstruction under Multi-illumination Conditions ICCV 2025 CASPER: Inferring Diverse Intents for Assistive Teleoperation with Vision Language Models CORL 2025 Sim-and-Real Co-Training: A Simple Recipe for Vision-Based Robotic Manipulation RSS 2025 ASAP: Aligning Simulation and Real-World Physics for Learning Agile Humanoid Whole-Body Skills RSS 2025 OmniKV: Dynamic Context Selection for Efficient Long-Context LLMs ICLR 2025 LongVILA: Scaling Long-Context Visual Language Models for Long Videos ICLR 2025 One-Step Diffusion Policy: Fast Visuomotor Policies via Diffusion Distillation ICML 2025 Sim-to-Real Reinforcement Learning for Vision-Based Dexterous Manipulation on Humanoids CORL 2025 DreamGen: Unlocking Generalization in Robot Learning through Video World Models CORL 2025 FLARE: Robot Learning with Implicit World Modeling CORL 2025 PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs ICML 2024 AMAGO: Scalable In-Context Reinforcement Learning for Adaptive Agents ICLR 2024 Eureka: Human-Level Reward Design via Coding Large Language Models ICLR 2024 AMAGO-2: Breaking the Multi-Task Barrier in Meta-Reinforcement Learning with Transformers NIPS 2024 DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset RSS 2024 OKAMI: Teaching Humanoid Robots Manipulation Skills through Single Video Imitation CORL 2024 Harmon: Whole-Body Motion Generation of Humanoid Robots from Language Descriptions CORL 2024 Multi-Task Interactive Robot Fleet Learning with Visual World Models CORL 2024 Building Minimal and Reusable Causal State Abstractions for Reinforcement Learning AAAI 2024 DrEureka: Language Model Guided Sim-To-Real Transfer RSS 2024 RoboCasa: Large-Scale Simulation of Household Tasks for Generalist Robots RSS 2024 INTERPRET: Interactive Predicate Learning from Language Feedback for Generalizable Task Planning RSS 2024 LIBERO: Benchmarking Knowledge Transfer for Lifelong Robot Learning NIPS 2023 Fast Monocular Scene Reconstruction With Global-Sparse Local-Dense Grids CVPR 2023 Cross-Episodic Curriculum for Transformer Agents NIPS 2023 Building Compositional Robot Autonomy with Modularity and Abstraction AAAI 2023 Re-ViLM: Retrieval-Augmented Visual Language Model for Zero and Few-Shot Image Captioning EMNLP 2023 VIMA: Robot Manipulation with Multimodal Prompts ICML 2023 Learning Generalizable Manipulation Policies with Object-Centric 3D Representations CORL 2023 MUTEX: Learning Unified Policies from Multimodal Task Specifications CORL 2023 MimicGen: A Data Generation System for Scalable Robot Learning using Human Demonstrations CORL 2023 MimicPlay: Long-Horizon Imitation Learning by Watching Human Play CORL 2023 Robot Learning on the Job: Human-in-the-Loop Autonomy and Learning During Deployment RSS 2023 Ditto: Building Digital Twins of Articulated Objects From Interaction CVPR 2022 Causal Dynamics Learning for Task-Independent State Abstraction ICML 2022 RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning ICLR 2022 ACID: Action-Conditional Implicit Visual Dynamics for Deformable Object Manipulation RSS 2022 Pre-Trained Language Models for Interactive Decision-Making NIPS 2022 Bongard-HOI: Benchmarking Few-Shot Visual Reasoning for Human-Object Interactions CVPR 2022 Learning and Retrieval from Prior Data for Skill-based Imitation Learning CORL 2022 Coopernaut: End-to-End Driving With Cooperative Perception for Networked Vehicles CVPR 2022 VIOLA: Imitation Learning for Vision-Based Manipulation with Object Proposal Priors CORL 2022 MineDojo: Building Open-Ended Embodied Agents with Internet-Scale Knowledge NIPS 2022 Adaptive Procedural Task Generation for Hard-Exploration Problems ICLR 2021 Adversarial Skill Chaining for Long-Horizon Robot Manipulation via Terminal State Regularization CORL 2021 What Matters in Learning from Offline Human Demonstrations for Robot Manipulation CORL 2021 Dynamic Metric Learning: Towards a Scalable Metric Space To Accommodate Multiple Semantic Scales CVPR 2021 DiscoBox: Weakly Supervised Instance Segmentation and Semantic Correspondence From Box Supervision ICCV 2021 SECANT: Self-Expert Cloning for Zero-Shot Generalization of Visual Policies ICML 2021 Coach-Player Multi-agent Reinforcement Learning for Dynamic Team Composition ICML 2021 Tesseract: Tensorised Actors for Multi-Agent Reinforcement Learning ICML 2021 Discovering Generalizable Skills via Automated Generation of Diverse Tasks RSS 2021 Synergies Between Affordance and Geometry: 6-DoF Grasp Detection via Implicit Representations RSS 2021 Bongard-LOGO: A New Benchmark for Human-Level Concept Learning and Reasoning NIPS 2020 OCEAN: Online Task Inference for Compositional Tasks with Context Adaptation UAI 2020 RubiksNet: Learnable 3D-Shift for Efficient Video Action Recognition ECCV 2020 Spherical Feature Transform for Deep Metric Learning ECCV 2020 DualSMC: Tunneling Differentiable Filtering and Planning under Continuous POMDPs IJCAI 2020 Learning a Contact-Adaptive Controller for Robust, Efficient Legged Locomotion CORL 2020 DenseFusion: 6D Object Pose Estimation by Iterative Dense Fusion CVPR 2019 Neural Task Graphs: Generalizing to Unseen Tasks From a Single Video Demonstration CVPR 2019 Regression Planning Networks NIPS 2019 Situational Fusion of Visual Representation for Visual Navigation ICCV 2019 Dynamics Learning with Cascaded Variational Inference for Multi-Step Manipulation CORL 2019 Learning Task-Oriented Grasping for Tool Manipulation from Simulated Self-Supervision RSS 2018 SURREAL: Open-Source Reinforcement Learning Framework and Robot Manipulation Benchmark CORL 2018 ROBOTURK: A Crowdsourcing Platform for Robotic Skill Learning through Imitation CORL 2018 Reinforcement and Imitation Learning for Diverse Visuomotor Skills RSS 2018 Scene Graph Generation by Iterative Message Passing CVPR 2017 Knowledge Acquisition for Visual Question Answering via Iterative Querying CVPR 2017 Visual Semantic Planning Using Deep Successor Representations ICCV 2017 Visual7W: Grounded Question Answering in Images CVPR 2016 Action Recognition by Hierarchical Mid-Level Action Elements ICCV 2015