Yuke Zhu
78 papers · 2015–2026 · 13 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+18 more ↓ Show less ↑
πΊοΈ Taxonomy Completionist (17) π§ Keyword Pioneer π Interdisciplinary Bridge π Renaissance Researcher (5) π£ Hot Topic Early Bird
π
Interdisciplinary Bridge
πΊοΈ
Taxonomy Completionist
(17)
π£
Hot Topic Early Bird
π
Keyword Trendsetter Combo
(4)
π
Conference Loyalist
(21)
π
Keyword Champion
(2)
π€
Dynamic Duo
(20)
π
Grand Slam
π±
Topic Pioneer
π₯
Mega-Team
(98)
π
Triple Crown
π¬
Deep Specialist
(16)
π§¬
Topic Evolution
ποΈ
Keyword Collector
(261)
β‘
Prolific Year
(12)
π₯
Unstoppable
(11)
π
Trend Setter
π
Century Club
(76)
Conferences
CORL (21)
RSS (12)
CVPR (10)
ICLR (7)
ICML (7)
NIPS (7)
ICCV (5)
AAAI (2)
ACL (2)
ECCV (2)
EMNLP (1)
IJCAI (1)
UAI (1)
Top co-authors
Research topics
Keywords
imitation learning
(11)
robot manipulation
(10)
domain generalization
(4)
large language model
(4)
few-shot learning
(4)
reinforcement learning
(4)
policy learning
(4)
multimodal learning
(4)
domain randomization
(4)
sample efficiency
(3)
transformer architecture
(3)
visual reasoning
(3)
3d reconstruction
(3)
transfer learning
(3)
sim-to-real transfer
(3)
robotic manipulation
(3)
variational inference
(3)
robot learning
(3)
causal inference
(2)
zero-shot learning
(2)
Papers
Towards Interpretable Tabular Reasoning: Enhancing LLM Reasoning on Tabular Data with Pre-Constructed Logic Graph
ACL 2026
Learning from the Irrecoverable: Error-Localized Policy Optimization for Tool-Integrated LLM Reasoning
ACL 2026
Constraint-Preserving Data Generation for One-Shot Visuomotor Policy Generalization
CORL 2025
GraspMolmo: Generalizable Task-Oriented Grasping via Large-Scale Synthetic Data Generation
CORL 2025
Enhancing Document Understanding with Group Position Embedding: A Novel Approach to Incorporate Layout Information
ICLR 2025
LightCity: An Urban Dataset for Outdoor Inverse Rendering and Reconstruction under Multi-illumination Conditions
ICCV 2025
CASPER: Inferring Diverse Intents for Assistive Teleoperation with Vision Language Models
CORL 2025
Sim-and-Real Co-Training: A Simple Recipe for Vision-Based Robotic Manipulation
RSS 2025
ASAP: Aligning Simulation and Real-World Physics for Learning Agile Humanoid Whole-Body Skills
RSS 2025
OmniKV: Dynamic Context Selection for Efficient Long-Context LLMs
ICLR 2025
LongVILA: Scaling Long-Context Visual Language Models for Long Videos
ICLR 2025
One-Step Diffusion Policy: Fast Visuomotor Policies via Diffusion Distillation
ICML 2025
Sim-to-Real Reinforcement Learning for Vision-Based Dexterous Manipulation on Humanoids
CORL 2025
DreamGen: Unlocking Generalization in Robot Learning through Video World Models
CORL 2025
FLARE: Robot Learning with Implicit World Modeling
CORL 2025
PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs
ICML 2024
AMAGO: Scalable In-Context Reinforcement Learning for Adaptive Agents
ICLR 2024
Eureka: Human-Level Reward Design via Coding Large Language Models
ICLR 2024
AMAGO-2: Breaking the Multi-Task Barrier in Meta-Reinforcement Learning with Transformers
NIPS 2024
DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset
RSS 2024
OKAMI: Teaching Humanoid Robots Manipulation Skills through Single Video Imitation
CORL 2024
Harmon: Whole-Body Motion Generation of Humanoid Robots from Language Descriptions
CORL 2024
Multi-Task Interactive Robot Fleet Learning with Visual World Models
CORL 2024
Building Minimal and Reusable Causal State Abstractions for Reinforcement Learning
AAAI 2024
DrEureka: Language Model Guided Sim-To-Real Transfer
RSS 2024
RoboCasa: Large-Scale Simulation of Household Tasks for Generalist Robots
RSS 2024
INTERPRET: Interactive Predicate Learning from Language Feedback for Generalizable Task Planning
RSS 2024
LIBERO: Benchmarking Knowledge Transfer for Lifelong Robot Learning
NIPS 2023
Fast Monocular Scene Reconstruction With Global-Sparse Local-Dense Grids
CVPR 2023
Cross-Episodic Curriculum for Transformer Agents
NIPS 2023
Building Compositional Robot Autonomy with Modularity and Abstraction
AAAI 2023
Re-ViLM: Retrieval-Augmented Visual Language Model for Zero and Few-Shot Image Captioning
EMNLP 2023
VIMA: Robot Manipulation with Multimodal Prompts
ICML 2023
Learning Generalizable Manipulation Policies with Object-Centric 3D Representations
CORL 2023
MUTEX: Learning Unified Policies from Multimodal Task Specifications
CORL 2023
MimicGen: A Data Generation System for Scalable Robot Learning using Human Demonstrations
CORL 2023
MimicPlay: Long-Horizon Imitation Learning by Watching Human Play
CORL 2023
Robot Learning on the Job: Human-in-the-Loop Autonomy and Learning During Deployment
RSS 2023
Ditto: Building Digital Twins of Articulated Objects From Interaction
CVPR 2022
Causal Dynamics Learning for Task-Independent State Abstraction
ICML 2022
RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning
ICLR 2022
ACID: Action-Conditional Implicit Visual Dynamics for Deformable Object Manipulation
RSS 2022
Pre-Trained Language Models for Interactive Decision-Making
NIPS 2022
Bongard-HOI: Benchmarking Few-Shot Visual Reasoning for Human-Object Interactions
CVPR 2022
Learning and Retrieval from Prior Data for Skill-based Imitation Learning
CORL 2022
Coopernaut: End-to-End Driving With Cooperative Perception for Networked Vehicles
CVPR 2022
VIOLA: Imitation Learning for Vision-Based Manipulation with Object Proposal Priors
CORL 2022
MineDojo: Building Open-Ended Embodied Agents with Internet-Scale Knowledge
NIPS 2022
Adaptive Procedural Task Generation for Hard-Exploration Problems
ICLR 2021
Adversarial Skill Chaining for Long-Horizon Robot Manipulation via Terminal State Regularization
CORL 2021
What Matters in Learning from Offline Human Demonstrations for Robot Manipulation
CORL 2021
Dynamic Metric Learning: Towards a Scalable Metric Space To Accommodate Multiple Semantic Scales
CVPR 2021
DiscoBox: Weakly Supervised Instance Segmentation and Semantic Correspondence From Box Supervision
ICCV 2021
SECANT: Self-Expert Cloning for Zero-Shot Generalization of Visual Policies
ICML 2021
Coach-Player Multi-agent Reinforcement Learning for Dynamic Team Composition
ICML 2021
Tesseract: Tensorised Actors for Multi-Agent Reinforcement Learning
ICML 2021
Discovering Generalizable Skills via Automated Generation of Diverse Tasks
RSS 2021
Synergies Between Affordance and Geometry: 6-DoF Grasp Detection via Implicit Representations
RSS 2021
Bongard-LOGO: A New Benchmark for Human-Level Concept Learning and Reasoning
NIPS 2020
OCEAN: Online Task Inference for Compositional Tasks with Context Adaptation
UAI 2020
RubiksNet: Learnable 3D-Shift for Efficient Video Action Recognition
ECCV 2020
Spherical Feature Transform for Deep Metric Learning
ECCV 2020
DualSMC: Tunneling Differentiable Filtering and Planning under Continuous POMDPs
IJCAI 2020
Learning a Contact-Adaptive Controller for Robust, Efficient Legged Locomotion
CORL 2020
DenseFusion: 6D Object Pose Estimation by Iterative Dense Fusion
CVPR 2019
Neural Task Graphs: Generalizing to Unseen Tasks From a Single Video Demonstration
CVPR 2019
Regression Planning Networks
NIPS 2019
Situational Fusion of Visual Representation for Visual Navigation
ICCV 2019
Dynamics Learning with Cascaded Variational Inference for Multi-Step Manipulation
CORL 2019
Learning Task-Oriented Grasping for Tool Manipulation from Simulated Self-Supervision
RSS 2018
SURREAL: Open-Source Reinforcement Learning Framework and Robot Manipulation Benchmark
CORL 2018
ROBOTURK: A Crowdsourcing Platform for Robotic Skill Learning through Imitation
CORL 2018
Reinforcement and Imitation Learning for Diverse Visuomotor Skills
RSS 2018
Scene Graph Generation by Iterative Message Passing
CVPR 2017
Knowledge Acquisition for Visual Question Answering via Iterative Querying
CVPR 2017
Visual Semantic Planning Using Deep Successor Representations
ICCV 2017
Visual7W: Grounded Question Answering in Images
CVPR 2016
Action Recognition by Hierarchical Mid-Level Action Elements
ICCV 2015