Shuran Song

62 papers · 2013–2025 · 8 top CS/AI conferences

Achievements

🌍 Conference Polyglot (8) 🐣 Hot Topic Early Bird 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🏃 Academic Marathon (12)

🌟 Keyword Trendsetter Combo (4) 🏠 Conference Loyalist (24) 🤝 Dynamic Duo (11) 👥 Mega-Team (98) 🔬 Deep Specialist (13) 🧬 Topic Evolution 🏆 Keyword Champion (2) ⚡ Prolific Year (12) 💎 Century Club (62) 🗃️ Keyword Collector (193) 📈 Trend Setter 🔥 Unstoppable (11) ❓ The Questioner 🚀 Conference Pioneer

Conferences

CORL (24) CVPR (12) RSS (11) NIPS (5) ECCV (3) ICCV (3) ICLR (3) AAAI (1)

Top co-authors

Zhenjia Xu (11) Cheng Chi (10) Huy Ha (8) Samir Yitzhak Gadre (7) Andy Zeng (7) Benjamin Burchfiel (6) Eric Cousineau (6) Ludwig Schmidt (6) Thomas Funkhouser (6) Siyuan Feng (5)

Research topics

Computer Vision (1) Robotics (1)

Keywords

depth estimation (6) robot manipulation (5) semantic segmentation (4) pose estimation (4) convolutional neural network (4) 3d vision (4) scene understanding (4) policy learning (4) zero-shot learning (4) deformable object (3) robotic manipulation (3) vision-language model (3) imitation learning (3) visuomotor policy (3) self-supervised learning (3) closed-loop control (3) robot navigation (2) 3d reconstruction (2) diffusion model (2) point cloud (2)

Papers

Robot Trains Robot: Automatic Real-World Policy Adaptation and Learning for Humanoids CORL 2025
BEHAVIOR Robot Suite: Streamlining Real-World Whole-Body Manipulation for Everyday Household Activities CORL 2025
DexUMI: Using Human Hand as the Universal Manipulation Interface for Dexterous Manipulation CORL 2025
Language models scale reliably with over-training and on downstream tasks ICLR 2025
Should VLMs be Pre-trained with Image Data? ICLR 2025
Unified Video Action Model RSS 2025
RoboPanoptes: The All-Seeing Robot with Whole-body Dexterity RSS 2025
Real2Code: Reconstruct Articulated Objects via Code Generation ICLR 2025
Vision in Action: Learning Active Perception from Human Demonstrations CORL 2025
ToddlerBot: Open-Source ML-Compatible Humanoid Platform for Loco-Manipulation CORL 2025
One Demo is Worth a Thousand Trajectories: Action-View Augmentation for Visuomotor Policies CORL 2025
DoughNet: A Visual Predictive Model for Topological Manipulation of Deformable Objects ECCV 2024
DataComp-LM: In search of the next generation of training sets for language models NIPS 2024
DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset RSS 2024
Differentiable Robot Rendering CORL 2024
ManiWAV: Learning Robot Manipulation from In-the-Wild Audio-Visual Data CORL 2024
EquiBot: SIM(3)-Equivariant Diffusion Policy for Generalizable and Data Efficient Learning CORL 2024
Flow as the Cross-domain Manipulation Interface CORL 2024
TidyBot++: An Open-Source Holonomic Mobile Manipulator for Robot Learning CORL 2024
Dreamitate: Real-World Visuomotor Policy Learning via Video Generation CORL 2024
Dynamics-Guided Diffusion Model for Sensor-less Robot Manipulator Design CORL 2024
UMI-on-Legs: Making Manipulation Policies Mobile with Manipulation-Centric Whole-body Controllers CORL 2024
Universal Manipulation Interface: In-The-Wild Robot Teaching Without In-The-Wild Robots RSS 2024
XSkill: Cross Embodiment Skill Discovery CORL 2023
REFLECT: Summarizing Robot Experiences for Failure Explanation and Correction CORL 2023
Rearrangement Planning for General Part Assembly CORL 2023
Tracking and Reconstructing Hand Object Interactions from Point Cloud Sequences in the Wild AAAI 2023
Scaling Up and Distilling Down: Language-Guided Robot Skill Acquisition CORL 2023
CoWs on Pasture: Baselines and Benchmarks for Language-Driven Zero-Shot Object Navigation CVPR 2023
RoboNinja: Learning an Adaptive Cutting Policy for Multi-Material Objects RSS 2023
Diffusion Policy: Visuomotor Policy Learning via Action Diffusion RSS 2023
DataComp: In search of the next generation of multimodal datasets NIPS 2023
Continuous Scene Representations for Embodied AI CVPR 2022
Semantic Abstraction: Open-World 3D Scene Understanding from 2D Vision-Language Models CORL 2022
BusyBot: Learning to Interact, Reason, and Plan in a BusyBoard Environment CORL 2022
Patching open-vocabulary models by interpolating weights NIPS 2022
ASPiRe: Adaptive Skill Priors for Reinforcement Learning NIPS 2022
DextAIRity: Deformable Manipulation Can be a Breeze RSS 2022
Iterative Residual Policy for Goal-Conditioned Dynamic Manipulation of Deformable Objects RSS 2022
Leveraging SE(3) Equivariance for Self-supervised Category-Level Object Pose Estimation from Point Clouds NIPS 2021
FlingBot: The Unreasonable Effectiveness of Dynamic Manipulation for Cloth Unfolding CORL 2021
Act the Part: Learning Interaction Strategies for Articulated Object Part Discovery ICCV 2021
GarmentNets: Category-Level Pose Estimation for Garments via Canonical Space Shape Completion ICCV 2021
Fit2Form: 3D Generative Model for Robot Gripper Form Design CORL 2020
Spatial Action Maps for Mobile Manipulation RSS 2020
Learning 3D Dynamic Scene Representations for Robot Manipulation CORL 2020
Multitask Learning Strengthens Adversarial Robustness ECCV 2020
Learning a Decentralized Multi-Arm Motion Planner CORL 2020
Category-Level Articulated Object Pose Estimation CVPR 2020
DensePhysNet: Learning Dense Physical Object Representations Via Multi-Step Dynamic Interactions RSS 2019
Neural Illumination: Lighting Prediction for Indoor Environments CVPR 2019
Normalized Object Coordinate Space for Category-Level 6D Object Pose and Size Estimation CVPR 2019
TossingBot: Learning to Throw Arbitrary Objects with Residual Physics RSS 2019
Neural Graph Matching Networks for Fewshot 3D Action Recognition ECCV 2018
Im2Pano3D: Extrapolating 360° Structure and Semantics Beyond the Field of View CVPR 2018
Semantic Scene Completion From a Single Depth Image CVPR 2017
Physically-Based Rendering for Indoor Scene Understanding Using Convolutional Neural Networks CVPR 2017
3DMatch: Learning Local Geometric Descriptors From RGB-D Reconstructions CVPR 2017
Deep Sliding Shapes for Amodal 3D Object Detection in RGB-D Images CVPR 2016
SUN RGB-D: A RGB-D Scene Understanding Benchmark Suite CVPR 2015
3D ShapeNets: A Deep Representation for Volumetric Shapes CVPR 2015
Tracking Revisited Using RGBD Camera: Unified Benchmark and Baselines ICCV 2013