Shuran Song
62 papers · 2013–2025 · 8 conferences · across top CS/AI conferences
Achievements
🌍 Conference Polyglot (8)
🐣 Hot Topic Early Bird
🧭 Keyword Pioneer
🌉 Interdisciplinary Bridge
🏃 Academic Marathon (12)
🌟 Keyword Trendsetter Combo (4)
🏠 Conference Loyalist (24)
🤝 Dynamic Duo (11)
👥 Mega-Team (98)
🔬 Deep Specialist (13)
🧬 Topic Evolution
🏆 Keyword Champion (2)
⚡ Prolific Year (12)
💎 Century Club (62)
🗃️ Keyword Collector (193)
📈 Trend Setter
🔥 Unstoppable (11)
❓ The Questioner
🚀 Conference Pioneer
Conferences
CORL (24)
CVPR (12)
RSS (11)
NIPS (5)
ECCV (3)
ICCV (3)
ICLR (3)
AAAI (1)
Research topics
Keywords
depth estimation (6)
robot manipulation (5)
semantic segmentation (4)
pose estimation (4)
convolutional neural network (4)
3d vision (4)
scene understanding (4)
policy learning (4)
zero-shot learning (4)
deformable object (3)
robotic manipulation (3)
vision-language model (3)
imitation learning (3)
visuomotor policy (3)
self-supervised learning (3)
closed-loop control (3)
robot navigation (2)
3d reconstruction (2)
diffusion model (2)
point cloud (2)
Papers
Robot Trains Robot: Automatic Real-World Policy Adaptation and Learning for Humanoids
CORL 2025
BEHAVIOR Robot Suite: Streamlining Real-World Whole-Body Manipulation for Everyday Household Activities
CORL 2025
DexUMI: Using Human Hand as the Universal Manipulation Interface for Dexterous Manipulation
CORL 2025
Language models scale reliably with over-training and on downstream tasks
ICLR 2025
Should VLMs be Pre-trained with Image Data?
ICLR 2025
Unified Video Action Model
RSS 2025
RoboPanoptes: The All-Seeing Robot with Whole-body Dexterity
RSS 2025
Real2Code: Reconstruct Articulated Objects via Code Generation
ICLR 2025
Vision in Action: Learning Active Perception from Human Demonstrations
CORL 2025
ToddlerBot: Open-Source ML-Compatible Humanoid Platform for Loco-Manipulation
CORL 2025
One Demo is Worth a Thousand Trajectories: Action-View Augmentation for Visuomotor Policies
CORL 2025
DoughNet: A Visual Predictive Model for Topological Manipulation of Deformable Objects
ECCV 2024
DataComp-LM: In search of the next generation of training sets for language models
NIPS 2024
DROID: A Large-Scale In-The-Wild Robot Manipulation Dataset
RSS 2024
Differentiable Robot Rendering
CORL 2024
ManiWAV: Learning Robot Manipulation from In-the-Wild Audio-Visual Data
CORL 2024
EquiBot: SIM(3)-Equivariant Diffusion Policy for Generalizable and Data Efficient Learning
CORL 2024
Flow as the Cross-domain Manipulation Interface
CORL 2024
TidyBot++: An Open-Source Holonomic Mobile Manipulator for Robot Learning
CORL 2024
Dreamitate: Real-World Visuomotor Policy Learning via Video Generation
CORL 2024
Dynamics-Guided Diffusion Model for Sensor-less Robot Manipulator Design
CORL 2024
UMI-on-Legs: Making Manipulation Policies Mobile with Manipulation-Centric Whole-body Controllers
CORL 2024
Universal Manipulation Interface: In-The-Wild Robot Teaching Without In-The-Wild Robots
RSS 2024
XSkill: Cross Embodiment Skill Discovery
CORL 2023
REFLECT: Summarizing Robot Experiences for Failure Explanation and Correction
CORL 2023
Rearrangement Planning for General Part Assembly
CORL 2023
Tracking and Reconstructing Hand Object Interactions from Point Cloud Sequences in the Wild
AAAI 2023
Scaling Up and Distilling Down: Language-Guided Robot Skill Acquisition
CORL 2023
CoWs on Pasture: Baselines and Benchmarks for Language-Driven Zero-Shot Object Navigation
CVPR 2023
RoboNinja: Learning an Adaptive Cutting Policy for Multi-Material Objects
RSS 2023
Diffusion Policy: Visuomotor Policy Learning via Action Diffusion
RSS 2023
DataComp: In search of the next generation of multimodal datasets
NIPS 2023
Continuous Scene Representations for Embodied AI
CVPR 2022
Semantic Abstraction: Open-World 3D Scene Understanding from 2D Vision-Language Models
CORL 2022
BusyBot: Learning to Interact, Reason, and Plan in a BusyBoard Environment
CORL 2022
Patching open-vocabulary models by interpolating weights
NIPS 2022
ASPiRe: Adaptive Skill Priors for Reinforcement Learning
NIPS 2022
DextAIRity: Deformable Manipulation Can be a Breeze
RSS 2022
Iterative Residual Policy for Goal-Conditioned Dynamic Manipulation of Deformable Objects
RSS 2022
Leveraging SE(3) Equivariance for Self-supervised Category-Level Object Pose Estimation from Point Clouds
NIPS 2021
FlingBot: The Unreasonable Effectiveness of Dynamic Manipulation for Cloth Unfolding
CORL 2021
Act the Part: Learning Interaction Strategies for Articulated Object Part Discovery
ICCV 2021
GarmentNets: Category-Level Pose Estimation for Garments via Canonical Space Shape Completion
ICCV 2021
Fit2Form: 3D Generative Model for Robot Gripper Form Design
CORL 2020
Spatial Action Maps for Mobile Manipulation
RSS 2020
Learning 3D Dynamic Scene Representations for Robot Manipulation
CORL 2020
Multitask Learning Strengthens Adversarial Robustness
ECCV 2020
Learning a Decentralized Multi-Arm Motion Planner
CORL 2020
Category-Level Articulated Object Pose Estimation
CVPR 2020
DensePhysNet: Learning Dense Physical Object Representations Via Multi-Step Dynamic Interactions
RSS 2019
Neural Illumination: Lighting Prediction for Indoor Environments
CVPR 2019
Normalized Object Coordinate Space for Category-Level 6D Object Pose and Size Estimation
CVPR 2019
TossingBot: Learning to Throw Arbitrary Objects with Residual Physics
RSS 2019
Neural Graph Matching Networks for Fewshot 3D Action Recognition
ECCV 2018
Im2Pano3D: Extrapolating 360° Structure and Semantics Beyond the Field of View
CVPR 2018
Semantic Scene Completion From a Single Depth Image
CVPR 2017
Physically-Based Rendering for Indoor Scene Understanding Using Convolutional Neural Networks
CVPR 2017
3DMatch: Learning Local Geometric Descriptors From RGB-D Reconstructions
CVPR 2017
Deep Sliding Shapes for Amodal 3D Object Detection in RGB-D Images
CVPR 2016
SUN RGB-D: A RGB-D Scene Understanding Benchmark Suite
CVPR 2015
3D ShapeNets: A Deep Representation for Volumetric Shapes
CVPR 2015
Tracking Revisited Using RGBD Camera: Unified Benchmark and Baselines
ICCV 2013