Jiangmiao Pang
48 papers · 2018–2026 · 9 top CS/AI conferences
Achievements
Academic Marathon (7)
Conference Polyglot (8)
Interdisciplinary Bridge
Keyword Pioneer
Cross-Pollinator (14)
Renaissance Researcher (8)
Taxonomy Completionist (78)
Dynamic Duo (20)
Keyword Champion (5)
Deep Specialist (10)
Topic Evolution
Prolific Year (12)
Unstoppable (8)
Trend Setter
Century Club (47)
Keyword Collector (204)
The Questioner
Conferences
CVPR (14)
NIPS (8)
ICCV (7)
RSS (6)
CORL (4)
ECCV (4)
ICLR (3)
AAAI (1)
EMNLP (1)
Keywords
object detection (9)
instance segmentation (5)
semantic segmentation (5)
humanoid robot (5)
reinforcement learning (4)
vision-language model (4)
visual grounding (3)
scene understanding (3)
large language model (3)
robot manipulation (3)
object tracking (2)
policy learning (2)
3d reconstruction (2)
image segmentation (2)
3d scene understanding (2)
question answering (2)
point cloud (2)
robotic manipulation (2)
representation learning (2)
contrastive learning (2)
Papers
Towards Efficient and Robust Manipulation via Multi-Frame Vision-Language-Action Modeling
AAAI 2026
Learning Humanoid Standing-up Control across Diverse Postures
RSS 2025
A Data-Centric Revisit of Pre-Trained Vision Models for Robot Learning
CVPR 2025
GENMANIP: LLM-driven Simulation for Generalizable Instruction-Following Manipulation
CVPR 2025
RoboGround: Robotic Manipulation with Grounded Vision-Language Priors
CVPR 2025
Language-to-Space Programming for Training-Free 3D Visual Grounding
EMNLP 2025
LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D Capabilities
ICCV 2025
Aether: Geometric-Aware Unified World Modeling
ICCV 2025
Rethinking the Embodied Gap in Vision-and-Language Navigation: A Holistic Study of Physical and Visual Disparities
ICCV 2025
GLEAM: Learning Generalizable Exploration Policy for Active Mapping in Complex 3D Indoor Scenes
ICCV 2025
VFlowOpt: A Token Pruning Framework for LMMs with Visual Information Flow-Guided Optimization
ICCV 2025
ObjectGS: Object-aware Scene Reconstruction and Scene Understanding via Gaussian Splatting
ICCV 2025
Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation
ICLR 2025
A Unified and General Humanoid Whole-Body Controller for Fine-Grained Locomotion
RSS 2025
BeamDojo: Learning Agile Humanoid Locomotion on Sparse Footholds
RSS 2025
HOMIE: Humanoid Loco-Manipulation with Isomorphic Exoskeleton Cockpit
RSS 2025
Novel Demonstration Generation with Gaussian Splatting Enables Robust One-Shot Manipulation
RSS 2025
Gripper Pose and Object Pointflow as Interfaces for Robotic Bimanual Manipulation
RSS 2025
Unified Human-Scene Interaction via Prompted Chain-of-Contacts
ICLR 2024
EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI
CVPR 2024
GenNBV: Generalizable Next-Best-View Policy for Active 3D Reconstruction
CVPR 2024
What Makes CLIP More Robust to Long-Tailed Pre-Training Data? A Controlled Study for Transferable Insights
NIPS 2024
MMScan: A Multi-Modal 3D Scene Dataset with Hierarchical Grounded Language Annotations
NIPS 2024
VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding
CORL 2024
Learning H-Infinity Locomotion Control
CORL 2024
Hybrid Internal Model: Learning Agile Legged Locomotion with Simulated Robot Response
ICLR 2024
Chat-Scene: Bridging 3D Scene and Large Language Models with Object Identifiers
NIPS 2024
CooHOI: Learning Cooperative Human-Object Interaction with Manipulated Object Dynamics
NIPS 2024
MGF: Mixed Gaussian Flow for Diverse Trajectory Prediction
NIPS 2024
PointLLM: Empowering Large Language Models to Understand Point Clouds
ECCV 2024
MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-Training
CVPR 2023
Observation-Centric SORT: Rethinking SORT for Robust Multi-Object Tracking
CVPR 2023
Dense Distinct Query for End-to-End Object Detection
CVPR 2023
Tube-Link: A Flexible Cross Tube Framework for Universal Video Segmentation
ICCV 2023
OV-PARTS: Towards Open-Vocabulary Part Segmentation
NIPS 2023
DORT: Modeling Dynamic Objects in Recurrent for Multi-Camera 3D Object Detection and Tracking
CORL 2023
Monocular 3D Object Detection with Depth from Motion
ECCV 2022
Video K-Net: A Simple, Strong, and Unified Baseline for Video Segmentation
CVPR 2022
Dense Siamese Network for Dense Unsupervised Learning
ECCV 2022
Seesaw Loss for Long-Tailed Instance Segmentation
CVPR 2021
K-Net: Towards Unified Image Segmentation
NIPS 2021
Quasi-Dense Similarity Learning for Multiple Object Tracking
CVPR 2021
Probabilistic and Geometric Depth: Detecting Objects in Perspective
CORL 2021
Side-Aware Boundary Localization for More Precise Object Detection
ECCV 2020
Libra R-CNN: Towards Balanced Learning for Object Detection
CVPR 2019
Hybrid Task Cascade for Instance Segmentation
CVPR 2019
Adapting Object Detectors via Selective Cross-Domain Alignment
CVPR 2019
FishNet: A Versatile Backbone for Image, Region, and Pixel Level Prediction
NIPS 2018