Jiangmiao Pang

48 papers · 2018–2026 · 9 conferences · across top CS/AI conferences

Achievements

+13 more ↓

🏃 Academic Marathon (7) 🌍 Conference Polyglot (8) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐝 Cross-Pollinator (14)

🐝 Cross-Pollinator (14) 🌈 Renaissance Researcher (8) 🗺️ Taxonomy Completionist (78) 🤝 Dynamic Duo (20) 🏆 Keyword Champion (5) 🔬 Deep Specialist (10) 🧬 Topic Evolution ⚡ Prolific Year (12) 🔥 Unstoppable (8) 📈 Trend Setter 💎 Century Club (47) 🗃️ Keyword Collector (204) ❓ The Questioner

Conferences

CVPR (14) NIPS (8) ICCV (7) RSS (6) CORL (4) ECCV (4) ICLR (3) AAAI (1) EMNLP (1)

Top co-authors

Dahua Lin (21) Tai WANG (20) Wenwei Zhang (12) Yilun Chen (11) Kai Chen (9) Chen Change Loy (7) Runsen Xu (7) Jianping Shi (5) Jinkun Cao (5) Jiaqi Wang (4)

Keywords

object detection (9) instance segmentation (5) semantic segmentation (5) humanoid robot (5) reinforcement learning (4) vision-language model (4) visual grounding (3) scene understanding (3) large language model (3) robot manipulation (3) object tracking (2) policy learning (2) 3d reconstruction (2) image segmentation (2) 3d scene understanding (2) question answering (2) point cloud (2) robotic manipulation (2) representation learning (2) contrastive learning (2)

Papers

Towards Efficient and Robust Manipulation via Multi-Frame Vision-Language-Action Modeling AAAI 2026 Learning Humanoid Standing-up Control across Diverse Postures RSS 2025 A Data-Centric Revisit of Pre-Trained Vision Models for Robot Learning CVPR 2025 GENMANIP: LLM-driven Simulation for Generalizable Instruction-Following Manipulation CVPR 2025 RoboGround: Robotic Manipulation with Grounded Vision-Language Priors CVPR 2025 Language-to-Space Programming for Training-Free 3D Visual Grounding EMNLP 2025 LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D Capabilities ICCV 2025 Aether: Geometric-Aware Unified World Modeling ICCV 2025 Rethinking the Embodied Gap in Vision-and-Language Navigation: A Holistic Study of Physical and Visual Disparities ICCV 2025 GLEAM: Learning Generalizable Exploration Policy for Active Mapping in Complex 3D Indoor Scene ICCV 2025 VFlowOpt: A Token Pruning Framework for LMMs with Visual Information Flow-Guided Optimization ICCV 2025 ObjectGS: Object-aware Scene Reconstruction and Scene Understanding via Gaussian Splatting ICCV 2025 Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation ICLR 2025 A Unified and General Humanoid Whole-Body Controller for Fine-Grained Locomotion RSS 2025 BeamDojo: Learning Agile Humanoid Locomotion on Sparse Footholds RSS 2025 HOMIE: Humanoid Loco-Manipulation with Isomorphic Exoskeleton Cockpit RSS 2025 Novel Demonstration Generation with Gaussian Splatting Enables Robust One-Shot Manipulation RSS 2025 Gripper Pose and Object Pointflow as Interfaces for Robotic Bimanual Manipulation RSS 2025 Unified Human-Scene Interaction via Prompted Chain-of-Contacts ICLR 2024 EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI CVPR 2024 GenNBV: Generalizable Next-Best-View Policy for Active 3D Reconstruction CVPR 2024 What Makes CLIP More Robust to Long-Tailed Pre-Training Data? A Controlled Study for Transferable Insights NIPS 2024 MMScan: A Multi-Modal 3D Scene Dataset with Hierarchical Grounded Language Annotations NIPS 2024 VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding CORL 2024 Learning H-Infinity Locomotion Control CORL 2024 Hybrid Internal Model: Learning Agile Legged Locomotion with Simulated Robot Response ICLR 2024 Chat-Scene: Bridging 3D Scene and Large Language Models with Object Identifiers NIPS 2024 CooHOI: Learning Cooperative Human-Object Interaction with Manipulated Object Dynamics NIPS 2024 MGF: Mixed Gaussian Flow for Diverse Trajectory Prediction NIPS 2024 PointLLM: Empowering Large Language Models to Understand Point Clouds ECCV 2024 MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-Training CVPR 2023 Observation-Centric SORT: Rethinking SORT for Robust Multi-Object Tracking CVPR 2023 Dense Distinct Query for End-to-End Object Detection CVPR 2023 Tube-Link: A Flexible Cross Tube Framework for Universal Video Segmentation ICCV 2023 OV-PARTS: Towards Open-Vocabulary Part Segmentation NIPS 2023 DORT: Modeling Dynamic Objects in Recurrent for Multi-Camera 3D Object Detection and Tracking CORL 2023 Monocular 3D Object Detection with Depth from Motion ECCV 2022 Video K-Net: A Simple, Strong, and Unified Baseline for Video Segmentation CVPR 2022 Dense Siamese Network for Dense Unsupervised Learning ECCV 2022 Seesaw Loss for Long-Tailed Instance Segmentation CVPR 2021 K-Net: Towards Unified Image Segmentation NIPS 2021 Quasi-Dense Similarity Learning for Multiple Object Tracking CVPR 2021 Probabilistic and Geometric Depth: Detecting Objects in Perspective CORL 2021 Side-Aware Boundary Localization for More Precise Object Detection ECCV 2020 Libra R-CNN: Towards Balanced Learning for Object Detection CVPR 2019 Hybrid Task Cascade for Instance Segmentation CVPR 2019 Adapting Object Detectors via Selective Cross-Domain Alignment CVPR 2019 FishNet: A Versatile Backbone for Image, Region, and Pixel Level Prediction NIPS 2018