Yuexin Ma

60 papers · 2019–2026 · 10 conferences · across top CS/AI conferences

Achievements

+15 more ↓

🐣 Hot Topic Early Bird 🌍 Conference Polyglot (10) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🏃 Academic Marathon (6)

🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🏠 Conference Loyalist (24) 🤝 Dynamic Duo (26) 👑 Triple Crown 🏆 Grand Slam 🔬 Deep Specialist (26) 🏆 Keyword Champion (5) 🚀 Conference Pioneer ⚡ Prolific Year (19) 🗃️ Keyword Collector (234) ❓ The Questioner 💎 Century Club (59) 🔥 Unstoppable (7)

Conferences

CVPR (24) AAAI (9) ICCV (8) ECCV (6) IJCAI (4) ICLR (2) ICML (2) NIPS (2) WACV (2) MICCAI (1)

Top co-authors

Xinge ZHU (26) Lan Xu (14) Jingyi Yu (13) Yuenan Hou (11) Wenping Wang (9) Runnan Chen (9) Jingya Wang (7) Xiaoxiao Long (7) Youquan Liu (7) Xidong Peng (6)

Research topics

Architectures (1)

Keywords

point cloud (11) semantic segmentation (9) autonomous driving (9) multi-modal learning (8) human pose estimation (8) 3d reconstruction (6) 3d vision (6) lidar point cloud (5) multimodal learning (5) motion capture (5) domain adaptation (5) knowledge distillation (5) pose estimation (5) lidar segmentation (5) diffusion model (4) scene understanding (4) human motion capture (4) sensor fusion (4) 3d pose estimation (4) 3d human pose estimation (4)

Papers

Affordance-R1: Reinforcement Learning for Generalizable Affordance Reasoning in Multimodal Large Language Models AAAI 2026 DexGrasp Anything: Towards Universal Robotic Dexterous Grasping with Physics Awareness CVPR 2025 ClimbingCap: Multi-Modal Dataset and Method for Rock Climbing in World Coordinate CVPR 2025 Generalizable Single-View Object Pose Estimation by Two-Side Generating and Matching WACV 2025 UniDB: A Unified Diffusion Bridge Framework via Stochastic Optimal Control ICML 2025 CityAnchor: City-scale 3D Visual Grounding with Multi-modality LLMs ICLR 2025 Can LVLMs Obtain a Driver’s License? A Benchmark Towards Reliable AGI for Autonomous Driving AAAI 2025 FreeCap: Hybrid Calibration-Free Motion Capture in Open Environments AAAI 2025 UniDemoiré: Towards Universal Image Demoiréing with Data Generation and Synthesis AAAI 2025 ReAL-AD: Towards Human-Like Reasoning in End-to-End Autonomous Driving ICCV 2025 DexH2R: A Benchmark for Dynamic Dexterous Grasping in Human-to-Robot Handover ICCV 2025 Towards Immersive Human-X Interaction: A Real-Time Framework for Physically Plausible Motion Synthesis ICCV 2025 EvolvingGrasp: Evolutionary Grasp Generation via Efficient Preference Alignment ICCV 2025 EasyHOI: Unleashing the Power of Large Models for Reconstructing Hand-Object Interactions in the Wild CVPR 2025 OccMamba: Semantic Occupancy Prediction with State Space Models CVPR 2025 SemGeoMo: Dynamic Contextual Human Motion Generation with Semantic and Geometric Guidance CVPR 2025 LiveHPS++: Robust and Coherent Motion Capture in Dynamic Free Environment ECCV 2024 OctreeOcc: Efficient and Multi-Granularity Occupancy Prediction Using Octree Queries NIPS 2024 HybridGait: A Benchmark for Spatial-Temporal Cloth-Changing Gait Recognition with Hybrid Explorations AAAI 2024 Wonder3D: Single Image to 3D using Cross-Domain Diffusion CVPR 2024 HUNTER: Unsupervised Human-centric 3D Detection via Transferring Knowledge from Synthetic Instances to Real Scenes CVPR 2024 RELI11D: A Comprehensive Multimodal Human Motion Dataset and Method CVPR 2024 TASeg: Temporal Aggregation Network for LiDAR Semantic Segmentation CVPR 2024 GaussianShader: 3D Gaussian Splatting with Shading Functions for Reflective Surfaces CVPR 2024 A Unified Framework for Human-centric Point Cloud Video Understanding CVPR 2024 Multi-Space Alignments Towards Universal LiDAR Segmentation CVPR 2024 LiveHPS: LiDAR-based Scene-level Human Pose and Shape Estimation in Free Environment CVPR 2024 Part2Object: Hierarchical Unsupervised 3D Instance Segmentation ECCV 2024 GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image ECCV 2024 Learning to Adapt SAM for Segmenting Cross-domain Point Clouds ECCV 2024 WildRefer: 3D Object Localization in Large-scale Dynamic Scenes with Multi-modal Visual Data and Natural Language ECCV 2024 UC-NERF: Neural Radiance Field for Under-Calibrated Multi-View Cameras in Autonomous Driving ICLR 2024 GaussianPro: 3D Gaussian Splatting with Progressive Propagation ICML 2024 RealDex: Towards Human-like Grasping for Robotic Dexterous Hand IJCAI 2024 RoCoSDF: Row-Column Scanned Neural Signed Distance Fields for Freehand 3D Ultrasound Imaging Shape Reconstruction MICCAI 2024 CL3D: Unsupervised Domain Adaptation for Cross-LiDAR 3D Detection AAAI 2023 Towards Label-free Scene Understanding by Vision Foundation Models NIPS 2023 IKOL: Inverse Kinematics Optimization Layer for 3D Human Pose and Shape Estimation via Gauss-Newton Differentiation AAAI 2023 ContrastMotion: Self-supervised Scene Motion Learning for Large-Scale LiDAR Point Clouds IJCAI 2023 Weakly Supervised 3D Multi-Person Pose Estimation for Large-Scale Scenes Based on Monocular Camera and Single LiDAR AAAI 2023 UniSeg: A Unified Multi-Modal LiDAR Segmentation Network and the OpenPCSeg Codebase ICCV 2023 See More and Know More: Zero-shot Point Cloud Segmentation via Multi-modal Visual Data ICCV 2023 Rethinking Range View Representation for LiDAR Segmentation ICCV 2023 Human-centric Scene Understanding for 3D Large-scale Scenarios ICCV 2023 SLOPER4D: A Scene-Aware Dataset for Global 4D Human Pose Estimation in Urban Environments CVPR 2023 CIMI4D: A Large Multimodal Climbing Motion Dataset Under Human-Scene Interactions CVPR 2023 StackFLOW: Monocular Human-Object Reconstruction by Stacked Normalizing Flow with Offset IJCAI 2023 CLIP2Scene: Towards Label-Efficient 3D Scene Understanding by CLIP CVPR 2023 SCPNet: Semantic Scene Completion on Point Cloud CVPR 2023 HSC4D: Human-Centered 4D Scene Capture in Large-Scale Indoor-Outdoor Space Using Wearable IMUs and LiDAR CVPR 2022 STCrowd: A Multimodal Dataset for Pedestrian Perception in Crowded Scenes CVPR 2022 Point-to-Voxel Knowledge Distillation for LiDAR Semantic Segmentation CVPR 2022 SIDE: Center-Based Stereo 3D Detector With Structure-Aware Instance Depth Estimation WACV 2022 LiDARCap: Long-Range Marker-Less 3D Human Motion Capture With LiDAR Point Clouds CVPR 2022 Cylindrical and Asymmetrical 3D Convolution Networks for LiDAR Segmentation CVPR 2021 Projecting Your View Attentively: Monocular Road Scene Layout Estimation via Cross-View Transformation CVPR 2021 ChallenCap: Monocular 3D Capture of Challenging Human Performances Using Multi-Modal References CVPR 2021 AutoTrajectory: Label-free Trajectory Extraction and Prediction from Videos using Dynamic Points ECCV 2020 TrafficPredict: Trajectory Prediction for Heterogeneous Traffic-Agents AAAI 2019 High Performance Gesture Recognition via Effective and Efficient Temporal Modeling IJCAI 2019