Yuexin Ma
60 papers · 2019–2026 · 10 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+15 more ↓ Show less ↑
๐ฃ Hot Topic Early Bird ๐ Conference Polyglot (10) ๐งญ Keyword Pioneer ๐ Interdisciplinary Bridge ๐ Academic Marathon (6)
๐
Interdisciplinary Bridge
๐งญ
Keyword Pioneer
๐ฃ
Hot Topic Early Bird
๐
Conference Loyalist
(24)
๐ค
Dynamic Duo
(26)
๐
Triple Crown
๐
Grand Slam
๐ฌ
Deep Specialist
(26)
๐
Keyword Champion
(5)
๐
Conference Pioneer
โก
Prolific Year
(19)
๐๏ธ
Keyword Collector
(234)
โ
The Questioner
๐
Century Club
(59)
๐ฅ
Unstoppable
(7)
Conferences
CVPR (24)
AAAI (9)
ICCV (8)
ECCV (6)
IJCAI (4)
ICLR (2)
ICML (2)
NIPS (2)
WACV (2)
MICCAI (1)
Top co-authors
Research topics
Keywords
point cloud
(11)
semantic segmentation
(9)
autonomous driving
(9)
multi-modal learning
(8)
human pose estimation
(8)
3d reconstruction
(6)
3d vision
(6)
lidar point cloud
(5)
multimodal learning
(5)
motion capture
(5)
domain adaptation
(5)
knowledge distillation
(5)
pose estimation
(5)
lidar segmentation
(5)
diffusion model
(4)
scene understanding
(4)
human motion capture
(4)
sensor fusion
(4)
3d pose estimation
(4)
3d human pose estimation
(4)
Papers
Affordance-R1: Reinforcement Learning for Generalizable Affordance Reasoning in Multimodal Large Language Models
AAAI 2026
DexGrasp Anything: Towards Universal Robotic Dexterous Grasping with Physics Awareness
CVPR 2025
ClimbingCap: Multi-Modal Dataset and Method for Rock Climbing in World Coordinate
CVPR 2025
Generalizable Single-View Object Pose Estimation by Two-Side Generating and Matching
WACV 2025
UniDB: A Unified Diffusion Bridge Framework via Stochastic Optimal Control
ICML 2025
CityAnchor: City-scale 3D Visual Grounding with Multi-modality LLMs
ICLR 2025
Can LVLMs Obtain a Driverโs License? A Benchmark Towards Reliable AGI for Autonomous Driving
AAAI 2025
FreeCap: Hybrid Calibration-Free Motion Capture in Open Environments
AAAI 2025
UniDemoirรฉ: Towards Universal Image Demoirรฉing with Data Generation and Synthesis
AAAI 2025
ReAL-AD: Towards Human-Like Reasoning in End-to-End Autonomous Driving
ICCV 2025
DexH2R: A Benchmark for Dynamic Dexterous Grasping in Human-to-Robot Handover
ICCV 2025
Towards Immersive Human-X Interaction: A Real-Time Framework for Physically Plausible Motion Synthesis
ICCV 2025
EvolvingGrasp: Evolutionary Grasp Generation via Efficient Preference Alignment
ICCV 2025
EasyHOI: Unleashing the Power of Large Models for Reconstructing Hand-Object Interactions in the Wild
CVPR 2025
OccMamba: Semantic Occupancy Prediction with State Space Models
CVPR 2025
SemGeoMo: Dynamic Contextual Human Motion Generation with Semantic and Geometric Guidance
CVPR 2025
LiveHPS++: Robust and Coherent Motion Capture in Dynamic Free Environment
ECCV 2024
OctreeOcc: Efficient and Multi-Granularity Occupancy Prediction Using Octree Queries
NIPS 2024
HybridGait: A Benchmark for Spatial-Temporal Cloth-Changing Gait Recognition with Hybrid Explorations
AAAI 2024
Wonder3D: Single Image to 3D using Cross-Domain Diffusion
CVPR 2024
HUNTER: Unsupervised Human-centric 3D Detection via Transferring Knowledge from Synthetic Instances to Real Scenes
CVPR 2024
RELI11D: A Comprehensive Multimodal Human Motion Dataset and Method
CVPR 2024
TASeg: Temporal Aggregation Network for LiDAR Semantic Segmentation
CVPR 2024
GaussianShader: 3D Gaussian Splatting with Shading Functions for Reflective Surfaces
CVPR 2024
A Unified Framework for Human-centric Point Cloud Video Understanding
CVPR 2024
Multi-Space Alignments Towards Universal LiDAR Segmentation
CVPR 2024
LiveHPS: LiDAR-based Scene-level Human Pose and Shape Estimation in Free Environment
CVPR 2024
Part2Object: Hierarchical Unsupervised 3D Instance Segmentation
ECCV 2024
GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image
ECCV 2024
Learning to Adapt SAM for Segmenting Cross-domain Point Clouds
ECCV 2024
WildRefer: 3D Object Localization in Large-scale Dynamic Scenes with Multi-modal Visual Data and Natural Language
ECCV 2024
UC-NERF: Neural Radiance Field for Under-Calibrated Multi-View Cameras in Autonomous Driving
ICLR 2024
GaussianPro: 3D Gaussian Splatting with Progressive Propagation
ICML 2024
RealDex: Towards Human-like Grasping for Robotic Dexterous Hand
IJCAI 2024
RoCoSDF: Row-Column Scanned Neural Signed Distance Fields for Freehand 3D Ultrasound Imaging Shape Reconstruction
MICCAI 2024
CL3D: Unsupervised Domain Adaptation for Cross-LiDAR 3D Detection
AAAI 2023
Towards Label-free Scene Understanding by Vision Foundation Models
NIPS 2023
IKOL: Inverse Kinematics Optimization Layer for 3D Human Pose and Shape Estimation via Gauss-Newton Differentiation
AAAI 2023
ContrastMotion: Self-supervised Scene Motion Learning for Large-Scale LiDAR Point Clouds
IJCAI 2023
Weakly Supervised 3D Multi-Person Pose Estimation for Large-Scale Scenes Based on Monocular Camera and Single LiDAR
AAAI 2023
UniSeg: A Unified Multi-Modal LiDAR Segmentation Network and the OpenPCSeg Codebase
ICCV 2023
See More and Know More: Zero-shot Point Cloud Segmentation via Multi-modal Visual Data
ICCV 2023
Rethinking Range View Representation for LiDAR Segmentation
ICCV 2023
Human-centric Scene Understanding for 3D Large-scale Scenarios
ICCV 2023
SLOPER4D: A Scene-Aware Dataset for Global 4D Human Pose Estimation in Urban Environments
CVPR 2023
CIMI4D: A Large Multimodal Climbing Motion Dataset Under Human-Scene Interactions
CVPR 2023
StackFLOW: Monocular Human-Object Reconstruction by Stacked Normalizing Flow with Offset
IJCAI 2023
CLIP2Scene: Towards Label-Efficient 3D Scene Understanding by CLIP
CVPR 2023
SCPNet: Semantic Scene Completion on Point Cloud
CVPR 2023
HSC4D: Human-Centered 4D Scene Capture in Large-Scale Indoor-Outdoor Space Using Wearable IMUs and LiDAR
CVPR 2022
STCrowd: A Multimodal Dataset for Pedestrian Perception in Crowded Scenes
CVPR 2022
Point-to-Voxel Knowledge Distillation for LiDAR Semantic Segmentation
CVPR 2022
SIDE: Center-Based Stereo 3D Detector With Structure-Aware Instance Depth Estimation
WACV 2022
LiDARCap: Long-Range Marker-Less 3D Human Motion Capture With LiDAR Point Clouds
CVPR 2022
Cylindrical and Asymmetrical 3D Convolution Networks for LiDAR Segmentation
CVPR 2021
Projecting Your View Attentively: Monocular Road Scene Layout Estimation via Cross-View Transformation
CVPR 2021
ChallenCap: Monocular 3D Capture of Challenging Human Performances Using Multi-Modal References
CVPR 2021
AutoTrajectory: Label-free Trajectory Extraction and Prediction from Videos using Dynamic Points
ECCV 2020
TrafficPredict: Trajectory Prediction for Heterogeneous Traffic-Agents
AAAI 2019
High Performance Gesture Recognition via Effective and Efficient Temporal Modeling
IJCAI 2019