Hujun Bao

92 papers · 2018–2026 · 8 conferences · across top CS/AI conferences

Achievements

+12 more ↓

🏃 Academic Marathon (7) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (8) 🐝 Cross-Pollinator (7)

🐝 Cross-Pollinator (7) 🌍 Conference Polyglot (8) 🏃 Academic Marathon (7) 🏠 Conference Loyalist (40) 🤝 Dynamic Duo (42) 🔬 Deep Specialist (47) 🏆 Keyword Champion (8) 📈 Trend Setter 💎 Century Club (90) ⚡ Prolific Year (15) 🔥 Unstoppable (8) 🗃️ Keyword Collector (393)

Conferences

CVPR (40) ICCV (27) NIPS (8) ECCV (7) AAAI (5) ICLR (3) CORL (1) IJCAI (1)

Top co-authors

Xiaowei Zhou (43) Sida Peng (31) Guofeng Zhang (28) Zhaopeng Cui (24) Jiaming Sun (12) Yijin Li (10) Weiwei Xu (8) Qing Shuai (8) Zhaoyang Huang (8) Zhen Xu (8)

Research topics

Techniques (1) Robotics (1)

Keywords

3d reconstruction (22) novel view synthesis (17) neural radiance field (15) point cloud (9) depth estimation (8) scene reconstruction (8) diffusion model (7) view synthesis (7) 3d vision (5) implicit neural representation (5) neural rendering (5) 3d gaussian splatting (5) pose estimation (5) scene understanding (5) visual localization (4) semantic segmentation (4) structure from motion (4) feature extraction (4) image generation (4) differentiable rendering (4)

Papers

StreamingTalker: Audio-driven 3D Facial Animation with Autoregressive Diffusion Model AAAI 2026 One-Shot Refiner: Boosting Feed-forward Novel View Synthesis via One-Step Diffusion AAAI 2026 StreetCrafter: Street View Synthesis with Controllable Video Diffusion Models CVPR 2025 LookCloser: Frequency-aware Radiance Field for Tiny-Detail Scene CVPR 2025 ND-SDF: Learning Normal Deflection Fields for High-Fidelity Indoor Reconstruction ICLR 2025 UniRestore3D: A Scalable Framework For General Shape Restoration ICLR 2025 AccidentalGS: 3D Gaussian Splatting from Accidental Camera Motion ICCV 2025 Precise Action-to-Video Generation Through Visual Action Prompts ICCV 2025 Ready-to-React: Online Reaction Policy for Two-Character Interaction Generation ICLR 2025 GURecon: Learning Detailed 3D Geometric Uncertainties for Neural Surface Reconstruction AAAI 2025 SpatialCrafter: Unleashing the Imagination of Video Diffusion Models for Scene Reconstruction from Limited Observations ICCV 2025 ReTracker: Exploring Image Matching for Robust Online Any Point Tracking ICCV 2025 IntrinsicControlNet: Cross-distribution Image Generation with Real and Unreal ICCV 2025 Hierarchy UGP: Hierarchy Unified Gaussian Primitive for Large-Scale Dynamic Scene Reconstruction ICCV 2025 UniVerse: Unleashing the Scene Prior of Video Diffusion Models for Robust Radiance Field Reconstruction ICCV 2025 InstaScene: Towards Complete 3D Instance Decomposition and Reconstruction from Cluttered Scenes ICCV 2025 GaussianUpdate: Continual 3D Gaussian Splatting Update for Changing Environments ICCV 2025 SpatialTrackerV2: Advancing 3D Point Tracking with Explicit Camera Motion ICCV 2025 Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models ICCV 2025 BlinkTrack: Feature Tracking over 80 FPS via Events and Images ICCV 2025 LightCity: An Urban Dataset for Outdoor Inverse Rendering and Reconstruction under Multi-illumination Conditions ICCV 2025 EgoAgent: A Joint Predictive Agent Model in Egocentric Worlds ICCV 2025 FreeTimeGS: Free Gaussian Primitives at Anytime Anywhere for Dynamic Scene Reconstruction CVPR 2025 MoEE: Mixture of Emotion Experts for Audio-Driven Portrait Animation CVPR 2025 Prompting Depth Anything for 4K Resolution Accurate Metric Depth Estimation CVPR 2025 Multi-view Reconstruction via SfM-guided Monocular Depth Estimation CVPR 2025 EnvGS: Modeling View-Dependent Appearance with Environment Gaussian CVPR 2025 SGFormer: Satellite-Ground Fusion for 3D Semantic Scene Completion CVPR 2025 CG-SLAM: Efficient Dense RGB-D SLAM in a Consistent Uncertainty-aware 3D Gaussian Field ECCV 2024 ETO:Efficient Transformer-based Local Feature Matching by Organizing Multiple Homography Hypotheses NIPS 2024 A Global Depth-Range-Free Multi-View Stereo Transformer Network with Pose Embedding NIPS 2024 PNeRFLoc: Visual Localization with Point-Based Neural Radiance Fields AAAI 2024 Boosting Image Restoration via Priors from Pre-trained Models CVPR 2024 3D-SceneDreamer: Text-Driven 3D-Consistent Scene Generation CVPR 2024 GeneAvatar: Generic Expression-Aware Volumetric Head Avatar Editing from a Single Image CVPR 2024 Generating Human Motion in 3D Scenes from Text Descriptions CVPR 2024 Detector-Free Structure from Motion CVPR 2024 4K4D: Real-Time 4D View Synthesis at 4K Resolution CVPR 2024 Relightable and Animatable Neural Avatar from Sparse-View Video CVPR 2024 "BlinkVision: A Benchmark for Optical Flow, Scene Flow and Point Tracking Estimation using RGB Frames and Events" ECCV 2024 Error-aware Sampling in Adaptive Shells for Neural Surface Reconstruction IJCAI 2024 CP-SLAM: Collaborative Neural Point-based SLAM System NIPS 2023 Representing Volumetric Videos As Dynamic MLP Maps CVPR 2023 PATS: Patch Area Transportation With Subdivision for Local Feature Matching CVPR 2023 SINE: Semantic-Driven Image-Based NeRF Editing With Prior-Guided Editing Field CVPR 2023 Learning Human Mesh Recovery in 3D Scenes CVPR 2023 Multi-Modal Neural Radiance Field for Monocular Dense SLAM with a Light-Weight ToF Sensor ICCV 2023 DPS-Net: Deep Polarimetric Stereo Depth Estimation ICCV 2023 Hierarchical Generation of Human-Object Interactions with Diffusion Probabilistic Models ICCV 2023 IntrinsicNeRF: Learning Intrinsic Neural Radiance Fields for Editable Novel View Synthesis ICCV 2023 PVO: Panoptic Visual Odometry CVPR 2023 CF-Font: Content Fusion for Few-Shot Font Generation CVPR 2023 I2-SDF: Intrinsic Indoor Scene Reconstruction and Editing via Raytracing in Neural SDFs CVPR 2023 Learning Neural Volumetric Representations of Dynamic Humans in Minutes CVPR 2023 AutoRecon: Automated 3D Object Discovery and Reconstruction CVPR 2023 Compact Neural Volumetric Video Representations with Dynamic Codebooks NIPS 2023 NeuMesh: Learning Disentangled Neural Mesh-Based Implicit Field for Geometry and Texture Editing ECCV 2022 Geometry-aware Two-scale PIFu Representation for Human Reconstruction NIPS 2022 Active Boundary Loss for Semantic Segmentation AAAI 2022 OnePose++: Keypoint-Free One-Shot Object Pose Estimation without CAD Models NIPS 2022 TotalSelfScan: Learning Full-body Avatars from Self-Portrait Videos of Faces, Hands, and Bodies NIPS 2022 Neural 3D Scene Reconstruction With the Manhattan-World Assumption CVPR 2022 SelfRecon: Self Reconstruction Your Digital Avatar From Monocular Video CVPR 2022 NICE-SLAM: Neural Implicit Scalable Encoding for SLAM CVPR 2022 DELTAR: Depth Estimation from a Light-Weight ToF Sensor and RGB Image ECCV 2022 Learning Object-Compositional Neural Radiance Field for Editable Scene Rendering ICCV 2021 You Don't Only Look Once: Constructing Spatial-Temporal Memory for Integrated 3D Object Detection and Tracking ICCV 2021 DeepPanoContext: Panoramic 3D Scene Understanding With Holistic Scene Context Graph and Relation-Based Optimization ICCV 2021 Recurrent Multi-View Alignment Network for Unsupervised Surface Registration CVPR 2021 LoFTR: Detector-Free Local Feature Matching With Transformers CVPR 2021 VS-Net: Voting With Segmentation for Visual Localization CVPR 2021 StereoPIFu: Depth Aware Clothed Human Digitization via Stereo Vision CVPR 2021 Neural Body: Implicit Neural Representations With Structured Latent Codes for Novel View Synthesis of Dynamic Humans CVPR 2021 Reconstructing 3D Human Pose by Watching Humans in the Mirror CVPR 2021 NeuralRecon: Real-Time Coherent 3D Reconstruction From Monocular Video CVPR 2021 Location-Aware Single Image Reflection Removal ICCV 2021 AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis ICCV 2021 Graph-Based Asynchronous Event Processing for Rapid Object Recognition ICCV 2021 Animatable Neural Radiance Fields for Modeling Dynamic Human Bodies ICCV 2021 BCNet: Learning Body and Cloth Shape from A Single Image ECCV 2020 Disp R-CNN: Stereo 3D Object Detection via Shape Prior Guided Instance Disparity Estimation CVPR 2020 SMAP: Single-Shot Multi-Person Absolute 3D Pose Estimation ECCV 2020 SelfVoxeLO: Self-supervised LiDAR Odometry with Voxel-based Deep Neural Networks CORL 2020 Deep Snake for Real-Time Instance Segmentation CVPR 2020 Motion Capture from Internet Videos ECCV 2020 Prior Guided Dropout for Robust Visual Localization in Dynamic Environments ICCV 2019 Fast and Robust Multi-Person 3D Pose Estimation From Multiple Views CVPR 2019 A Late Fusion CNN for Digital Matting CVPR 2019 PVNet: Pixel-Wise Voting Network for 6DoF Pose Estimation CVPR 2019 GIFT: Learning Transformation-Invariant Dense Visual Descriptors via Group CNNs NIPS 2019 Depth Completion From Sparse LiDAR Data With Depth-Normal Constraints ICCV 2019 ICE-BA: Incremental, Consistent and Efficient Bundle Adjustment for Visual-Inertial SLAM CVPR 2018