Hujun Bao
92 papers · 2018–2026 · 8 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+12 more ↓ Show less ↑
🏃 Academic Marathon (7) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (8) 🐝 Cross-Pollinator (7)
🐝
Cross-Pollinator
(7)
🌍
Conference Polyglot
(8)
🏃
Academic Marathon
(7)
🏠
Conference Loyalist
(40)
🤝
Dynamic Duo
(42)
🔬
Deep Specialist
(47)
🏆
Keyword Champion
(8)
📈
Trend Setter
💎
Century Club
(90)
⚡
Prolific Year
(15)
🔥
Unstoppable
(8)
🗃️
Keyword Collector
(393)
Conferences
CVPR (40)
ICCV (27)
NIPS (8)
ECCV (7)
AAAI (5)
ICLR (3)
CORL (1)
IJCAI (1)
Top co-authors
Research topics
Keywords
3d reconstruction
(22)
novel view synthesis
(17)
neural radiance field
(15)
point cloud
(9)
depth estimation
(8)
scene reconstruction
(8)
diffusion model
(7)
view synthesis
(7)
3d vision
(5)
implicit neural representation
(5)
neural rendering
(5)
3d gaussian splatting
(5)
pose estimation
(5)
scene understanding
(5)
visual localization
(4)
semantic segmentation
(4)
structure from motion
(4)
feature extraction
(4)
image generation
(4)
differentiable rendering
(4)
Papers
StreamingTalker: Audio-driven 3D Facial Animation with Autoregressive Diffusion Model
AAAI 2026
One-Shot Refiner: Boosting Feed-forward Novel View Synthesis via One-Step Diffusion
AAAI 2026
StreetCrafter: Street View Synthesis with Controllable Video Diffusion Models
CVPR 2025
LookCloser: Frequency-aware Radiance Field for Tiny-Detail Scene
CVPR 2025
ND-SDF: Learning Normal Deflection Fields for High-Fidelity Indoor Reconstruction
ICLR 2025
UniRestore3D: A Scalable Framework For General Shape Restoration
ICLR 2025
AccidentalGS: 3D Gaussian Splatting from Accidental Camera Motion
ICCV 2025
Precise Action-to-Video Generation Through Visual Action Prompts
ICCV 2025
Ready-to-React: Online Reaction Policy for Two-Character Interaction Generation
ICLR 2025
GURecon: Learning Detailed 3D Geometric Uncertainties for Neural Surface Reconstruction
AAAI 2025
SpatialCrafter: Unleashing the Imagination of Video Diffusion Models for Scene Reconstruction from Limited Observations
ICCV 2025
ReTracker: Exploring Image Matching for Robust Online Any Point Tracking
ICCV 2025
IntrinsicControlNet: Cross-distribution Image Generation with Real and Unreal
ICCV 2025
Hierarchy UGP: Hierarchy Unified Gaussian Primitive for Large-Scale Dynamic Scene Reconstruction
ICCV 2025
UniVerse: Unleashing the Scene Prior of Video Diffusion Models for Robust Radiance Field Reconstruction
ICCV 2025
InstaScene: Towards Complete 3D Instance Decomposition and Reconstruction from Cluttered Scenes
ICCV 2025
GaussianUpdate: Continual 3D Gaussian Splatting Update for Changing Environments
ICCV 2025
SpatialTrackerV2: Advancing 3D Point Tracking with Explicit Camera Motion
ICCV 2025
Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models
ICCV 2025
BlinkTrack: Feature Tracking over 80 FPS via Events and Images
ICCV 2025
LightCity: An Urban Dataset for Outdoor Inverse Rendering and Reconstruction under Multi-illumination Conditions
ICCV 2025
EgoAgent: A Joint Predictive Agent Model in Egocentric Worlds
ICCV 2025
FreeTimeGS: Free Gaussian Primitives at Anytime Anywhere for Dynamic Scene Reconstruction
CVPR 2025
MoEE: Mixture of Emotion Experts for Audio-Driven Portrait Animation
CVPR 2025
Prompting Depth Anything for 4K Resolution Accurate Metric Depth Estimation
CVPR 2025
Multi-view Reconstruction via SfM-guided Monocular Depth Estimation
CVPR 2025
EnvGS: Modeling View-Dependent Appearance with Environment Gaussian
CVPR 2025
SGFormer: Satellite-Ground Fusion for 3D Semantic Scene Completion
CVPR 2025
CG-SLAM: Efficient Dense RGB-D SLAM in a Consistent Uncertainty-aware 3D Gaussian Field
ECCV 2024
ETO:Efficient Transformer-based Local Feature Matching by Organizing Multiple Homography Hypotheses
NIPS 2024
A Global Depth-Range-Free Multi-View Stereo Transformer Network with Pose Embedding
NIPS 2024
PNeRFLoc: Visual Localization with Point-Based Neural Radiance Fields
AAAI 2024
Boosting Image Restoration via Priors from Pre-trained Models
CVPR 2024
3D-SceneDreamer: Text-Driven 3D-Consistent Scene Generation
CVPR 2024
GeneAvatar: Generic Expression-Aware Volumetric Head Avatar Editing from a Single Image
CVPR 2024
Generating Human Motion in 3D Scenes from Text Descriptions
CVPR 2024
Detector-Free Structure from Motion
CVPR 2024
4K4D: Real-Time 4D View Synthesis at 4K Resolution
CVPR 2024
Relightable and Animatable Neural Avatar from Sparse-View Video
CVPR 2024
"BlinkVision: A Benchmark for Optical Flow, Scene Flow and Point Tracking Estimation using RGB Frames and Events"
ECCV 2024
Error-aware Sampling in Adaptive Shells for Neural Surface Reconstruction
IJCAI 2024
CP-SLAM: Collaborative Neural Point-based SLAM System
NIPS 2023
Representing Volumetric Videos As Dynamic MLP Maps
CVPR 2023
PATS: Patch Area Transportation With Subdivision for Local Feature Matching
CVPR 2023
SINE: Semantic-Driven Image-Based NeRF Editing With Prior-Guided Editing Field
CVPR 2023
Learning Human Mesh Recovery in 3D Scenes
CVPR 2023
Multi-Modal Neural Radiance Field for Monocular Dense SLAM with a Light-Weight ToF Sensor
ICCV 2023
DPS-Net: Deep Polarimetric Stereo Depth Estimation
ICCV 2023
Hierarchical Generation of Human-Object Interactions with Diffusion Probabilistic Models
ICCV 2023
IntrinsicNeRF: Learning Intrinsic Neural Radiance Fields for Editable Novel View Synthesis
ICCV 2023
PVO: Panoptic Visual Odometry
CVPR 2023
CF-Font: Content Fusion for Few-Shot Font Generation
CVPR 2023
I2-SDF: Intrinsic Indoor Scene Reconstruction and Editing via Raytracing in Neural SDFs
CVPR 2023
Learning Neural Volumetric Representations of Dynamic Humans in Minutes
CVPR 2023
AutoRecon: Automated 3D Object Discovery and Reconstruction
CVPR 2023
Compact Neural Volumetric Video Representations with Dynamic Codebooks
NIPS 2023
NeuMesh: Learning Disentangled Neural Mesh-Based Implicit Field for Geometry and Texture Editing
ECCV 2022
Geometry-aware Two-scale PIFu Representation for Human Reconstruction
NIPS 2022
Active Boundary Loss for Semantic Segmentation
AAAI 2022
OnePose++: Keypoint-Free One-Shot Object Pose Estimation without CAD Models
NIPS 2022
TotalSelfScan: Learning Full-body Avatars from Self-Portrait Videos of Faces, Hands, and Bodies
NIPS 2022
Neural 3D Scene Reconstruction With the Manhattan-World Assumption
CVPR 2022
SelfRecon: Self Reconstruction Your Digital Avatar From Monocular Video
CVPR 2022
NICE-SLAM: Neural Implicit Scalable Encoding for SLAM
CVPR 2022
DELTAR: Depth Estimation from a Light-Weight ToF Sensor and RGB Image
ECCV 2022
Learning Object-Compositional Neural Radiance Field for Editable Scene Rendering
ICCV 2021
You Don't Only Look Once: Constructing Spatial-Temporal Memory for Integrated 3D Object Detection and Tracking
ICCV 2021
DeepPanoContext: Panoramic 3D Scene Understanding With Holistic Scene Context Graph and Relation-Based Optimization
ICCV 2021
Recurrent Multi-View Alignment Network for Unsupervised Surface Registration
CVPR 2021
LoFTR: Detector-Free Local Feature Matching With Transformers
CVPR 2021
VS-Net: Voting With Segmentation for Visual Localization
CVPR 2021
StereoPIFu: Depth Aware Clothed Human Digitization via Stereo Vision
CVPR 2021
Neural Body: Implicit Neural Representations With Structured Latent Codes for Novel View Synthesis of Dynamic Humans
CVPR 2021
Reconstructing 3D Human Pose by Watching Humans in the Mirror
CVPR 2021
NeuralRecon: Real-Time Coherent 3D Reconstruction From Monocular Video
CVPR 2021
Location-Aware Single Image Reflection Removal
ICCV 2021
AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis
ICCV 2021
Graph-Based Asynchronous Event Processing for Rapid Object Recognition
ICCV 2021
Animatable Neural Radiance Fields for Modeling Dynamic Human Bodies
ICCV 2021
BCNet: Learning Body and Cloth Shape from A Single Image
ECCV 2020
Disp R-CNN: Stereo 3D Object Detection via Shape Prior Guided Instance Disparity Estimation
CVPR 2020
SMAP: Single-Shot Multi-Person Absolute 3D Pose Estimation
ECCV 2020
SelfVoxeLO: Self-supervised LiDAR Odometry with Voxel-based Deep Neural Networks
CORL 2020
Deep Snake for Real-Time Instance Segmentation
CVPR 2020
Motion Capture from Internet Videos
ECCV 2020
Prior Guided Dropout for Robust Visual Localization in Dynamic Environments
ICCV 2019
Fast and Robust Multi-Person 3D Pose Estimation From Multiple Views
CVPR 2019
A Late Fusion CNN for Digital Matting
CVPR 2019
PVNet: Pixel-Wise Voting Network for 6DoF Pose Estimation
CVPR 2019
GIFT: Learning Transformation-Invariant Dense Visual Descriptors via Group CNNs
NIPS 2019
Depth Completion From Sparse LiDAR Data With Depth-Normal Constraints
ICCV 2019
ICE-BA: Incremental, Consistent and Efficient Bundle Adjustment for Visual-Inertial SLAM
CVPR 2018