Hao Zhao
58 papers · 2017–2026 · 15 top CS/AI conferences
Achievements
🌍 Conference Polyglot (15)
🏃 Academic Marathon (9)
🐣 Hot Topic Early Bird
🌉 Interdisciplinary Bridge
🐝 Cross-Pollinator (12)
🤝 Dynamic Duo (12)
👑 Triple Crown
🏆 Grand Slam
👥 Mega-Team (22)
🌱 Topic Pioneer
🔬 Deep Specialist (15)
❓ The Questioner
🚀 Conference Pioneer
🗃️ Keyword Collector (213)
🔥 Unstoppable (5)
⚡ Prolific Year (22)
💎 Century Club (57)
Conferences
CVPR (14)
ICCV (10)
ECCV (6)
CORL (4)
ICLR (4)
NIPS (4)
WACV (4)
ICML (3)
ACL (2)
EMNLP (2)
AAAI (1)
COLING (1)
IJCAI (1)
MICCAI (1)
RSS (1)
Keywords
point cloud (6)
autonomous driving (5)
large language model (4)
3d reconstruction (3)
convolutional neural network (3)
neural radiance field (3)
multi-task learning (3)
scene understanding (3)
diffusion model (3)
gaussian splatting (3)
3d object detection (2)
implicit representation (2)
neural network optimization (2)
instance segmentation (2)
semantic parsing (2)
video generation (2)
benchmark evaluation (2)
knowledge distillation (2)
depth estimation (2)
neural rendering (2)
Papers
Apertus: Democratizing Open and Compliant LLMs for Global Language Environments
ACL 2026
3DSceneEditor: Controllable 3D Scene Editing with Gaussian Splatting
WACV 2026
FB-4D: Spatial-Temporal Coherent Dynamic 3D Content Generation with Feature Banks
WACV 2026
LiON: Learning Point-Wise Abstaining Penalty for LiDAR Outlier DetectioN Using Diverse Synthetic Data
AAAI 2025
GS-Occ3D: Scaling Vision-only Occupancy Reconstruction with Gaussian Splatting
ICCV 2025
DiST-4D: Disentangled Spatiotemporal Diffusion with Metric Depth for 4D Driving Scene Generation
ICCV 2025
LLMsPark: A Benchmark for Evaluating Large Language Models in Strategic Gaming Contexts
EMNLP 2025
Detect Anything 3D in the Wild
ICCV 2025
Hi3DGen: High-fidelity 3D Geometry Generation from Images via Normal Bridging
ICCV 2025
Idea23D: Collaborative LMM Agents Enable 3D Model Generation from Interleaved Multimodal Inputs
COLING 2025
Diffusion-Based Visual Anagram as Multi-Task Learning
WACV 2025
Self-Aligning Depth-Regularized Radiance Fields for Asynchronous RGB-D Sequences
WACV 2025
Morpheus: A Neural-driven Animatronic Face with Hybrid Actuation and Diverse Emotion Control
RSS 2025
In-Context Meta LoRA Generation
IJCAI 2025
Analytical Lyapunov Function Discovery: An RL-based Generative Approach
ICML 2025
Is In-Context Learning Sufficient for Instruction Following in LLMs?
ICLR 2025
One View, Many Worlds: Single-Image to 3D object Meets Generative Domain Randomization for One-Shot 6D Pose Estimation
CORL 2025
RoboChemist: Long-Horizon and Safety-Compliant Robotic Chemical Experimentation
CORL 2025
PartRM: Modeling Part-Level Dynamics with Large Cross-State Reconstruction Model
CVPR 2025
PhysGen3D: Crafting a Miniature Interactive World from a Single Image
CVPR 2025
UniScene: Unified Occupancy-centric Driving Scene Generation
CVPR 2025
Ctrl-U: Robust Conditional Image Generation via Uncertainty-aware Reward Modeling
ICLR 2025
Reversible Decoupling Network for Single Image Reflection Removal
CVPR 2025
InvRGB+L: Inverse Rendering of Complex Scenes with Unified Color and LiDAR Reflectance Modeling
ICCV 2025
Elucidating the Design Space of Torque-aware Vision-Language-Action Models
CORL 2025
Structured-NeRF: Hierarchical Scene Graph with Neural Representation
ECCV 2024
Dual-frame Fluid Motion Estimation with Test-time Optimization and Zero-divergence Loss
NIPS 2024
Hint-AD: Holistically Aligned Interpretability in End-to-End Autonomous Driving
CORL 2024
HyperMoE: Towards Better Mixture of Experts via Transferring Among Experts
ACL 2024
FastMAC: Stochastic Spectral Sampling of Correspondence Graph
CVPR 2024
SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis
CVPR 2024
Editable Scene Simulation for Autonomous Driving via Collaborative LLM-Agents
CVPR 2024
TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes
ECCV 2024
SCP-Diff: Spatial-Categorical Joint Prior for Diffusion Based Semantic Image Synthesis
ECCV 2024
Training-Free Model Merging for Multi-target Domain Adaptation
ECCV 2024
Increasing Model Capacity for Free: A Simple Strategy for Parameter Efficient Fine-tuning
ICLR 2024
Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning
ICML 2024
FairDiff: Fair Segmentation with Point-Image Diffusion
MICCAI 2024
DQS3D: Densely-matched Quantization-aware Semi-supervised 3D Detection
ICCV 2023
Prototype-based HyperAdapter for Sample-Efficient Multi-task Tuning
EMNLP 2023
Understanding Embodied Reference with Touch-Line Transformer
ICLR 2023
DPF: Learning Dense Prediction Fields With Weak Supervision
CVPR 2023
On Pitfalls of Test-Time Adaptation
ICML 2023
PAD: A Dataset and Benchmark for Pose-agnostic Anomaly Detection
NIPS 2023
Delving Into Shape-Aware Zero-Shot Semantic Segmentation
CVPR 2023
3D Implicit Transporter for Temporally Consistent Keypoint Discovery
ICCV 2023
INT2: Interactive Trajectory Prediction at Intersections
ICCV 2023
Cerberus Transformer: Joint Semantic, Affordance and Attribute Parsing
CVPR 2022
High-Fidelity Human Avatars From a Single RGB Camera
CVPR 2022
SNAKE: Shape-aware Neural 3D Keypoint Field
NIPS 2022
SC-wLS: Towards Interpretable Feed-Forward Camera Re-localization
ECCV 2022
TOIST: Task Oriented Instance Segmentation Transformer with Noun-Pronoun Distillation
NIPS 2022
A Closed-Form Solution to Universal Style Transfer
ICCV 2019
Deeply-Supervised Knowledge Synergy
CVPR 2019
Efficient Semantic Scene Completion Network with Spatial Group Convolution
ECCV 2018
Physics Inspired Optimization on Semantic Transfer Features: An Alternative Method for Room Layout Estimation
CVPR 2017
Network Sketching: Exploiting Binary Structure in Deep CNNs
CVPR 2017
Decoder Network Over Lightweight Reconstructed Feature for Fast Semantic Style Transfer
ICCV 2017