Hao Zhao

58 papers · 2017–2026 · 15 conferences · across top CS/AI conferences

Achievements

+15 more ↓

🌍 Conference Polyglot (15) 🏃 Academic Marathon (9) 🐣 Hot Topic Early Bird 🌉 Interdisciplinary Bridge 🐝 Cross-Pollinator (12)

🌉 Interdisciplinary Bridge 🐣 Hot Topic Early Bird 🐝 Cross-Pollinator (12) 🤝 Dynamic Duo (12) 👑 Triple Crown 🏆 Grand Slam 👥 Mega-Team (22) 🌱 Topic Pioneer 🔬 Deep Specialist (15) ❓ The Questioner 🚀 Conference Pioneer 🗃️ Keyword Collector (213) 🔥 Unstoppable (5) ⚡ Prolific Year (22) 💎 Century Club (57)

Conferences

CVPR (14) ICCV (10) ECCV (6) CORL (4) ICLR (4) NIPS (4) WACV (4) ICML (3) ACL (2) EMNLP (2) AAAI (1) COLING (1) IJCAI (1) MICCAI (1) RSS (1)

Top co-authors

Guyue Zhou (12) Huan-ang Gao (12) Xiaoxue Chen (7) Pengfei Li (6) Anbang Yao (6) Mingju Gao (5) Wenyi Li (5) Yurong Chen (5) Li Zhang (5) Zongzheng Zhang (5)

Keywords

point cloud (6) autonomous driving (5) large language model (4) 3d reconstruction (3) convolutional neural network (3) neural radiance field (3) multi-task learning (3) scene understanding (3) diffusion model (3) gaussian splatting (3) 3d object detection (2) implicit representation (2) neural network optimization (2) instance segmentation (2) semantic parsing (2) video generation (2) benchmark evaluation (2) knowledge distillation (2) depth estimation (2) neural rendering (2)

Papers

Apertus: Democratizing Open and Compliant LLMs for Global Language Environments ACL 2026 3DSceneEditor: Controllable 3D Scene Editing with Gaussian Splatting WACV 2026 FB-4D: Spatial-Temporal Coherent Dynamic 3D Content Generation with Feature Banks WACV 2026 LiON: Learning Point-Wise Abstaining Penalty for LiDAR Outlier DetectioN Using Diverse Synthetic Data AAAI 2025 GS-Occ3D: Scaling Vision-only Occupancy Reconstruction with Gaussian Splatting ICCV 2025 DiST-4D: Disentangled Spatiotemporal Diffusion with Metric Depth for 4D Driving Scene Generation ICCV 2025 LLMsPark: A Benchmark for Evaluating Large Language Models in Strategic Gaming Contexts EMNLP 2025 Detect Anything 3D in the Wild ICCV 2025 Hi3DGen: High-fidelity 3D Geometry Generation from Images via Normal Bridging ICCV 2025 Idea23D: Collaborative LMM Agents Enable 3D Model Generation from Interleaved Multimodal Inputs COLING 2025 Diffusion-Based Visual Anagram as Multi-Task Learning WACV 2025 Self-Aligning Depth-Regularized Radiance Fields for Asynchronous RGB-D Sequences WACV 2025 Morpheus: A Neural-driven Animatronic Face with Hybrid Actuation and Diverse Emotion Control RSS 2025 In-Context Meta LoRA Generation IJCAI 2025 Analytical Lyapunov Function Discovery: An RL-based Generative Approach ICML 2025 Is In-Context Learning Sufficient for Instruction Following in LLMs? ICLR 2025 One View, Many Worlds: Single-Image to 3D object Meets Generative Domain Randomization for One-Shot 6D Pose Estimation CORL 2025 RoboChemist: Long-Horizon and Safety-Compliant Robotic Chemical Experimentation CORL 2025 PartRM: Modeling Part-Level Dynamics with Large Cross-State Reconstruction Model CVPR 2025 PhysGen3D: Crafting a Miniature Interactive World from a Single Image CVPR 2025 UniScene: Unified Occupancy-centric Driving Scene Generation CVPR 2025 Ctrl-U: Robust Conditional Image Generation via Uncertainty-aware Reward Modeling ICLR 2025 Reversible Decoupling Network for Single Image Reflection Removal CVPR 2025 InvRGB+L: Inverse Rendering of Complex Scenes with Unified Color and LiDAR Reflectance Modeling ICCV 2025 Elucidating the Design Space of Torque-aware Vision-Language-Action Models CORL 2025 Structured-NeRF: Hierarchical Scene Graph with Neural Representation ECCV 2024 Dual-frame Fluid Motion Estimation with Test-time Optimization and Zero-divergence Loss NIPS 2024 Hint-AD: Holistically Aligned Interpretability in End-to-End Autonomous Driving CORL 2024 HyperMoE: Towards Better Mixture of Experts via Transferring Among Experts ACL 2024 FastMAC: Stochastic Spectral Sampling of Correspondence Graph CVPR 2024 SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis CVPR 2024 Editable Scene Simulation for Autonomous Driving via Collaborative LLM-Agents CVPR 2024 TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes ECCV 2024 SCP-Diff: Spatial-Categorical Joint Prior for Diffusion Based Semantic Image Synthesis ECCV 2024 Training-Free Model Merging for Multi-target Domain Adaptation ECCV 2024 Increasing Model Capacity for Free: A Simple Strategy for Parameter Efficient Fine-tuning ICLR 2024 Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning ICML 2024 FairDiff: Fair Segmentation with Point-Image Diffusion MICCAI 2024 DQS3D: Densely-matched Quantization-aware Semi-supervised 3D Detection ICCV 2023 Prototype-based HyperAdapter for Sample-Efficient Multi-task Tuning EMNLP 2023 Understanding Embodied Reference with Touch-Line Transformer ICLR 2023 DPF: Learning Dense Prediction Fields With Weak Supervision CVPR 2023 On Pitfalls of Test-Time Adaptation ICML 2023 PAD: A Dataset and Benchmark for Pose-agnostic Anomaly Detection NIPS 2023 Delving Into Shape-Aware Zero-Shot Semantic Segmentation CVPR 2023 3D Implicit Transporter for Temporally Consistent Keypoint Discovery ICCV 2023 INT2: Interactive Trajectory Prediction at Intersections ICCV 2023 Cerberus Transformer: Joint Semantic, Affordance and Attribute Parsing CVPR 2022 High-Fidelity Human Avatars From a Single RGB Camera CVPR 2022 SNAKE: Shape-aware Neural 3D Keypoint Field NIPS 2022 SC-wLS: Towards Interpretable Feed-Forward Camera Re-localization ECCV 2022 TOIST: Task Oriented Instance Segmentation Transformer with Noun-Pronoun Distillation NIPS 2022 A Closed-Form Solution to Universal Style Transfer ICCV 2019 Deeply-Supervised Knowledge Synergy CVPR 2019 Efficient Semantic Scene Completion Network with Spatial Group Convolution ECCV 2018 Physics Inspired Optimization on Semantic Transfer Features: An Alternative Method for Room Layout Estimation CVPR 2017 Network Sketching: Exploiting Binary Structure in Deep CNNs CVPR 2017 Decoder Network Over Lightweight Reconstructed Feature for Fast Semantic Style Transfer ICCV 2017