Cheng-hao Kuo

19 papers · 2020–2026 · 5 conferences · across top CS/AI conferences

Achievements

🏃 Academic Marathon (6) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (5) 🐝 Cross-Pollinator (11)

🤝 Dynamic Duo (15) 💎 Century Club (19) ⚡ Prolific Year (6) 🗃️ Keyword Collector (99)

Conferences

CVPR (6) WACV (5) ICCV (4) ECCV (3) INTERSPEECH (1)

Top co-authors

Min Sun (15) Ke ZHANG (7) Lu Xia (6) Yuyin Sun (5) Albert Y. C. Chen (5) Nan Qiao (4) Jiajia Luo (4) Che-Chun Su (4) Ming-Feng Li (3) Xiao Zeng (3)

Keywords

domain adaptation (4) vision language model (3) uncertainty quantification (2) open vocabulary (2) 3d scene understanding (2) human pose estimation (2) vision-language model (2) 3d object detection (2) indoor scene understanding (2) voxel representation (2) depth estimation (2) object detection (1) transfer learning (1) weakly supervised learning (1) scene understanding (1) vision transformer (1) contrastive learning (1) computer vision (1) 3d vision (1) human analysis (1)

Papers

Advancing Multimodal LLMs by Large-Scale 3D Visual Instruction Dataset Generation · WACV 2026
UA-Pose: Uncertainty-Aware 6D Object Pose Estimation and Online Object Completion with Partial References · CVPR 2025
V-MIND: Building Versatile Monocular Indoor 3D Detector with Diverse 2D Annotations · WACV 2025
POp-GS: Next Best View in 3D-Gaussian Splatting with P-Optimality · CVPR 2025
Zero-shot 3D Question Answering via Voxel-based Dynamic Token Compression · CVPR 2025
Details Matter for Indoor Open-vocabulary 3D Instance Segmentation · ICCV 2025
OpenM3D: Open Vocabulary Multi-view Indoor 3D Object Detection without Human Annotations · ICCV 2025
Ex2Eg-MAE: A Framework for Adaptation of Exocentric Video Masked Autoencoders for Egocentric Social Role Understanding · ECCV 2024
Correspondence-Free SE(3) Point Cloud Registration in RKHS via Unsupervised Equivariant Learning · ECCV 2024
ReCLIP: Refine Contrastive Language Image Pre-Training With Source Free Domain Adaptation · WACV 2024
GDA: Generalized Diffusion for Robust Test-time Adaptation · CVPR 2024
No More Ambiguity in 360° Room Layout via Bi-Layout Estimation · CVPR 2024
GenRC: Generative 3D Room Completion from Sparse Image Collections · ECCV 2024
Bidirectional Alignment for Domain Adaptive Detection with Transformers · ICCV 2023
ImGeoNet: Image-induced Geometry-aware Voxel Representation for Multi-view 3D Object Detection · ICCV 2023
Human-in-the-Loop Video Semantic Segmentation Auto-Annotation · WACV 2023
CameraPose: Weakly-Supervised Monocular 3D Human Pose Estimation by Leveraging In-the-Wild 2D Annotations · WACV 2023
Acted vs. Improvised: Domain Adaptation for Elicitation Approaches in Audio-Visual Emotion Recognition · INTERSPEECH 2021
MEBOW: Monocular Estimation of Body Orientation in the Wild · CVPR 2020