Cheng-hao Kuo
19 papers · 2020–2026 · 5 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+6 more ↓ Show less ↑
🏃 Academic Marathon (6) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (5) 🐝 Cross-Pollinator (11)
🌍
Conference Polyglot
(5)
🏃
Academic Marathon
(6)
🤝
Dynamic Duo
(15)
💎
Century Club
(19)
⚡
Prolific Year
(6)
🗃️
Keyword Collector
(99)
Conferences
CVPR (6)
WACV (5)
ICCV (4)
ECCV (3)
INTERSPEECH (1)
Top co-authors
Keywords
domain adaptation
(4)
vision language model
(3)
uncertainty quantification
(2)
open vocabulary
(2)
3d scene understanding
(2)
human pose estimation
(2)
vision-language model
(2)
3d object detection
(2)
indoor scene understanding
(2)
voxel representation
(2)
depth estimation
(2)
object detection
(1)
transfer learning
(1)
weakly supervised learning
(1)
scene understanding
(1)
vision transformer
(1)
contrastive learning
(1)
computer vision
(1)
3d vision
(1)
human analysis
(1)
Papers
Advancing Multimodal LLMs by Large-Scale 3D Visual Instruction Dataset Generation
WACV 2026
UA-Pose: Uncertainty-Aware 6D Object Pose Estimation and Online Object Completion with Partial References
CVPR 2025
V-MIND: Building Versatile Monocular Indoor 3D Detector with Diverse 2D Annotations
WACV 2025
POp-GS: Next Best View in 3D-Gaussian Splatting with P-Optimality
CVPR 2025
Zero-shot 3D Question Answering via Voxel-based Dynamic Token Compression
CVPR 2025
Details Matter for Indoor Open-vocabulary 3D Instance Segmentation
ICCV 2025
OpenM3D: Open Vocabulary Multi-view Indoor 3D Object Detection without Human Annotations
ICCV 2025
Ex2Eg-MAE: A Framework for Adaptation of Exocentric Video Masked Autoencoders for Egocentric Social Role Understanding
ECCV 2024
Correspondence-Free SE(3) Point Cloud Registration in RKHS via Unsupervised Equivariant Learning
ECCV 2024
ReCLIP: Refine Contrastive Language Image Pre-Training With Source Free Domain Adaptation
WACV 2024
GDA: Generalized Diffusion for Robust Test-time Adaptation
CVPR 2024
No More Ambiguity in 360deg Room Layout via Bi-Layout Estimation
CVPR 2024
GenRC: Generative 3D Room Completion from Sparse Image Collections
ECCV 2024
Bidirectional Alignment for Domain Adaptive Detection with Transformers
ICCV 2023
ImGeoNet: Image-induced Geometry-aware Voxel Representation for Multi-view 3D Object Detection
ICCV 2023
Human-in-the-Loop Video Semantic Segmentation Auto-Annotation
WACV 2023
CameraPose: Weakly-Supervised Monocular 3D Human Pose Estimation by Leveraging In-the-Wild 2D Annotations
WACV 2023
Acted vs. Improvised: Domain Adaptation for Elicitation Approaches in Audio-Visual Emotion Recognition
INTERSPEECH 2021
MEBOW: Monocular Estimation of Body Orientation in the Wild
CVPR 2020