Min Sun

63 papers · 2012–2026 · 10 conferences · across top CS/AI conferences

Achievements

+13 more ↓

🐝 Cross-Pollinator (12) 🏃 Academic Marathon (14) 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (10) 🌈 Renaissance Researcher (8)

🌈 Renaissance Researcher (8) 🗺️ Taxonomy Completionist (97) 🧭 Keyword Pioneer 🔬 Deep Specialist (16) 🧬 Topic Evolution 🤝 Dynamic Duo (15) 🏆 Keyword Champion (4) 💎 Century Club (60) 🗃️ Keyword Collector (241) 🔥 Unstoppable (10) ⚡ Prolific Year (6) 🚀 Conference Pioneer 📈 Trend Setter

Conferences

CVPR (16) ICCV (12) ECCV (11) WACV (9) AAAI (4) ACL (4) NIPS (3) CORL (2) AISTATS (1) IJCAI (1)

Top co-authors

Cheng-hao Kuo (15) Fu-En Wang (8) Cheng Sun (8) Ming-Hsuan Yang (6) Hwann-Tzong Chen (6) Albert Y. C. Chen (5) Yi-Hsuan Tsai (5) Ke ZHANG (5) An-Chieh Cheng (5) Yu-Lun Liu (4)

Keywords

depth estimation (10) room layout estimation (7) 3d reconstruction (7) semantic segmentation (5) domain adaptation (5) scene understanding (4) convolutional neural network (4) indoor scene understanding (4) object detection (3) 3d object detection (3) multimodal learning (3) indoor scene (3) vision-language model (3) vision language model (3) panoramic image (3) branch and bound (2) multi-object tracking (2) 3d scene understanding (2) video understanding (2) instance segmentation (2)

Papers

VLN-NF: Feasibility-Aware Vision-and-Language Navigation with False-Premise Instructions ACL 2026 ADAPT: Benchmarking Commonsense Planning under Unspecified Affordance Constraints ACL 2026 Listening Like Humans: Semantics-Guided Noise-Robust Multimodal Speech Recognition ACL 2026 PS3: Part Level Instance Segmentation in 3D WACV 2026 Advancing Multimodal LLMs by Large-Scale 3D Visual Instruction Dataset Generation WACV 2026 uLayout: Unified Room Layout Estimation for Perspective and Panoramic Images WACV 2025 UA-Pose: Uncertainty-Aware 6D Object Pose Estimation and Online Object Completion with Partial References CVPR 2025 DreaMo: Articulated 3D Reconstruction from a Single Casual Video WACV 2025 POp-GS: Next Best View in 3D-Gaussian Splatting with P-Optimality CVPR 2025 V-MIND: Building Versatile Monocular Indoor 3D Detector with Diverse 2D Annotations WACV 2025 OpenM3D: Open Vocabulary Multi-view Indoor 3D Object Detection without Human Annotations ICCV 2025 Zero-shot 3D Question Answering via Voxel-based Dynamic Token Compression CVPR 2025 Details Matter for Indoor Open-vocabulary 3D Instance Segmentation ICCV 2025 Correspondence-Free SE(3) Point Cloud Registration in RKHS via Unsupervised Equivariant Learning ECCV 2024 Context-Aware Replanning with Pre-Explored Semantic Map for Object Navigation CORL 2024 No More Ambiguity in 360deg Room Layout via Bi-Layout Estimation CVPR 2024 GDA: Generalized Diffusion for Robust Test-time Adaptation CVPR 2024 ReCLIP: Refine Contrastive Language Image Pre-Training With Source Free Domain Adaptation WACV 2024 GenRC: Generative 3D Room Completion from Sparse Image Collections ECCV 2024 Self-Training Room Layout via Geometry-aware Ray-casting ECCV 2024 Ex2Eg-MAE: A Framework for Adaptation of Exocentric Video Masked Autoencoders for Egocentric Social Role Understanding ECCV 2024 ImGeoNet: Image-induced Geometry-aware Voxel Representation for Multi-view 3D Object Detection ICCV 2023 Bidirectional Alignment for Domain Adaptive Detection with Transformers ICCV 2023 MixFairFace: Towards Ultimate Fairness via MixFair Adapter in Face Recognition AAAI 2023 Dense Prediction With Attentive Feature Aggregation WACV 2023 Direct Voxel Grid Optimization: Super-Fast Convergence for Radiance Fields Reconstruction CVPR 2022 CC-3DT: Panoramic 3D Object Tracking via Cross-Camera Fusion CORL 2022 Autoregressive 3D Shape Generation via Canonical Mapping ECCV 2022 Data Efficient 3D Learner via Knowledge Transferred from 2D Model ECCV 2022 360-MLC: Multi-view Layout Consistency for Self-training and Hyper-parameter Tuning NIPS 2022 Semiconductor Defect Detection by Hybrid Classical-Quantum Deep Learning CVPR 2022 Toward Robust Long Range Policy Transfer AAAI 2021 LED2-Net: Monocular 360deg Layout Estimation via Differentiable Depth Rendering CVPR 2021 HoHoNet: 360 Indoor Holistic Understanding With Latent Horizontal Features CVPR 2021 Indoor Panorama Planar 3D Reconstruction via Divide and Conquer CVPR 2021 Specialize and Fuse: Pyramidal Output Representation for Semantic Segmentation ICCV 2021 Learning 3D Dense Correspondence via Canonical Point Autoencoder NIPS 2021 Controllable Image Synthesis via SegVAE ECCV 2020 InstaNAS: Instance-Aware Neural Architecture Search AAAI 2020 BiFuse: Monocular 360 Depth Estimation via Bi-Projection Fusion CVPR 2020 Mitigating Forgetting in Online Continual Learning via Instance-Aware Parameterization NIPS 2020 360-Indoor: Towards Learning Real-World Objects in 360deg Indoor Equirectangular Images WACV 2020 Visual Question Answering on 360deg Images WACV 2020 Point-to-Point Video Generation ICCV 2019 HorizonNet: Learning Room Layout With 1D Representation and Pano Stretch Data Augmentation CVPR 2019 Unsupervised Stylish Image Description Generation via Domain Layer Norm AAAI 2019 DuLa-Net: A Dual-Projection Network for Estimating Room Layouts From a Single RGB Panorama CVPR 2019 Joint Monocular 3D Vehicle Detection and Tracking ICCV 2019 Liquid Pouring Monitoring via Rich Sensory Inputs ECCV 2018 DPP-Net: Device-aware Progressive Search for Pareto-optimal Neural Architectures ECCV 2018 Leveraging Motion Priors in Videos for Improving Human Segmentation ECCV 2018 A Unified Model for Extractive and Abstractive Summarization using Inconsistency Loss ACL 2018 Efficient Uncertainty Estimation for Semantic Segmentation in Videos ECCV 2018 Cube Padding for Weakly-Supervised Saliency Prediction in 360° Videos CVPR 2018 No More Discrimination: Cross City Adaptation of Road Scene Segmenters ICCV 2017 Anticipating Daily Intention Using On-Wrist Motion Triggered Sensing ICCV 2017 Show, Adapt and Tell: Adversarial Training of Cross-Domain Image Captioner ICCV 2017 Visual Forecasting by Imitating Dynamics in Natural Sequences ICCV 2017 Deep 360 Pilot: Learning a Deep Agent for Piloting Through 360deg Sports Videos CVPR 2017 Agent-Centric Risk Assessment: Accident Anticipation and Risky Region Localization CVPR 2017 Tactics of Adversarial Attack on Deep Reinforcement Learning Agents IJCAI 2017 Find the Best Path: An Efficient and Accurate Classifier for Image Hierarchies ICCV 2013 Efficient and Exact MAP-MRF Inference using Branch and Bound AISTATS 2012