Min Sun
63 papers · 2012–2026 · 10 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+13 more ↓ Show less ↑
π Cross-Pollinator (12) π Academic Marathon (14) π Interdisciplinary Bridge π Conference Polyglot (10) π Renaissance Researcher (8)
π
Renaissance Researcher
(8)
πΊοΈ
Taxonomy Completionist
(97)
π§
Keyword Pioneer
π¬
Deep Specialist
(16)
π§¬
Topic Evolution
π€
Dynamic Duo
(15)
π
Keyword Champion
(4)
π
Century Club
(60)
ποΈ
Keyword Collector
(241)
π₯
Unstoppable
(10)
β‘
Prolific Year
(6)
π
Conference Pioneer
π
Trend Setter
Conferences
CVPR (16)
ICCV (12)
ECCV (11)
WACV (9)
AAAI (4)
ACL (4)
NIPS (3)
CORL (2)
AISTATS (1)
IJCAI (1)
Top co-authors
Keywords
depth estimation
(10)
room layout estimation
(7)
3d reconstruction
(7)
semantic segmentation
(5)
domain adaptation
(5)
scene understanding
(4)
convolutional neural network
(4)
indoor scene understanding
(4)
object detection
(3)
3d object detection
(3)
multimodal learning
(3)
indoor scene
(3)
vision-language model
(3)
vision language model
(3)
panoramic image
(3)
branch and bound
(2)
multi-object tracking
(2)
3d scene understanding
(2)
video understanding
(2)
instance segmentation
(2)
Papers
VLN-NF: Feasibility-Aware Vision-and-Language Navigation with False-Premise Instructions
ACL 2026
ADAPT: Benchmarking Commonsense Planning under Unspecified Affordance Constraints
ACL 2026
Listening Like Humans: Semantics-Guided Noise-Robust Multimodal Speech Recognition
ACL 2026
PS3: Part Level Instance Segmentation in 3D
WACV 2026
Advancing Multimodal LLMs by Large-Scale 3D Visual Instruction Dataset Generation
WACV 2026
uLayout: Unified Room Layout Estimation for Perspective and Panoramic Images
WACV 2025
UA-Pose: Uncertainty-Aware 6D Object Pose Estimation and Online Object Completion with Partial References
CVPR 2025
DreaMo: Articulated 3D Reconstruction from a Single Casual Video
WACV 2025
POp-GS: Next Best View in 3D-Gaussian Splatting with P-Optimality
CVPR 2025
V-MIND: Building Versatile Monocular Indoor 3D Detector with Diverse 2D Annotations
WACV 2025
OpenM3D: Open Vocabulary Multi-view Indoor 3D Object Detection without Human Annotations
ICCV 2025
Zero-shot 3D Question Answering via Voxel-based Dynamic Token Compression
CVPR 2025
Details Matter for Indoor Open-vocabulary 3D Instance Segmentation
ICCV 2025
Correspondence-Free SE(3) Point Cloud Registration in RKHS via Unsupervised Equivariant Learning
ECCV 2024
Context-Aware Replanning with Pre-Explored Semantic Map for Object Navigation
CORL 2024
No More Ambiguity in 360deg Room Layout via Bi-Layout Estimation
CVPR 2024
GDA: Generalized Diffusion for Robust Test-time Adaptation
CVPR 2024
ReCLIP: Refine Contrastive Language Image Pre-Training With Source Free Domain Adaptation
WACV 2024
GenRC: Generative 3D Room Completion from Sparse Image Collections
ECCV 2024
Self-Training Room Layout via Geometry-aware Ray-casting
ECCV 2024
Ex2Eg-MAE: A Framework for Adaptation of Exocentric Video Masked Autoencoders for Egocentric Social Role Understanding
ECCV 2024
ImGeoNet: Image-induced Geometry-aware Voxel Representation for Multi-view 3D Object Detection
ICCV 2023
Bidirectional Alignment for Domain Adaptive Detection with Transformers
ICCV 2023
MixFairFace: Towards Ultimate Fairness via MixFair Adapter in Face Recognition
AAAI 2023
Dense Prediction With Attentive Feature Aggregation
WACV 2023
Direct Voxel Grid Optimization: Super-Fast Convergence for Radiance Fields Reconstruction
CVPR 2022
CC-3DT: Panoramic 3D Object Tracking via Cross-Camera Fusion
CORL 2022
Autoregressive 3D Shape Generation via Canonical Mapping
ECCV 2022
Data Efficient 3D Learner via Knowledge Transferred from 2D Model
ECCV 2022
360-MLC: Multi-view Layout Consistency for Self-training and Hyper-parameter Tuning
NIPS 2022
Semiconductor Defect Detection by Hybrid Classical-Quantum Deep Learning
CVPR 2022
Toward Robust Long Range Policy Transfer
AAAI 2021
LED2-Net: Monocular 360deg Layout Estimation via Differentiable Depth Rendering
CVPR 2021
HoHoNet: 360 Indoor Holistic Understanding With Latent Horizontal Features
CVPR 2021
Indoor Panorama Planar 3D Reconstruction via Divide and Conquer
CVPR 2021
Specialize and Fuse: Pyramidal Output Representation for Semantic Segmentation
ICCV 2021
Learning 3D Dense Correspondence via Canonical Point Autoencoder
NIPS 2021
Controllable Image Synthesis via SegVAE
ECCV 2020
InstaNAS: Instance-Aware Neural Architecture Search
AAAI 2020
BiFuse: Monocular 360 Depth Estimation via Bi-Projection Fusion
CVPR 2020
Mitigating Forgetting in Online Continual Learning via Instance-Aware Parameterization
NIPS 2020
360-Indoor: Towards Learning Real-World Objects in 360deg Indoor Equirectangular Images
WACV 2020
Visual Question Answering on 360deg Images
WACV 2020
Point-to-Point Video Generation
ICCV 2019
HorizonNet: Learning Room Layout With 1D Representation and Pano Stretch Data Augmentation
CVPR 2019
Unsupervised Stylish Image Description Generation via Domain Layer Norm
AAAI 2019
DuLa-Net: A Dual-Projection Network for Estimating Room Layouts From a Single RGB Panorama
CVPR 2019
Joint Monocular 3D Vehicle Detection and Tracking
ICCV 2019
Liquid Pouring Monitoring via Rich Sensory Inputs
ECCV 2018
DPP-Net: Device-aware Progressive Search for Pareto-optimal Neural Architectures
ECCV 2018
Leveraging Motion Priors in Videos for Improving Human Segmentation
ECCV 2018
A Unified Model for Extractive and Abstractive Summarization using Inconsistency Loss
ACL 2018
Efficient Uncertainty Estimation for Semantic Segmentation in Videos
ECCV 2018
Cube Padding for Weakly-Supervised Saliency Prediction in 360Β° Videos
CVPR 2018
No More Discrimination: Cross City Adaptation of Road Scene Segmenters
ICCV 2017
Anticipating Daily Intention Using On-Wrist Motion Triggered Sensing
ICCV 2017
Show, Adapt and Tell: Adversarial Training of Cross-Domain Image Captioner
ICCV 2017
Visual Forecasting by Imitating Dynamics in Natural Sequences
ICCV 2017
Deep 360 Pilot: Learning a Deep Agent for Piloting Through 360deg Sports Videos
CVPR 2017
Agent-Centric Risk Assessment: Accident Anticipation and Risky Region Localization
CVPR 2017
Tactics of Adversarial Attack on Deep Reinforcement Learning Agents
IJCAI 2017
Find the Best Path: An Efficient and Accurate Classifier for Image Hierarchies
ICCV 2013
Efficient and Exact MAP-MRF Inference using Branch and Bound
AISTATS 2012