Yu-Wing Tai

89 papers · 2013–2026 · 7 top CS/AI conferences

Achievements

🐝 Cross-Pollinator (14) 🌍 Conference Polyglot (7) 🧭 Keyword Pioneer 🏃 Academic Marathon (12) 🌈 Renaissance Researcher (7)

🐣 Hot Topic Early Bird 🌟 Keyword Trendsetter Combo (7) 🏠 Conference Loyalist (40) 🔬 Deep Specialist (14) 🏆 Keyword Champion (2) 🤝 Dynamic Duo (50) 🔥 Unstoppable (13) 🗃️ Keyword Collector (334) ⚡ Prolific Year (11) 🚀 Conference Pioneer 💎 Century Club (88) 📈 Trend Setter

Conferences

CVPR (40) ECCV (18) ICCV (17) NIPS (7) AAAI (3) ICLR (3) WACV (1)

Top co-authors

Chi-Keung Tang (50) Lei Ke (16) Fisher Yu (9) In So Kweon (9) Qi Fan (8) Cewu Lu (8) Martin Danelljan (8) Xiaoyong Shen (5) Wenjie Pei (5) Jaesik Park (5)

Keywords

semantic segmentation (8) object detection (7) neural network (6) neural radiance field (6) alpha matte (5) depth estimation (4) image segmentation (4) zero-shot learning (4) instance segmentation (4) 3d reconstruction (4) convolutional neural network (4) video object segmentation (4) weakly supervised learning (3) 3d vision (3) video matting (3) image matting (3) attention mechanism (3) object tracking (3) surface normal (3) adversarial learning (2)

Papers

RealRep: Generalized SDR-to-HDR Conversion via Attribute-Disentangled Representation Learning AAAI 2026
Motion-Agent: A Conversational Framework for Human Motion Generation with LLMs ICLR 2025
Segment Anything Meets Point Tracking WACV 2025
Stable Segment Anything Model ICLR 2025
Gear-NeRF: Free-Viewpoint Rendering and Tracking with Motion-aware Spatio-Temporal Sampling CVPR 2024
DragVideo: Interactive Drag-style Video Editing ECCV 2024
Distill Gold from Massive Ores: Bi-level Data Pruning towards Efficient Dataset Distillation ECCV 2024
Diffusion-Generated Pseudo-Observations for High-Quality Sparse-View Reconstruction ECCV 2024
ChatCam: Empowering Camera Control through Conversational AI NIPS 2024
SANeRF-HQ: Segment Anything for NeRF in High Quality CVPR 2024
C3Net: Compound Conditioned ControlNet for Multimodal Content Generation CVPR 2024
NeRF-RPN: A General Framework for Object Detection in NeRFs CVPR 2023
Cascade-DETR: Delving into High-Quality Universal Object Detection ICCV 2023
EgoPCA: A New Framework for Egocentric Hand-Object Interaction Understanding ICCV 2023
FaceDNeRF: Semantics-Driven Face Reconstruction, Prompt Editing and Relighting with Diffusion Models NIPS 2023
Instance Neural Radiance Field ICCV 2023
Mask-Free Video Instance Segmentation CVPR 2023
Ultrahigh Resolution Image/Video Matting With Spatio-Temporal Sparsity CVPR 2023
Compression-Aware Video Super-Resolution CVPR 2023
BiMatting: Efficient Video Matting via Binarization NIPS 2023
Segment Anything in High Quality NIPS 2023
Towards Robust Object Detection Invariant to Real-World Domain Shifts ICLR 2023
Interactiveness Field in Human-Object Interactions CVPR 2022
Unsupervised Multi-View Object Segmentation Using Radiance Field Propagation NIPS 2022
Human Instance Matting via Mutual Guidance and Multi-Instance Refinement CVPR 2022
Transcoded Video Restoration by Temporal Spatial Auxiliary Network AAAI 2022
Video Mask Transfiner for High-Quality Video Instance Segmentation ECCV 2022
Few-Shot Video Object Detection ECCV 2022
Few-Shot Object Detection with Model Calibration ECCV 2022
Self-Support Few-Shot Semantic Segmentation ECCV 2022
Mask Transfiner for High-Quality Instance Segmentation CVPR 2022
Look Back and Forth: Video Super-Resolution With Explicit Temporal Difference Modeling CVPR 2022
Semantic Image Matting CVPR 2021
Rethinking Space-Time Networks with Improved Memory Coverage for Efficient Video Object Segmentation NIPS 2021
Prototypical Cross-Attention Networks for Multiple Object Tracking and Segmentation NIPS 2021
Group Collaborative Learning for Co-Salient Object Detection CVPR 2021
Deep Video Matting via Spatio-Temporal Alignment and Aggregation CVPR 2021
Modular Interactive Video Object Segmentation: Interaction-to-Mask, Propagation and Difference-Aware Fusion CVPR 2021
Deep Occlusion-Aware Instance Segmentation With Overlapping BiLayers CVPR 2021
HAA500: Human-Centric Atomic Action Dataset With Curated Videos ICCV 2021
Occlusion-Aware Video Object Inpainting ICCV 2021
Pyramid Multi-view Stereo Net with Self-adaptive View Aggregation ECCV 2020
GSNet: Joint Vehicle Pose and Shape Reconstruction with Geometrical and Scene-aware Supervision ECCV 2020
Towards Global Explanations of Convolutional Neural Networks With Concept Attribution CVPR 2020
Few-Shot Object Detection With Attention-RPN and Multi-Relation Detector CVPR 2020
FSS-1000: A 1000-Class Dataset for Few-Shot Segmentation CVPR 2020
Boosting the Transferability of Adversarial Samples via Attention CVPR 2020
CascadePSP: Toward Class-Agnostic and Very High-Resolution Segmentation via Global and Local Refinement CVPR 2020
Fast Video Object Segmentation With Temporal Aggregation Network and Dynamic Template Matching CVPR 2020
Cascaded Deep Monocular 3D Human Pose Estimation With Evolutionary Training Data CVPR 2020
Learning Video Object Segmentation From Unlabeled Videos CVPR 2020
Pointwise Rotation-Invariant Network with Adaptive Sampling and 3D Spherical Voxel Convolution AAAI 2020
Fully Convolutional Networks for Continuous Sign Language Recognition ECCV 2020
Dive Deeper Into Box for Object Detection ECCV 2020
Dense Hybrid Recurrent Multi-view Stereo Net with Dynamic Consistency Checking ECCV 2020
Commonality-Parsing Network across Shape and Appearance for Partially Supervised Instance Segmentation ECCV 2020
Memory-Attended Recurrent Network for Video Captioning CVPR 2019
MMFace: A Multi-Metric Regression Network for Unconstrained Face Reconstruction CVPR 2019
Adversarial Attacks Beyond the Image Space CVPR 2019
Non-Local Recurrent Neural Memory for Supervised Sequence Modeling ICCV 2019
Reflective Decoding Network for Image Captioning ICCV 2019
Cross-Domain Adaptation for Animal Pose Estimation ICCV 2019
LADN: Local Adversarial Disentangling Network for Facial Makeup and De-Makeup ICCV 2019
Attribute-Guided Face Generation Using Conditional CycleGAN ECCV 2018
Pairwise Body-Part Attention for Recognizing Human-Object Interactions ECCV 2018
Weakly and Semi Supervised Human Body Part Parsing via Pose-Guided Knowledge Transfer CVPR 2018
Learning Dual Convolutional Neural Networks for Low-Level Vision CVPR 2018
Deep High Dynamic Range Imaging with Large Foreground Motions ECCV 2018
Deep Video Generation, Prediction and Completion of Human Action Sequences ECCV 2018
Image Generation from Sketch Constraint Using Contextual GAN ECCV 2018
Accurate Single Stage Detector Using Recurrent Rolling Convolution CVPR 2017
A Unified Approach of Multi-Scale Deep and Hand-Crafted Features for Defocus Estimation CVPR 2017
Learning Discriminative Data Fitting Functions for Blind Image Deblurring ICCV 2017
RMPE: Regional Multi-Person Pose Estimation ICCV 2017
Weakly- and Self-Supervised Learning for Content-Aware Deep Image Retargeting ICCV 2017
Deep Saliency With Encoded Low Level Distance Map and High Level Features CVPR 2016
Efficient and Robust Color Consistency for Community Photo Collections CVPR 2016
Accurate Depth Map Estimation From a Lenslet Light Field Camera CVPR 2015
RGB-Guided Hyperspectral Image Upsampling ICCV 2015
Fast Randomized Singular Value Thresholding for Nuclear Norm Minimization CVPR 2015
Data-Driven Depth Map Refinement via Multi-Scale Sparse Representation CVPR 2015
Exploiting Shading Cues in Kinect IR Images for Geometry Refinement CVPR 2014
Calibrating a Non-isotropic Near Point Light Source using a Plane CVPR 2014
Salient Region Detection via High-Dimensional Color Transform CVPR 2014
Shading-Based Shape Refinement of RGB-D Images CVPR 2013
Modeling the Calibration Pipeline of the Lytro Camera for High Quality Light-Field Image Reconstruction ICCV 2013
Partial Sum Minimization of Singular Values in RPCA for Low-Level Vision ICCV 2013
A Learning-Based Approach to Reduce JPEG Artifacts in Image Matting ICCV 2013
Multiview Photometric Stereo Using Planar Mesh Parameterization ICCV 2013