Yu-Wing Tai
89 papers · 2013–2026 · 7 conferences · across top CS/AI conferences
Achievements
🐝 Cross-Pollinator (14)
🌍 Conference Polyglot (7)
🧭 Keyword Pioneer
🏃 Academic Marathon (12)
🌈 Renaissance Researcher (7)
🐣 Hot Topic Early Bird
🌟 Keyword Trendsetter Combo (7)
🏠 Conference Loyalist (40)
🔬 Deep Specialist (14)
🏆 Keyword Champion (2)
🤝 Dynamic Duo (50)
🔥 Unstoppable (13)
🗃️ Keyword Collector (334)
⚡ Prolific Year (11)
🚀 Conference Pioneer
💎 Century Club (88)
📈 Trend Setter
Conferences
CVPR (40)
ECCV (18)
ICCV (17)
NIPS (7)
AAAI (3)
ICLR (3)
WACV (1)
Keywords
semantic segmentation (8)
object detection (7)
neural network (6)
neural radiance field (6)
alpha matte (5)
depth estimation (4)
image segmentation (4)
zero-shot learning (4)
instance segmentation (4)
3d reconstruction (4)
convolutional neural network (4)
video object segmentation (4)
weakly supervised learning (3)
3d vision (3)
video matting (3)
image matting (3)
attention mechanism (3)
object tracking (3)
surface normal (3)
adversarial learning (2)
Papers
RealRep: Generalized SDR-to-HDR Conversion via Attribute-Disentangled Representation Learning
AAAI 2026
Motion-Agent: A Conversational Framework for Human Motion Generation with LLMs
ICLR 2025
Segment Anything Meets Point Tracking
WACV 2025
Stable Segment Anything Model
ICLR 2025
Gear-NeRF: Free-Viewpoint Rendering and Tracking with Motion-aware Spatio-Temporal Sampling
CVPR 2024
DragVideo: Interactive Drag-style Video Editing
ECCV 2024
Distill Gold from Massive Ores: Bi-level Data Pruning towards Efficient Dataset Distillation
ECCV 2024
Diffusion-Generated Pseudo-Observations for High-Quality Sparse-View Reconstruction
ECCV 2024
ChatCam: Empowering Camera Control through Conversational AI
NIPS 2024
SANeRF-HQ: Segment Anything for NeRF in High Quality
CVPR 2024
C3Net: Compound Conditioned ControlNet for Multimodal Content Generation
CVPR 2024
NeRF-RPN: A General Framework for Object Detection in NeRFs
CVPR 2023
Cascade-DETR: Delving into High-Quality Universal Object Detection
ICCV 2023
EgoPCA: A New Framework for Egocentric Hand-Object Interaction Understanding
ICCV 2023
FaceDNeRF: Semantics-Driven Face Reconstruction, Prompt Editing and Relighting with Diffusion Models
NIPS 2023
Instance Neural Radiance Field
ICCV 2023
Mask-Free Video Instance Segmentation
CVPR 2023
Ultrahigh Resolution Image/Video Matting With Spatio-Temporal Sparsity
CVPR 2023
Compression-Aware Video Super-Resolution
CVPR 2023
BiMatting: Efficient Video Matting via Binarization
NIPS 2023
Segment Anything in High Quality
NIPS 2023
Towards Robust Object Detection Invariant to Real-World Domain Shifts
ICLR 2023
Interactiveness Field in Human-Object Interactions
CVPR 2022
Unsupervised Multi-View Object Segmentation Using Radiance Field Propagation
NIPS 2022
Human Instance Matting via Mutual Guidance and Multi-Instance Refinement
CVPR 2022
Transcoded Video Restoration by Temporal Spatial Auxiliary Network
AAAI 2022
Video Mask Transfiner for High-Quality Video Instance Segmentation
ECCV 2022
Few-Shot Video Object Detection
ECCV 2022
Few-Shot Object Detection with Model Calibration
ECCV 2022
Self-Support Few-Shot Semantic Segmentation
ECCV 2022
Mask Transfiner for High-Quality Instance Segmentation
CVPR 2022
Look Back and Forth: Video Super-Resolution With Explicit Temporal Difference Modeling
CVPR 2022
Semantic Image Matting
CVPR 2021
Rethinking Space-Time Networks with Improved Memory Coverage for Efficient Video Object Segmentation
NIPS 2021
Prototypical Cross-Attention Networks for Multiple Object Tracking and Segmentation
NIPS 2021
Group Collaborative Learning for Co-Salient Object Detection
CVPR 2021
Deep Video Matting via Spatio-Temporal Alignment and Aggregation
CVPR 2021
Modular Interactive Video Object Segmentation: Interaction-to-Mask, Propagation and Difference-Aware Fusion
CVPR 2021
Deep Occlusion-Aware Instance Segmentation With Overlapping BiLayers
CVPR 2021
HAA500: Human-Centric Atomic Action Dataset With Curated Videos
ICCV 2021
Occlusion-Aware Video Object Inpainting
ICCV 2021
Pyramid Multi-view Stereo Net with Self-adaptive View Aggregation
ECCV 2020
GSNet: Joint Vehicle Pose and Shape Reconstruction with Geometrical and Scene-aware Supervision
ECCV 2020
Towards Global Explanations of Convolutional Neural Networks With Concept Attribution
CVPR 2020
Few-Shot Object Detection With Attention-RPN and Multi-Relation Detector
CVPR 2020
FSS-1000: A 1000-Class Dataset for Few-Shot Segmentation
CVPR 2020
Boosting the Transferability of Adversarial Samples via Attention
CVPR 2020
CascadePSP: Toward Class-Agnostic and Very High-Resolution Segmentation via Global and Local Refinement
CVPR 2020
Fast Video Object Segmentation With Temporal Aggregation Network and Dynamic Template Matching
CVPR 2020
Cascaded Deep Monocular 3D Human Pose Estimation With Evolutionary Training Data
CVPR 2020
Learning Video Object Segmentation From Unlabeled Videos
CVPR 2020
Pointwise Rotation-Invariant Network with Adaptive Sampling and 3D Spherical Voxel Convolution
AAAI 2020
Fully Convolutional Networks for Continuous Sign Language Recognition
ECCV 2020
Dive Deeper Into Box for Object Detection
ECCV 2020
Dense Hybrid Recurrent Multi-view Stereo Net with Dynamic Consistency Checking
ECCV 2020
Commonality-Parsing Network across Shape and Appearance for Partially Supervised Instance Segmentation
ECCV 2020
Memory-Attended Recurrent Network for Video Captioning
CVPR 2019
MMFace: A Multi-Metric Regression Network for Unconstrained Face Reconstruction
CVPR 2019
Adversarial Attacks Beyond the Image Space
CVPR 2019
Non-Local Recurrent Neural Memory for Supervised Sequence Modeling
ICCV 2019
Reflective Decoding Network for Image Captioning
ICCV 2019
Cross-Domain Adaptation for Animal Pose Estimation
ICCV 2019
LADN: Local Adversarial Disentangling Network for Facial Makeup and De-Makeup
ICCV 2019
Attribute-Guided Face Generation Using Conditional CycleGAN
ECCV 2018
Pairwise Body-Part Attention for Recognizing Human-Object Interactions
ECCV 2018
Weakly and Semi Supervised Human Body Part Parsing via Pose-Guided Knowledge Transfer
CVPR 2018
Learning Dual Convolutional Neural Networks for Low-Level Vision
CVPR 2018
Deep High Dynamic Range Imaging with Large Foreground Motions
ECCV 2018
Deep Video Generation, Prediction and Completion of Human Action Sequences
ECCV 2018
Image Generation from Sketch Constraint Using Contextual GAN
ECCV 2018
Accurate Single Stage Detector Using Recurrent Rolling Convolution
CVPR 2017
A Unified Approach of Multi-Scale Deep and Hand-Crafted Features for Defocus Estimation
CVPR 2017
Learning Discriminative Data Fitting Functions for Blind Image Deblurring
ICCV 2017
RMPE: Regional Multi-Person Pose Estimation
ICCV 2017
Weakly- and Self-Supervised Learning for Content-Aware Deep Image Retargeting
ICCV 2017
Deep Saliency With Encoded Low Level Distance Map and High Level Features
CVPR 2016
Efficient and Robust Color Consistency for Community Photo Collections
CVPR 2016
Accurate Depth Map Estimation From a Lenslet Light Field Camera
CVPR 2015
RGB-Guided Hyperspectral Image Upsampling
ICCV 2015
Fast Randomized Singular Value Thresholding for Nuclear Norm Minimization
CVPR 2015
Data-Driven Depth Map Refinement via Multi-Scale Sparse Representation
CVPR 2015
Exploiting Shading Cues in Kinect IR Images for Geometry Refinement
CVPR 2014
Calibrating a Non-isotropic Near Point Light Source using a Plane
CVPR 2014
Salient Region Detection via High-Dimensional Color Transform
CVPR 2014
Shading-Based Shape Refinement of RGB-D Images
CVPR 2013
Modeling the Calibration Pipeline of the Lytro Camera for High Quality Light-Field Image Reconstruction
ICCV 2013
Partial Sum Minimization of Singular Values in RPCA for Low-Level Vision
ICCV 2013
A Learning-Based Approach to Reduce JPEG Artifacts in Image Matting
ICCV 2013
Multiview Photometric Stereo Using Planar Mesh Parameterization
ICCV 2013