Zhe Lin
157 papers · 2013–2026 · 12 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+15 more ↓ Show less ↑
🌈 Renaissance Researcher (9) 🌍 Conference Polyglot (12) 🧭 Keyword Pioneer 🏃 Academic Marathon (12) 🌉 Interdisciplinary Bridge
🐝
Cross-Pollinator
(4)
🌍
Conference Polyglot
(12)
🏃
Academic Marathon
(12)
🌟
Keyword Trendsetter Combo
(7)
🏠
Conference Loyalist
(75)
🤝
Dynamic Duo
(41)
🔬
Deep Specialist
(26)
🏆
Keyword Champion
(15)
🌱
Topic Pioneer
💎
Century Club
(156)
⚡
Prolific Year
(15)
🗃️
Keyword Collector
(561)
🚀
Conference Pioneer
🔥
Unstoppable
(13)
📈
Trend Setter
Conferences
CVPR (75)
ICCV (26)
ECCV (24)
WACV (7)
ICLR (6)
AAAI (3)
ACL (3)
EMNLP (3)
IJCAI (3)
NIPS (3)
COLING (2)
IJCNLP (2)
Top co-authors
Keywords
semantic segmentation
(16)
diffusion model
(16)
image editing
(15)
convolutional neural network
(15)
image generation
(13)
image inpainting
(10)
object detection
(9)
image segmentation
(8)
generative model
(8)
image restoration
(7)
attention mechanism
(7)
knowledge distillation
(7)
image captioning
(6)
contrastive learning
(6)
self-supervised learning
(6)
instance segmentation
(6)
neural network
(6)
salient object detection
(5)
feature extraction
(5)
video generation
(5)
Papers
RealUHR: Harnessing Patch-Cascade Flows for Photorealistic Ultra-High-Resolution Synthesis
AAAI 2026
Baking Gaussian Splatting into Diffusion Denoiser for Fast and Scalable Single-stage Image-to-3D Generation and Reconstruction
ICCV 2025
FINECAPTION: Compositional Image Captioning Focusing on Wherever You Want at Any Granularity
CVPR 2025
Layer- and Timestep-Adaptive Differentiable Token Compression Ratios for Efficient Diffusion Transformers
CVPR 2025
Generative Video Propagation
CVPR 2025
ObjectMover: Generative Object Movement with Video Prior
CVPR 2025
TransPixeler: Advancing Text-to-Video Generation with Transparency
CVPR 2025
Generative Image Layer Decomposition with Visual Effects
CVPR 2025
Refine-by-Align: Reference-Guided Artifacts Refinement through Semantic Alignment
ICLR 2025
Multitwine: Multi-Object Compositing with Text and Layout Control
CVPR 2025
ImageFolder: Autoregressive Image Generation with Folded Tokens
ICLR 2025
MetaShadow: Object-Centered Shadow Detection, Removal, and Synthesis
CVPR 2025
UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics
CVPR 2025
DOLLAR: Few-Step Video Generation via Distillation and Latent Reward Optimization
ICCV 2025
TurboFill: Adapting Few-step Text-to-image Model for Fast Image Inpainting
CVPR 2025
IMPRINT: Generative Object Compositing by Learning Identity-Preserving Representation
CVPR 2024
Content-Aware Image Color Editing With Auxiliary Color Restoration Tasks
WACV 2024
Amodal Scene Analysis via Holistic Occlusion Relation Inference and Generative Mask Completion
AAAI 2024
Latent Feature-Guided Diffusion Models for Shadow Removal
WACV 2024
SCoRD: Subject-Conditional Relation Detection With Text-Augmented Data
WACV 2024
Image Inpainting via Iteratively Decoupled Probabilistic Modeling
ICLR 2024
Advancing Vision-Language Models with Adapter Ensemble Strategies
EMNLP 2024
Thinking Outside the BBox: Unconstrained Generative Object Compositing
ECCV 2024
Mixture of Efficient Diffusion Experts Through Automatic Interval and Sub-Network Selection
ECCV 2024
Removing Distributional Discrepancies in Captions Improves Image-Text Alignment
ECCV 2024
SegGen: Supercharging Segmentation Models with Text2Mask and Mask2Img Synthesis
ECCV 2024
Brush2Prompt: Contextual Prompt Generator for Object Inpainting
CVPR 2024
SmartMask: Context Aware High-Fidelity Mask Generation for Fine-grained Object Insertion and Layout Control
CVPR 2024
UniHuman: A Unified Model For Editing Human Images in the Wild
CVPR 2024
Video-P2P: Video Editing with Cross-attention Control
CVPR 2024
InstantBooth: Personalized Text-to-Image Generation without Test-Time Finetuning
CVPR 2024
Attention-Driven Training-Free Efficiency Enhancement of Diffusion Models
CVPR 2024
PRN: Panoptic Refinement Network
WACV 2023
AIMS: All-Inclusive Multi-Level Segmentation for Anything
NIPS 2023
Automatic High Resolution Wire Segmentation and Removal
CVPR 2023
Uncovering the Disentanglement Capability in Text-to-Image Diffusion Models
CVPR 2023
TopNet: Transformer-Based Object Placement Network for Image Compositing
CVPR 2023
ObjectStitch: Object Compositing With Diffusion Model
CVPR 2023
SmartBrush: Text and Shape Guided Object Inpainting With Diffusion Model
CVPR 2023
SceneComposer: Any-Level Semantic Image Synthesis
CVPR 2023
SimpSON: Simplifying Photo Cleanup With Single-Click Distracting Object Segmentation Network
CVPR 2023
Harnessing the Spatial-Temporal Attention of Diffusion Models for High-Fidelity Text-to-Image Synthesis
ICCV 2023
Perceptual Artifacts Localization for Image Synthesis Tasks
ICCV 2023
High Quality Entity Segmentation
ICCV 2023
Human MotionFormer: Transferring Human Motions with Vision Transformers
ICLR 2023
Interactive Portrait Harmonization
ICLR 2023
XFormer: Fast and Accurate Monocular 3D Body Capture
IJCAI 2023
Image Inpainting with Cascaded Modulation GAN and Object-Aware Training
ECCV 2022
Visual Information Guided Zero-Shot Paraphrase Generation
COLING 2022
MAT: Mask-Aware Transformer for Large Hole Image Inpainting
CVPR 2022
SketchEdit: Mask-Free Local Image Manipulation With Partial Sketches
CVPR 2022
Open-Vocabulary Instance Segmentation via Robust Cross-Modal Pseudo-Labeling
CVPR 2022
EI-CLIP: Entity-Aware Interventional Contrastive Learning for E-Commerce Cross-Modal Retrieval
CVPR 2022
Layered Depth Refinement With Mask Guidance
CVPR 2022
High Quality Segmentation for Ultra High-Resolution Images
CVPR 2022
Lite Vision Transformer With Enhanced Self-Attention
CVPR 2022
StyleBabel: Artistic Style Tagging and Captioning
ECCV 2022
3D-FM GAN: Towards 3D-Controllable Face Manipulation
ECCV 2022
CoGS: Controllable Generation and Search from Sketch and Style
ECCV 2022
Inpainting at Modern Camera Resolution by Guided PatchMatch with Auto-Curation
ECCV 2022
Controllable Shadow Generation Using Pixel Height Maps
ECCV 2022
Improving Closed and Open-Vocabulary Attribute Prediction Using Transformers
ECCV 2022
GALA: Toward Geometry-and-Lighting-Aware Object Search for Compositing
ECCV 2022
Perceptual Artifacts Localization for Inpainting
ECCV 2022
CA-SSL: Class-Agnostic Semi-Supervised Learning for Detection and Segmentation
ECCV 2022
Towards Document-Level Paraphrase Generation with Sentence Rewriting and Reordering
EMNLP 2021
Mask Guided Matting via Progressive Refinement Network
CVPR 2021
Deep Image Compositing
WACV 2021
Automatic Object Recoloring Using Adversarial Learning
WACV 2021
Multimodal Contrastive Training for Visual Representation Learning
CVPR 2021
Multi-Scale Aligned Distillation for Low-Resolution Detection
CVPR 2021
Learning To Predict Visual Attributes in the Wild
CVPR 2021
Neural Sentence Simplification with Semantic Dependency Information
AAAI 2021
Making Better Use of Bilingual Information for Cross-Lingual AMR Parsing
ACL 2021
Pushing Paraphrase Away from Original Sentence: A Multi-Round Paraphrase Generation Approach
ACL 2021
Face Image Retrieval With Attribute Manipulation
ICCV 2021
Language-Guided Global Image Editing via Cross-Modal Cyclic Mechanism
ICCV 2021
Pushing Paraphrase Away from Original Sentence: A Multi-Round Paraphrase Generation Approach
IJCNLP 2021
ALADIN: All Layer Adaptive Instance Normalization for Fine-Grained Style Similarity
ICCV 2021
Content-Aware GAN Compression
CVPR 2021
SSH: A Self-Supervised Framework for Image Harmonization
ICCV 2021
CR-Fill: Generative Image Inpainting With Auxiliary Contextual Reconstruction
ICCV 2021
Making Better Use of Bilingual Information for Cross-Lingual AMR Parsing
IJCNLP 2021
High-Resolution Image Inpainting with Iterative Confidence Feedback and Guided Upsampling
ECCV 2020
Context-Aware Group Captioning via Self-Attention and Contrastive Features
CVPR 2020
Temporally Distributed Networks for Fast Video Semantic Segmentation
CVPR 2020
PhraseCut: Language-Based Image Segmentation in the Wild
CVPR 2020
Learning Visual Emotion Representations From Web Data
CVPR 2020
On the Helpfulness of Document Context to Sentence Simplification
COLING 2020
On the Decidability of Intuitionistic Tense Logic without Disjunction
IJCAI 2020
Structure-Guided Ranking Loss for Single Image Depth Prediction
CVPR 2020
SDC-Depth: Semantic Divide-and-Conquer Network for Monocular Depth Estimation
CVPR 2020
Open-Edit: Open-Domain Image Manipulation with Open-Vocabulary Instructions
ECCV 2020
Shape Adaptor: A Learnable Resizing Module
ECCV 2020
Unsupervised Video Object Segmentation with Joint Hotspot Tracking
ECCV 2020
Unselfie: Translating Selfies to Neutral-pose Portraits in the Wild
ECCV 2020
Incorporating Reinforced Adversarial Learning in Autoregressive Image Generation
ECCV 2020
Best Frame Selection in a Short Video
WACV 2020
Scene Graph Modification Based on Natural Language Commands
EMNLP 2020
Scene Graph Generation With External Knowledge and Image Reconstruction
CVPR 2019
Foreground-Aware Image Inpainting
CVPR 2019
CapSal: Leveraging Captioning to Boost Semantics for Salient Object Detection
CVPR 2019
Expressing Visual Relationships via Language
ACL 2019
Image Super-Resolution by Neural Texture Transfer
CVPR 2019
Semantic Component Decomposition for Face Attribute Manipulation
CVPR 2019
Free-Form Image Inpainting With Gated Convolution
ICCV 2019
Fast Video Object Segmentation via Dynamic Targeting Network
ICCV 2019
Multimodal Style Transfer via Graph Cuts
ICCV 2019
Scaling Object Detection by Transferring Classification Weights
ICCV 2019
Towards High-Resolution Salient Object Detection
ICCV 2019
Neural Rejuvenation: Improving Deep Network Training by Enhancing Computational Resource Utilization
CVPR 2019
Contextual-based Image Inpainting: Infer, Match, and Translate
ECCV 2018
Stochastic Downsampling for Cost-Adjustable Inference and Improved Regularization in Convolutional Networks
CVPR 2018
Learning to Understand Image Blur
CVPR 2018
Generative Image Inpainting With Contextual Attention
CVPR 2018
Good View Hunting: Learning Photo Composition From Dense View Pairs
CVPR 2018
MAttNet: Modular Attention Network for Referring Expression Comprehension
CVPR 2018
Learning to Blend Photos
ECCV 2018
Compositing-aware Image Search
ECCV 2018
Concept Mask: Large-Scale Segmentation from Semantic Concepts
ECCV 2018
Sequence-to-Segment Networks for Segment Detection
NIPS 2018
Rethinking the Smaller-Norm-Less-Informative Assumption in Channel Pruning of Convolution Layers
ICLR 2018
Video Scene Parsing With Predictive Feature Learning
ICCV 2017
Recurrent Multimodal Interaction for Referring Image Segmentation
ICCV 2017
FoveaNet: Perspective-Aware Urban Scene Parsing
ICCV 2017
Personalized Image Aesthetics
ICCV 2017
Skeleton Key: Image Captioning by Skeleton-Attribute Decomposition
CVPR 2017
Predicting Scene Parsing and Motion Dynamics in the Future
NIPS 2017
Deep Image Harmonization
CVPR 2017
Spatial-Semantic Image Search by Visual Feature Synthesis
CVPR 2017
High-Resolution Image Inpainting Using Multi-Scale Neural Patch Synthesis
CVPR 2017
Scene Parsing With Global Context Embedding
ICCV 2017
Event-Specific Image Importance
CVPR 2016
Shortlist Selection With Residual-Aware Distance Estimator for K-Nearest Neighbor Search
CVPR 2016
A Multi-Level Contextual Model For Person Recognition in Photo Albums
CVPR 2016
Automatic Content-Aware Color and Tone Stylization
CVPR 2016
Nonlinear Hierarchical Part-Based Regression for Unconstrained Face Alignment
IJCAI 2016
Unconstrained Salient Object Detection via Proposal Subset Optimization
CVPR 2016
Towards Unified Depth and Semantic Prediction From a Single Image
CVPR 2015
PatchCut: Data-Driven Object Segmentation via Local Shape Transfer
CVPR 2015
Collaborative Feature Learning From Social Media
CVPR 2015
Salient Object Subitizing
CVPR 2015
A Convolutional Neural Network Cascade for Face Detection
CVPR 2015
Joint Object and Part Segmentation Using Deep Learned Potentials
ICCV 2015
Minimum Barrier Salient Object Detection at 80 FPS
ICCV 2015
Deep Multi-Patch Aggregation Network for Image Style, Aesthetics, and Quality Estimation
ICCV 2015
Distance Encoded Product Quantization
CVPR 2014
Efficient Boosted Exemplar-based Face Detection
CVPR 2014
Nonparametric Context Modeling of Local Appearance for Pose- and Expression-Robust Facial Landmark Localization
CVPR 2014
Fast Image Super-Resolution Based on In-Place Example Regression
CVPR 2013
Text Localization in Natural Images Using Stroke Feature Transform and Text Covariance Descriptors
ICCV 2013
Exemplar-Based Graph Matching for Robust Facial Landmark Localization
ICCV 2013
Probabilistic Elastic Matching for Pose Variant Face Verification
CVPR 2013
Large Displacement Optical Flow from Nearest Neighbor Fields
CVPR 2013
Probabilistic Elastic Part Model for Unsupervised Face Detector Adaptation
ICCV 2013
Detecting and Aligning Faces by Image Retrieval
CVPR 2013
Exemplar-Based Face Parsing
CVPR 2013