Lu Yuan
85 papers · 2014–2025 · 9 top CS/AI conferences
Achievements
Interdisciplinary Bridge
Academic Marathon (11)
Conference Polyglot (9)
Renaissance Researcher (6)
Taxonomy Completionist (105)
Cross-Pollinator (15)
Hot Topic Early Bird
Keyword Trendsetter Combo (4)
Conference Loyalist (41)
The Namer
Mega-Team (20)
Triple Crown
Grand Slam
Dynamic Duo (43)
Deep Specialist (17)
Topic Evolution
Keyword Champion (2)
Unstoppable (12)
The Questioner
Century Club (85)
Keyword Collector (321)
Conference Pioneer
Trend Setter
Prolific Year (13)
Conferences
CVPR (41)
NIPS (11)
ECCV (10)
ICCV (10)
AAAI (4)
ICLR (4)
EMNLP (2)
ICML (2)
NAACL (1)
Keywords
object detection (13)
zero-shot learning (10)
vision transformer (9)
contrastive learning (9)
transfer learning (7)
semantic segmentation (7)
image classification (6)
multimodal learning (6)
image generation (6)
convolutional neural network (6)
visual question answering (5)
self-supervised learning (5)
attention mechanism (4)
diffusion model (4)
data augmentation (3)
representation learning (3)
domain adaptation (3)
self-attention mechanism (3)
few-shot learning (3)
model compression (3)
Papers
Exploring Invariance in Images through One-way Wave Equations
ICML 2025
i-Code V2: An Autoregressive Generation Framework over Vision, Language, and Speech Data
NAACL 2024
Learning Subject-Aware Cropping by Outpainting Professional Photos
AAAI 2024
i-Code Studio: A Configurable and Composable Framework for Integrative AI
EMNLP 2024
Efficient Modulation for Vision Networks
ICLR 2024
Fully Authentic Visual Question Answering Dataset from Online Communities
ECCV 2024
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks
CVPR 2024
OmniViD: A Generative Framework for Universal Video Understanding
CVPR 2024
Learning from Rich Semantics and Coarse Locations for Long-tailed Object Detection
NIPS 2023
PeCo: Perceptual Codebook for BERT Pre-training of Vision Transformers
AAAI 2023
Detection Hub: Unifying Object Detection Datasets via Query Adaptation on Language Embedding
CVPR 2023
i-Code: An Integrative and Composable Multimodal Learning Framework
AAAI 2023
Layer Grafted Pre-training: Bridging Contrastive Learning And Masked Image Modeling For Label-Efficient Representations
ICLR 2023
Improving Adversarial Robustness of Masked Autoencoders via Test-time Frequency-domain Prompting
ICCV 2023
TinyCLIP: CLIP Distillation via Affinity Mimicking and Weight Inheritance
ICCV 2023
LACMA: Language-Aligning Contrastive Learning with Meta-Actions for Embodied Instruction Following
EMNLP 2023
Frido: Feature Pyramid Diffusion for Complex Scene Image Synthesis
AAAI 2023
Improving Commonsense in Vision-Language Models via Knowledge Graph Riddles
CVPR 2023
MaskCLIP: Masked Self-Distillation Advances Contrastive Language-Image Pretraining
CVPR 2023
Generalized Decoding for Pixel, Image, and Language
CVPR 2023
Masked Video Distillation: Rethinking Masked Feature Modeling for Self-Supervised Video Representation Learning
CVPR 2023
Look Before You Match: Instance Understanding Matters in Video Object Segmentation
CVPR 2023
X-Paste: Revisiting Scalable Copy-Paste for Instance Segmentation using CLIP and StableDiffusion
ICML 2023
Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models
NIPS 2023
Bootstrapped Masked Autoencoders for Vision BERT Pretraining
ECCV 2022
OmniVL: One Foundation Model for Image-Language and Video-Language Tasks
NIPS 2022
REVIVE: Regional Visual Representation Matters in Knowledge-Based Visual Question Answering
NIPS 2022
K-LITE: Learning Transferable Visual Models with External Knowledge
NIPS 2022
Visual Clues: Bridging Vision and Language Foundations for Image Paragraph Captioning
NIPS 2022
GLIPv2: Unifying Localization and Vision-Language Understanding
NIPS 2022
Mobile-Former: Bridging MobileNet and Transformer
CVPR 2022
Grounded Language-Image Pre-Training
CVPR 2022
RegionCLIP: Region-Based Language-Image Pretraining
CVPR 2022
CSWin Transformer: A General Vision Transformer Backbone With Cross-Shaped Windows
CVPR 2022
Reduce Information Loss in Transformers for Pluralistic Image Inpainting
CVPR 2022
Large-Scale Pre-Training for Person Re-Identification With Noisy Labels
CVPR 2022
BEVT: BERT Pretraining of Video Transformers
CVPR 2022
Unified Contrastive Learning in Image-Text-Label Space
CVPR 2022
HairCLIP: Design Your Hair by Text and Reference Image
CVPR 2022
An Empirical Study of Training End-to-End Vision-and-Language Transformers
CVPR 2022
MiniViT: Compressing Vision Transformers With Weight Multiplexing
CVPR 2022
General Facial Representation Learning in a Visual-Linguistic Manner
CVPR 2022
Vector Quantized Diffusion Model for Text-to-Image Synthesis
CVPR 2022
DNA: Improving Few-Shot Transfer Learning with Low-Rank Decomposition and Alignment
ECCV 2022
TinyViT: Fast Pretraining Distillation for Small Vision Transformers
ECCV 2022
DaViT: Dual Attention Vision Transformers
ECCV 2022
Should All Proposals Be Treated Equally in Object Detection?
ECCV 2022
Learning Visual Representation from Modality-Shared Contrastive Language-Image Pre-training
ECCV 2022
Efficient Self-supervised Vision Transformers for Representation Learning
ICLR 2022
Focal Attention for Long-Range Interactions in Vision Transformers
NIPS 2021
Stronger NAS with Weaker Predictors
NIPS 2021
MicroNet: Improving Image Recognition With Extremely Low FLOPs
ICCV 2021
Dynamic Transfer for Multi-Source Domain Adaptation
CVPR 2021
CvT: Introducing Convolutions to Vision Transformers
ICCV 2021
Unsupervised Pre-Training for Person Re-Identification
CVPR 2021
Dynamic Head: Unifying Object Detection Heads With Attentions
CVPR 2021
Dynamic DETR: End-to-End Object Detection With Dynamic Attention
ICCV 2021
Revisiting Dynamic Convolution via Matrix Decomposition
ICLR 2021
Improve Unsupervised Pretraining for Few-Label Transfer
ICCV 2021
Chasing Sparsity in Vision Transformers: An End-to-End Exploration
NIPS 2021
Multi-Scale Vision Longformer: A New Vision Transformer for High-Resolution Image Encoding
ICCV 2021
Lite-HRNet: A Lightweight High-Resolution Network
CVPR 2021
Dynamic Convolution: Attention Over Convolution Kernels
CVPR 2020
LSM: Learning Subspace Minimization for Low-Level Vision
CVPR 2020
Cross-Domain Correspondence Learning for Exemplar-Based Image Translation
CVPR 2020
Dynamic ReLU
ECCV 2020
DA-NAS: Data Adapted Pruning for Efficient Neural Architecture Search
ECCV 2020
GreedyFool: Distortion-Aware Sparse Adversarial Attack
NIPS 2020
Density-Aware Graph for Deep Semi-Supervised Visual Recognition
CVPR 2020
Rethinking Classification and Localization for Object Detection
CVPR 2020
Bidirectional Learning for Domain Adaptation of Semantic Segmentation
CVPR 2019
Face Parsing With RoI Tanh-Warping
CVPR 2019
Mask-Guided Portrait Editing With Conditional GANs
CVPR 2019
Deep Exemplar-Based Video Colorization
CVPR 2019
Arbitrary Style Transfer With Deep Feature Reshuffle
CVPR 2018
Towards High Performance Video Object Detection
CVPR 2018
Stereoscopic Neural Style Transfer
CVPR 2018
Decouple Learning for Parameterized Image Operators
ECCV 2018
Deep Feature Flow for Video Recognition
CVPR 2017
Coherent Online Video Style Transfer
ICCV 2017
Flow-Guided Feature Aggregation for Video Object Detection
ICCV 2017
StyleBank: An Explicit Representation for Neural Image Style Transfer
CVPR 2017
Image Deblurring Using Smartphone Inertial Sensors
CVPR 2016
Dual-Feature Warping-Based Motion Model Estimation
ICCV 2015
SteadyFlow: Spatially Smooth Optical Flow for Video Stabilization
CVPR 2014