Lu Yuan

85 papers · 2014–2025 · 9 top CS/AI conferences

Achievements

🌉 Interdisciplinary Bridge 🏃 Academic Marathon (11) 🌍 Conference Polyglot (9) 🌈 Renaissance Researcher (6) 🗺️ Taxonomy Completionist (105)

🐝 Cross-Pollinator (15) 🐣 Hot Topic Early Bird 🌟 Keyword Trendsetter Combo (4) 🏠 Conference Loyalist (41) 📛 The Namer 👥 Mega-Team (20) 👑 Triple Crown 🏆 Grand Slam 🤝 Dynamic Duo (43) 🔬 Deep Specialist (17) 🧬 Topic Evolution 🏆 Keyword Champion (2) 🔥 Unstoppable (12) ❓ The Questioner 💎 Century Club (85) 🗃️ Keyword Collector (321) 🚀 Conference Pioneer 📈 Trend Setter ⚡ Prolific Year (13)

Conferences

CVPR (41) NIPS (11) ECCV (10) ICCV (10) AAAI (4) ICLR (4) EMNLP (2) ICML (2) NAACL (1)

Top co-authors

Dongdong Chen (43) Xiyang Dai (29) Yinpeng Chen (22) Mengchen Liu (21) Bin Xiao (16) Nenghai Yu (15) Dong Chen (14) Jianwei Yang (12) Jianmin Bao (12) Zicheng Liu (11)

Keywords

object detection (13) zero-shot learning (10) vision transformer (9) contrastive learning (9) transfer learning (7) semantic segmentation (7) image classification (6) multimodal learning (6) image generation (6) convolutional neural network (6) visual question answering (5) self-supervised learning (5) attention mechanism (4) diffusion model (4) data augmentation (3) representation learning (3) domain adaptation (3) self-attention mechanism (3) few-shot learning (3) model compression (3)

Papers

Exploring Invariance in Images through One-way Wave Equations · ICML 2025
i-Code V2: An Autoregressive Generation Framework over Vision, Language, and Speech Data · NAACL 2024
Learning Subject-Aware Cropping by Outpainting Professional Photos · AAAI 2024
i-Code Studio: A Configurable and Composable Framework for Integrative AI · EMNLP 2024
Efficient Modulation for Vision Networks · ICLR 2024
Fully Authentic Visual Question Answering Dataset from Online Communities · ECCV 2024
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks · CVPR 2024
OmniViD: A Generative Framework for Universal Video Understanding · CVPR 2024
Learning from Rich Semantics and Coarse Locations for Long-tailed Object Detection · NIPS 2023
PeCo: Perceptual Codebook for BERT Pre-training of Vision Transformers · AAAI 2023
Detection Hub: Unifying Object Detection Datasets via Query Adaptation on Language Embedding · CVPR 2023
i-Code: An Integrative and Composable Multimodal Learning Framework · AAAI 2023
Layer Grafted Pre-training: Bridging Contrastive Learning And Masked Image Modeling For Label-Efficient Representations · ICLR 2023
Improving Adversarial Robustness of Masked Autoencoders via Test-time Frequency-domain Prompting · ICCV 2023
TinyCLIP: CLIP Distillation via Affinity Mimicking and Weight Inheritance · ICCV 2023
LACMA: Language-Aligning Contrastive Learning with Meta-Actions for Embodied Instruction Following · EMNLP 2023
Frido: Feature Pyramid Diffusion for Complex Scene Image Synthesis · AAAI 2023
Improving Commonsense in Vision-Language Models via Knowledge Graph Riddles · CVPR 2023
MaskCLIP: Masked Self-Distillation Advances Contrastive Language-Image Pretraining · CVPR 2023
Generalized Decoding for Pixel, Image, and Language · CVPR 2023
Masked Video Distillation: Rethinking Masked Feature Modeling for Self-Supervised Video Representation Learning · CVPR 2023
Look Before You Match: Instance Understanding Matters in Video Object Segmentation · CVPR 2023
X-Paste: Revisiting Scalable Copy-Paste for Instance Segmentation using CLIP and StableDiffusion · ICML 2023
Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models · NIPS 2023
Bootstrapped Masked Autoencoders for Vision BERT Pretraining · ECCV 2022
OmniVL: One Foundation Model for Image-Language and Video-Language Tasks · NIPS 2022
REVIVE: Regional Visual Representation Matters in Knowledge-Based Visual Question Answering · NIPS 2022
K-LITE: Learning Transferable Visual Models with External Knowledge · NIPS 2022
Visual Clues: Bridging Vision and Language Foundations for Image Paragraph Captioning · NIPS 2022
GLIPv2: Unifying Localization and Vision-Language Understanding · NIPS 2022
Mobile-Former: Bridging MobileNet and Transformer · CVPR 2022
Grounded Language-Image Pre-Training · CVPR 2022
RegionCLIP: Region-Based Language-Image Pretraining · CVPR 2022
CSWin Transformer: A General Vision Transformer Backbone With Cross-Shaped Windows · CVPR 2022
Reduce Information Loss in Transformers for Pluralistic Image Inpainting · CVPR 2022
Large-Scale Pre-Training for Person Re-Identification With Noisy Labels · CVPR 2022
BEVT: BERT Pretraining of Video Transformers · CVPR 2022
Unified Contrastive Learning in Image-Text-Label Space · CVPR 2022
HairCLIP: Design Your Hair by Text and Reference Image · CVPR 2022
An Empirical Study of Training End-to-End Vision-and-Language Transformers · CVPR 2022
MiniViT: Compressing Vision Transformers With Weight Multiplexing · CVPR 2022
General Facial Representation Learning in a Visual-Linguistic Manner · CVPR 2022
Vector Quantized Diffusion Model for Text-to-Image Synthesis · CVPR 2022
DNA: Improving Few-Shot Transfer Learning with Low-Rank Decomposition and Alignment · ECCV 2022
TinyViT: Fast Pretraining Distillation for Small Vision Transformers · ECCV 2022
DaViT: Dual Attention Vision Transformers · ECCV 2022
Should All Proposals Be Treated Equally in Object Detection? · ECCV 2022
Learning Visual Representation from Modality-Shared Contrastive Language-Image Pre-training · ECCV 2022
Efficient Self-supervised Vision Transformers for Representation Learning · ICLR 2022
Focal Attention for Long-Range Interactions in Vision Transformers · NIPS 2021
Stronger NAS with Weaker Predictors · NIPS 2021
MicroNet: Improving Image Recognition With Extremely Low FLOPs · ICCV 2021
Dynamic Transfer for Multi-Source Domain Adaptation · CVPR 2021
CvT: Introducing Convolutions to Vision Transformers · ICCV 2021
Unsupervised Pre-Training for Person Re-Identification · CVPR 2021
Dynamic Head: Unifying Object Detection Heads With Attentions · CVPR 2021
Dynamic DETR: End-to-End Object Detection With Dynamic Attention · ICCV 2021
Revisiting Dynamic Convolution via Matrix Decomposition · ICLR 2021
Improve Unsupervised Pretraining for Few-Label Transfer · ICCV 2021
Chasing Sparsity in Vision Transformers: An End-to-End Exploration · NIPS 2021
Multi-Scale Vision Longformer: A New Vision Transformer for High-Resolution Image Encoding · ICCV 2021
Lite-HRNet: A Lightweight High-Resolution Network · CVPR 2021
Dynamic Convolution: Attention Over Convolution Kernels · CVPR 2020
LSM: Learning Subspace Minimization for Low-Level Vision · CVPR 2020
Cross-Domain Correspondence Learning for Exemplar-Based Image Translation · CVPR 2020
Dynamic ReLU · ECCV 2020
DA-NAS: Data Adapted Pruning for Efficient Neural Architecture Search · ECCV 2020
GreedyFool: Distortion-Aware Sparse Adversarial Attack · NIPS 2020
Density-Aware Graph for Deep Semi-Supervised Visual Recognition · CVPR 2020
Rethinking Classification and Localization for Object Detection · CVPR 2020
Bidirectional Learning for Domain Adaptation of Semantic Segmentation · CVPR 2019
Face Parsing With RoI Tanh-Warping · CVPR 2019
Mask-Guided Portrait Editing With Conditional GANs · CVPR 2019
Deep Exemplar-Based Video Colorization · CVPR 2019
Arbitrary Style Transfer With Deep Feature Reshuffle · CVPR 2018
Towards High Performance Video Object Detection · CVPR 2018
Stereoscopic Neural Style Transfer · CVPR 2018
Decouple Learning for Parameterized Image Operators · ECCV 2018
Deep Feature Flow for Video Recognition · CVPR 2017
Coherent Online Video Style Transfer · ICCV 2017
Flow-Guided Feature Aggregation for Video Object Detection · ICCV 2017
StyleBank: An Explicit Representation for Neural Image Style Transfer · CVPR 2017
Image Deblurring Using Smartphone Inertial Sensors · CVPR 2016
Dual-Feature Warping-Based Motion Model Estimation · ICCV 2015
SteadyFlow: Spatially Smooth Optical Flow for Video Stabilization · CVPR 2014