Yibing Song

57 papers · 2017–2026 · 8 conferences · across top CS/AI conferences

Achievements

+14 more ↓

🐝 Cross-Pollinator (9) 🌍 Conference Polyglot (7) 🧭 Keyword Pioneer 🏃 Academic Marathon (8) 🌈 Renaissance Researcher (7)

🌈 Renaissance Researcher (7) 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (76) 🏠 Conference Loyalist (20) 🔬 Deep Specialist (10) 👑 Triple Crown 🧬 Topic Evolution 🏆 Grand Slam 🔥 Unstoppable (9) 📈 Trend Setter ⚡ Prolific Year (5) 🚀 Conference Pioneer 💎 Century Club (56) 🗃️ Keyword Collector (213)

Conferences

CVPR (20) ICLR (11) NIPS (9) ICCV (8) ECCV (4) ICML (2) IJCAI (2) AAAI (1)

Top co-authors

Wei Liu (8) Hongyu Liu (7) Jue Wang (7) Chongjian GE (7) Chao Ma (7) Ping Luo (6) Yong Zhang (6) Lingqiao Liu (5) Li Yuan (5) Liang Chen (5)

Keywords

self-supervised learning (6) representation learning (5) convolutional neural network (5) domain generalization (5) image generation (5) vision transformer (4) vision-language model (4) object tracking (3) multimodal learning (3) image restoration (3) visual tracking (3) zero-shot learning (2) contrastive learning (2) deepfake detection (2) image reconstruction (2) image editing (2) unsupervised learning (2) knowledge distillation (2) 3d reconstruction (2) image classification (2)

Papers

AsFT: Anchoring Safety During LLM Fine-Tuning Within Narrow Safety Basin AAAI 2026 Dynamic Diffusion Transformer ICLR 2025 REMEDY: Recipe Merging Dynamics in Large Vision-Language Models ICLR 2025 AutoCGP: Closed-Loop Concept-Guided Policies from Unlabeled Demonstrations ICLR 2025 Re-Aligning Language to Visual Objects with an Agentic Workflow ICLR 2025 AvatarArtist: Open-Domain 4D Avatarization CVPR 2025 Advancing Textual Prompt Learning with Anchored Attributes ICCV 2025 LLaVA-CoT: Let Vision Language Models Reason Step-by-Step ICCV 2025 UPME: An Unsupervised Peer Review Framework for Multimodal Large Language Model Evaluation CVPR 2025 A Stitch in Time Saves Nine: Small VLM is a Precise Guidance for Accelerating Large VLMs CVPR 2025 Foley-Flow: Coordinated Video-to-Audio Generation with Masked Audio-Visual Alignment and Dynamic Conditional Flows CVPR 2025 PiCO: Peer Review in LLMs based on Consistency Optimization ICLR 2025 Aligning Audio-Visual Joint Representations with an Agentic Workflow NIPS 2024 LFME: A Simple Framework for Learning from Multiple Experts in Domain Generalization NIPS 2024 Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation NIPS 2024 Image Inpainting via Iteratively Decoupled Probabilistic Modeling ICLR 2024 InstructDET: Diversifying Referring Object Detection with Generalized Instructions ICLR 2024 Both Diverse and Realism Matter: Physical Attribute and Style Alignment for Rainy Image Generation ICCV 2023 Improved Test-Time Adaptation for Domain Generalization CVPR 2023 Delving StyleGAN Inversion for Image Editing: A Foundation Latent Space Viewpoint CVPR 2023 Advancing Visual Grounding With Scene Knowledge: Benchmark and Method CVPR 2023 Domain Generalization via Rationale Invariance ICCV 2023 Bridging Vision and Language Encoders: Parameter-Efficient Tuning for Referring Image Segmentation ICCV 2023 Efficient Video Action Detection with Token Dropout and Context Refinement ICCV 2023 DiffusionDet: Diffusion Model for Object Detection ICCV 2023 Soft Neighbors are Positive Supporters in Contrastive Visual Representation Learning ICLR 2023 Human MotionFormer: Transferring Human Motions with Vision Transformers ICLR 2023 Evolving Semantic Prototype Improves Generative Zero-Shot Learning ICML 2023 VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training NIPS 2022 OST: Improving Generalization of DeepFake Detection via One-Shot Test-Time Training NIPS 2022 One Model to Edit Them All: Free-Form Text-Driven Image Manipulation with Semantic Modulations NIPS 2022 Self-Supervised Learning of Adversarial Example: Towards Good Generalizations for Deepfake Detection CVPR 2022 EViT: Expediting Vision Transformers via Token Reorganizations ICLR 2022 DynaMixer: A Vision MLP Architecture with Dynamic Mixing ICML 2022 AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition NIPS 2022 Disentangled Cycle Consistency for Highly-Realistic Virtual Try-On CVPR 2021 Revitalizing CNN Attention via Transformers in Self-Supervised Visual Representation Learning NIPS 2021 Stabilized Medical Image Attacks ICLR 2021 Parser-Free Virtual Try-On via Distilling Appearance Flows CVPR 2021 DeFLOCNet: Deep Image Editing via Flexible Low-Level Controls CVPR 2021 IoU Attack: Towards Temporally Coherent Black-Box Adversarial Attack for Visual Object Tracking CVPR 2021 ArtFlow: Unbiased Image Style Transfer via Reversible Neural Flows CVPR 2021 PD-GAN: Probabilistic Diverse GAN for Image Inpainting CVPR 2021 VideoMoCo: Contrastive Video Representation Learning With Temporally Adversarial Examples CVPR 2021 Rethinking Image Inpainting via a Mutual Encoder-Decoder with Feature Equalizations ECCV 2020 Robust Tracking against Adversarial Attacks ECCV 2020 Rethinking Image Deraining via Rain Streaks and Vapors ECCV 2020 MVF-Net: Multi-View 3D Face Morphable Model Regression CVPR 2019 Unsupervised Deep Tracking CVPR 2019 Deep Attentive Tracking via Reciprocative Learning NIPS 2018 VITAL: VIsual Tracking via Adversarial Learning CVPR 2018 Look Deeper into Depth: Monocular Depth Estimation with Semantic Booster and Attention-Driven Loss ECCV 2018 Dynamic Scene Deblurring Using Spatially Variant Recurrent Neural Networks CVPR 2018 Image Correction via Deep Reciprocating HDR Transformation CVPR 2018 Fast Preprocessing for Robust Face Sketch Synthesis IJCAI 2017 CREST: Convolutional Residual Learning for Visual Tracking ICCV 2017 Learning to Hallucinate Face Images via Component Generation and Enhancement IJCAI 2017