Yibing Song
57 papers · 2017–2026 · 8 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+14 more ↓ Show less ↑
π Cross-Pollinator (9) π Conference Polyglot (7) π§ Keyword Pioneer π Academic Marathon (8) π Renaissance Researcher (7)
π
Renaissance Researcher
(7)
π
Interdisciplinary Bridge
πΊοΈ
Taxonomy Completionist
(76)
π
Conference Loyalist
(20)
π¬
Deep Specialist
(10)
π
Triple Crown
π§¬
Topic Evolution
π
Grand Slam
π₯
Unstoppable
(9)
π
Trend Setter
β‘
Prolific Year
(5)
π
Conference Pioneer
π
Century Club
(56)
ποΈ
Keyword Collector
(213)
Conferences
CVPR (20)
ICLR (11)
NIPS (9)
ICCV (8)
ECCV (4)
ICML (2)
IJCAI (2)
AAAI (1)
Top co-authors
Keywords
self-supervised learning
(6)
representation learning
(5)
convolutional neural network
(5)
domain generalization
(5)
image generation
(5)
vision transformer
(4)
vision-language model
(4)
object tracking
(3)
multimodal learning
(3)
image restoration
(3)
visual tracking
(3)
zero-shot learning
(2)
contrastive learning
(2)
deepfake detection
(2)
image reconstruction
(2)
image editing
(2)
unsupervised learning
(2)
knowledge distillation
(2)
3d reconstruction
(2)
image classification
(2)
Papers
AsFT: Anchoring Safety During LLM Fine-Tuning Within Narrow Safety Basin
AAAI 2026
Dynamic Diffusion Transformer
ICLR 2025
REMEDY: Recipe Merging Dynamics in Large Vision-Language Models
ICLR 2025
AutoCGP: Closed-Loop Concept-Guided Policies from Unlabeled Demonstrations
ICLR 2025
Re-Aligning Language to Visual Objects with an Agentic Workflow
ICLR 2025
AvatarArtist: Open-Domain 4D Avatarization
CVPR 2025
Advancing Textual Prompt Learning with Anchored Attributes
ICCV 2025
LLaVA-CoT: Let Vision Language Models Reason Step-by-Step
ICCV 2025
UPME: An Unsupervised Peer Review Framework for Multimodal Large Language Model Evaluation
CVPR 2025
A Stitch in Time Saves Nine: Small VLM is a Precise Guidance for Accelerating Large VLMs
CVPR 2025
Foley-Flow: Coordinated Video-to-Audio Generation with Masked Audio-Visual Alignment and Dynamic Conditional Flows
CVPR 2025
PiCO: Peer Review in LLMs based on Consistency Optimization
ICLR 2025
Aligning Audio-Visual Joint Representations with an Agentic Workflow
NIPS 2024
LFME: A Simple Framework for Learning from Multiple Experts in Domain Generalization
NIPS 2024
Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation
NIPS 2024
Image Inpainting via Iteratively Decoupled Probabilistic Modeling
ICLR 2024
InstructDET: Diversifying Referring Object Detection with Generalized Instructions
ICLR 2024
Both Diverse and Realism Matter: Physical Attribute and Style Alignment for Rainy Image Generation
ICCV 2023
Improved Test-Time Adaptation for Domain Generalization
CVPR 2023
Delving StyleGAN Inversion for Image Editing: A Foundation Latent Space Viewpoint
CVPR 2023
Advancing Visual Grounding With Scene Knowledge: Benchmark and Method
CVPR 2023
Domain Generalization via Rationale Invariance
ICCV 2023
Bridging Vision and Language Encoders: Parameter-Efficient Tuning for Referring Image Segmentation
ICCV 2023
Efficient Video Action Detection with Token Dropout and Context Refinement
ICCV 2023
DiffusionDet: Diffusion Model for Object Detection
ICCV 2023
Soft Neighbors are Positive Supporters in Contrastive Visual Representation Learning
ICLR 2023
Human MotionFormer: Transferring Human Motions with Vision Transformers
ICLR 2023
Evolving Semantic Prototype Improves Generative Zero-Shot Learning
ICML 2023
VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
NIPS 2022
OST: Improving Generalization of DeepFake Detection via One-Shot Test-Time Training
NIPS 2022
One Model to Edit Them All: Free-Form Text-Driven Image Manipulation with Semantic Modulations
NIPS 2022
Self-Supervised Learning of Adversarial Example: Towards Good Generalizations for Deepfake Detection
CVPR 2022
EViT: Expediting Vision Transformers via Token Reorganizations
ICLR 2022
DynaMixer: A Vision MLP Architecture with Dynamic Mixing
ICML 2022
AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition
NIPS 2022
Disentangled Cycle Consistency for Highly-Realistic Virtual Try-On
CVPR 2021
Revitalizing CNN Attention via Transformers in Self-Supervised Visual Representation Learning
NIPS 2021
Stabilized Medical Image Attacks
ICLR 2021
Parser-Free Virtual Try-On via Distilling Appearance Flows
CVPR 2021
DeFLOCNet: Deep Image Editing via Flexible Low-Level Controls
CVPR 2021
IoU Attack: Towards Temporally Coherent Black-Box Adversarial Attack for Visual Object Tracking
CVPR 2021
ArtFlow: Unbiased Image Style Transfer via Reversible Neural Flows
CVPR 2021
PD-GAN: Probabilistic Diverse GAN for Image Inpainting
CVPR 2021
VideoMoCo: Contrastive Video Representation Learning With Temporally Adversarial Examples
CVPR 2021
Rethinking Image Inpainting via a Mutual Encoder-Decoder with Feature Equalizations
ECCV 2020
Robust Tracking against Adversarial Attacks
ECCV 2020
Rethinking Image Deraining via Rain Streaks and Vapors
ECCV 2020
MVF-Net: Multi-View 3D Face Morphable Model Regression
CVPR 2019
Unsupervised Deep Tracking
CVPR 2019
Deep Attentive Tracking via Reciprocative Learning
NIPS 2018
VITAL: VIsual Tracking via Adversarial Learning
CVPR 2018
Look Deeper into Depth: Monocular Depth Estimation with Semantic Booster and Attention-Driven Loss
ECCV 2018
Dynamic Scene Deblurring Using Spatially Variant Recurrent Neural Networks
CVPR 2018
Image Correction via Deep Reciprocating HDR Transformation
CVPR 2018
Fast Preprocessing for Robust Face Sketch Synthesis
IJCAI 2017
CREST: Convolutional Residual Learning for Visual Tracking
ICCV 2017
Learning to Hallucinate Face Images via Component Generation and Enhancement
IJCAI 2017