Ming-Yu Liu
69 papers · 2013–2025 · 13 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+17 more ↓ Show less ↑
π Cross-Pollinator (9) π Academic Marathon (12) π§ Keyword Pioneer π Conference Polyglot (13) π Renaissance Researcher (9)
π
Renaissance Researcher
(9)
π
Interdisciplinary Bridge
πΊοΈ
Taxonomy Completionist
(106)
π
Conference Loyalist
(27)
π±
Topic Pioneer
π
Keyword Champion
π
Grand Slam
π§¬
Topic Evolution
π¬
Deep Specialist
(14)
π€
Dynamic Duo
(20)
π₯
Mega-Team
(28)
π
Trend Setter
π₯
Unstoppable
(13)
β‘
Prolific Year
(9)
π
Conference Pioneer
ποΈ
Keyword Collector
(287)
π
Century Club
(69)
Conferences
CVPR (27)
NIPS (13)
ICCV (9)
ECCV (7)
ICLR (3)
ICML (3)
AAAI (1)
ACL (1)
CORL (1)
EMNLP (1)
IJCAI (1)
RSS (1)
WACV (1)
Top co-authors
Research topics
Keywords
generative model
(7)
semantic segmentation
(6)
video generation
(6)
generative adversarial network
(5)
unsupervised learning
(5)
image generation
(4)
image captioning
(4)
object detection
(4)
adversarial training
(4)
diffusion model
(4)
neural network
(4)
image-to-image translation
(3)
few-shot learning
(3)
image synthesis
(3)
video understanding
(3)
multimodal learning
(3)
3d reconstruction
(3)
video synthesis
(3)
depth estimation
(3)
image translation
(3)
Papers
DreamGen: Unlocking Generalization in Robot Learning through Video World Models
CORL 2025
Describe Anything: Detailed Localized Image and Video Captioning
ICCV 2025
Masked Diffusion Models are Secretly Time-Agnostic Masked Models and Exploit Inaccurate Categorical Sampling
ICLR 2025
A Comprehensive Study of Decoder-Only LLMs for Text-to-Image Generation
CVPR 2025
Direct Discriminative Optimization: Your Likelihood-Based Visual Generative Model is Secretly a GAN Discriminator
ICML 2025
ArtiScene: Language-Driven Artistic 3D Scene Generation Through Image Intermediary
CVPR 2025
Articulated Kinematics Distillation from Video Diffusion Models
CVPR 2025
Dynamic Camera Poses and Where to Find Them
CVPR 2025
One-Step Diffusion Policy: Fast Visuomotor Policies via Diffusion Distillation
ICML 2025
High-Quality Joint Image and Video Tokenization with Causal VAE
ICLR 2025
EdgeRunner: Auto-regressive Auto-encoder for Artistic Mesh Generation
ICLR 2025
HMAR: Efficient Hierarchical Masked Auto-Regressive Image Generation
CVPR 2025
CoT-VLA: Visual Chain-of-Thought Reasoning for Vision-Language-Action Models
CVPR 2025
Visual Fact Checker: Enabling High-Fidelity Detailed Caption Generation
CVPR 2024
Condition-Aware Neural Network for Controlled Image Generation
CVPR 2024
JeDi: Joint-Image Diffusion Models for Finetuning-Free Personalized Text-to-Image Generation
CVPR 2024
DiffCollage: Parallel Generation of Large Content With Diffusion Models
CVPR 2023
Magic3D: High-Resolution Text-to-3D Content Creation
CVPR 2023
Neuralangelo: High-Fidelity Neural Surface Reconstruction
CVPR 2023
Loss-Guided Diffusion Models for Plug-and-Play Controllable Generation
ICML 2023
SPACE: Speech-driven Portrait Animation with Controllable Expression
ICCV 2023
Preserve Your Own Correlation: A Noise Prior for Video Diffusion Models
ICCV 2023
ATT3D: Amortized Text-to-3D Object Synthesis
ICCV 2023
Re-ViLM: Retrieval-Augmented Visual Language Model for Zero and Few-Shot Image Captioning
EMNLP 2023
Implicit Warping for Animation with Image Sets
NIPS 2022
Generating Long Videos of Dynamic Scenes
NIPS 2022
Multimodal Conditional Image Synthesis with Product-of-Experts GANs
ECCV 2022
Implicit Neural Representations with Levels-of-Experts
NIPS 2022
GANcraft: Unsupervised 3D Neural Rendering of Minecraft Worlds
ICCV 2021
One-Shot Free-View Neural Talking-Head Synthesis for Video Conferencing
CVPR 2021
Deep Marching Tetrahedra: a Hybrid Representation for High-Resolution 3D Shape Synthesis
NIPS 2021
SymGAN: Orientation Estimation without Annotation for Symmetric Objects
WACV 2020
Learning compositional functions via multiplicative weight updates
NIPS 2020
On the distance between two neural networks and the stability of learning
NIPS 2020
Learning to Generate Multiple Style Transfer Outputs for an Input Sentence
ACL 2020
UNAS: Differentiable Architecture Search Meets Reinforcement Learning
CVPR 2020
Instance-Aware, Context-Focused, and Memory-Efficient Weakly Supervised Object Detection
CVPR 2020
COCO-FUNIT: Few-Shot Unsupervised Image Translation with a Content Conditioned Style Encoder
ECCV 2020
World-Consistent Video-to-Video Synthesis
ECCV 2020
UFOΒ²: A Unified Framework towards Omni-supervised Object Detection
ECCV 2020
STEP: Spatio-Temporal Progressive Learning for Video Action Detection
CVPR 2019
Neural Turtle Graphics for Modeling City Road Layouts
ICCV 2019
PointFlow: 3D Point Cloud Generation With Continuous Normalizing Flows
ICCV 2019
Meta-Sim: Learning to Generate Synthetic Datasets
ICCV 2019
Few-Shot Unsupervised Image-to-Image Translation
ICCV 2019
Unsupervised Stylish Image Description Generation via Domain Layer Norm
AAAI 2019
Dancing to Music
NIPS 2019
Few-shot Video-to-Video Synthesis
NIPS 2019
CityFlow: A City-Scale Benchmark for Multi-Target Multi-Camera Vehicle Tracking and Re-Identification
CVPR 2019
Semantic Image Synthesis With Spatially-Adaptive Normalization
CVPR 2019
PWC-Net: CNNs for Optical Flow Using Pyramid, Warping, and Cost Volume
CVPR 2018
High-Resolution Image Synthesis and Semantic Manipulation With Conditional GANs
CVPR 2018
MoCoGAN: Decomposing Motion and Content for Video Generation
CVPR 2018
Video-to-Video Synthesis
NIPS 2018
Context-aware Synthesis and Placement of Object Instances
NIPS 2018
Learning Superpixels With Segmentation-Aware Affinity Loss
CVPR 2018
A Closed-form Solution to Photorealistic Image Stylization
ECCV 2018
Superpixel Sampling Networks
ECCV 2018
Multimodal Unsupervised Image-to-image Translation
ECCV 2018
Unsupervised Image-to-Image Translation Networks
NIPS 2017
Deep 360 Pilot: Learning a Deep Agent for Piloting Through 360deg Sports Videos
CVPR 2017
CASENet: Deep Category-Aware Semantic Edge Detection
CVPR 2017
Tactics of Adversarial Attack on Deep Reinforcement Learning Agents
IJCAI 2017
Gaussian Conditional Random Field Network for Semantic Segmentation
CVPR 2016
Deep Gaussian Conditional Random Field Network: A Model-Based Deep Network for Discriminative Denoising
CVPR 2016
Coupled Generative Adversarial Networks
NIPS 2016
Layered Interpretation of Street View Images
RSS 2015
Recursive Context Propagation Network for Semantic Scene Labeling
NIPS 2014
Joint Geodesic Upsampling of Depth Images
CVPR 2013