Ming-Yu Liu

69 papers · 2013–2025 · 13 conferences · across top CS/AI conferences

Achievements

+17 more ↓

🐝 Cross-Pollinator (9) 🏃 Academic Marathon (12) 🧭 Keyword Pioneer 🌍 Conference Polyglot (13) 🌈 Renaissance Researcher (9)

🌈 Renaissance Researcher (9) 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (106) 🏠 Conference Loyalist (27) 🌱 Topic Pioneer 🏆 Keyword Champion 🏆 Grand Slam 🧬 Topic Evolution 🔬 Deep Specialist (14) 🤝 Dynamic Duo (20) 👥 Mega-Team (28) 📈 Trend Setter 🔥 Unstoppable (13) ⚡ Prolific Year (9) 🚀 Conference Pioneer 🗃️ Keyword Collector (287) 💎 Century Club (69)

Conferences

CVPR (27) NIPS (13) ICCV (9) ECCV (7) ICLR (3) ICML (3) AAAI (1) ACL (1) CORL (1) EMNLP (1) IJCAI (1) RSS (1) WACV (1)

Top co-authors

Jan Kautz (20) Ting-Chun Wang (12) Arun Mallya (9) Qinsheng Zhang (7) Xiaodong Yang (7) Tsung-Yi Lin (7) Xun Huang (7) Oncel Tuzel (6) Ming-Hsuan Yang (5) Sanja Fidler (5)

Research topics

Computer Vision (1)

Keywords

generative model (7) semantic segmentation (6) video generation (6) generative adversarial network (5) unsupervised learning (5) image generation (4) image captioning (4) object detection (4) adversarial training (4) diffusion model (4) neural network (4) image-to-image translation (3) few-shot learning (3) image synthesis (3) video understanding (3) multimodal learning (3) 3d reconstruction (3) video synthesis (3) depth estimation (3) image translation (3)

Papers

DreamGen: Unlocking Generalization in Robot Learning through Video World Models CORL 2025 Describe Anything: Detailed Localized Image and Video Captioning ICCV 2025 Masked Diffusion Models are Secretly Time-Agnostic Masked Models and Exploit Inaccurate Categorical Sampling ICLR 2025 A Comprehensive Study of Decoder-Only LLMs for Text-to-Image Generation CVPR 2025 Direct Discriminative Optimization: Your Likelihood-Based Visual Generative Model is Secretly a GAN Discriminator ICML 2025 ArtiScene: Language-Driven Artistic 3D Scene Generation Through Image Intermediary CVPR 2025 Articulated Kinematics Distillation from Video Diffusion Models CVPR 2025 Dynamic Camera Poses and Where to Find Them CVPR 2025 One-Step Diffusion Policy: Fast Visuomotor Policies via Diffusion Distillation ICML 2025 High-Quality Joint Image and Video Tokenization with Causal VAE ICLR 2025 EdgeRunner: Auto-regressive Auto-encoder for Artistic Mesh Generation ICLR 2025 HMAR: Efficient Hierarchical Masked Auto-Regressive Image Generation CVPR 2025 CoT-VLA: Visual Chain-of-Thought Reasoning for Vision-Language-Action Models CVPR 2025 Visual Fact Checker: Enabling High-Fidelity Detailed Caption Generation CVPR 2024 Condition-Aware Neural Network for Controlled Image Generation CVPR 2024 JeDi: Joint-Image Diffusion Models for Finetuning-Free Personalized Text-to-Image Generation CVPR 2024 DiffCollage: Parallel Generation of Large Content With Diffusion Models CVPR 2023 Magic3D: High-Resolution Text-to-3D Content Creation CVPR 2023 Neuralangelo: High-Fidelity Neural Surface Reconstruction CVPR 2023 Loss-Guided Diffusion Models for Plug-and-Play Controllable Generation ICML 2023 SPACE: Speech-driven Portrait Animation with Controllable Expression ICCV 2023 Preserve Your Own Correlation: A Noise Prior for Video Diffusion Models ICCV 2023 ATT3D: Amortized Text-to-3D Object Synthesis ICCV 2023 Re-ViLM: Retrieval-Augmented Visual Language Model for Zero and Few-Shot Image Captioning EMNLP 2023 Implicit Warping for Animation with Image Sets NIPS 2022 Generating Long Videos of Dynamic Scenes NIPS 2022 Multimodal Conditional Image Synthesis with Product-of-Experts GANs ECCV 2022 Implicit Neural Representations with Levels-of-Experts NIPS 2022 GANcraft: Unsupervised 3D Neural Rendering of Minecraft Worlds ICCV 2021 One-Shot Free-View Neural Talking-Head Synthesis for Video Conferencing CVPR 2021 Deep Marching Tetrahedra: a Hybrid Representation for High-Resolution 3D Shape Synthesis NIPS 2021 SymGAN: Orientation Estimation without Annotation for Symmetric Objects WACV 2020 Learning compositional functions via multiplicative weight updates NIPS 2020 On the distance between two neural networks and the stability of learning NIPS 2020 Learning to Generate Multiple Style Transfer Outputs for an Input Sentence ACL 2020 UNAS: Differentiable Architecture Search Meets Reinforcement Learning CVPR 2020 Instance-Aware, Context-Focused, and Memory-Efficient Weakly Supervised Object Detection CVPR 2020 COCO-FUNIT: Few-Shot Unsupervised Image Translation with a Content Conditioned Style Encoder ECCV 2020 World-Consistent Video-to-Video Synthesis ECCV 2020 UFO²: A Unified Framework towards Omni-supervised Object Detection ECCV 2020 STEP: Spatio-Temporal Progressive Learning for Video Action Detection CVPR 2019 Neural Turtle Graphics for Modeling City Road Layouts ICCV 2019 PointFlow: 3D Point Cloud Generation With Continuous Normalizing Flows ICCV 2019 Meta-Sim: Learning to Generate Synthetic Datasets ICCV 2019 Few-Shot Unsupervised Image-to-Image Translation ICCV 2019 Unsupervised Stylish Image Description Generation via Domain Layer Norm AAAI 2019 Dancing to Music NIPS 2019 Few-shot Video-to-Video Synthesis NIPS 2019 CityFlow: A City-Scale Benchmark for Multi-Target Multi-Camera Vehicle Tracking and Re-Identification CVPR 2019 Semantic Image Synthesis With Spatially-Adaptive Normalization CVPR 2019 PWC-Net: CNNs for Optical Flow Using Pyramid, Warping, and Cost Volume CVPR 2018 High-Resolution Image Synthesis and Semantic Manipulation With Conditional GANs CVPR 2018 MoCoGAN: Decomposing Motion and Content for Video Generation CVPR 2018 Video-to-Video Synthesis NIPS 2018 Context-aware Synthesis and Placement of Object Instances NIPS 2018 Learning Superpixels With Segmentation-Aware Affinity Loss CVPR 2018 A Closed-form Solution to Photorealistic Image Stylization ECCV 2018 Superpixel Sampling Networks ECCV 2018 Multimodal Unsupervised Image-to-image Translation ECCV 2018 Unsupervised Image-to-Image Translation Networks NIPS 2017 Deep 360 Pilot: Learning a Deep Agent for Piloting Through 360deg Sports Videos CVPR 2017 CASENet: Deep Category-Aware Semantic Edge Detection CVPR 2017 Tactics of Adversarial Attack on Deep Reinforcement Learning Agents IJCAI 2017 Gaussian Conditional Random Field Network for Semantic Segmentation CVPR 2016 Deep Gaussian Conditional Random Field Network: A Model-Based Deep Network for Discriminative Denoising CVPR 2016 Coupled Generative Adversarial Networks NIPS 2016 Layered Interpretation of Street View Images RSS 2015 Recursive Context Propagation Network for Semantic Scene Labeling NIPS 2014 Joint Geodesic Upsampling of Depth Images CVPR 2013