Willi Menapace

23 papers · 2020–2025 · 7 conferences · across top CS/AI conferences

Achievements

🏃 Academic Marathon (5) 🌍 Conference Polyglot (7) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐝 Cross-Pollinator (5)

🌈 Renaissance Researcher (8) 🤝 Dynamic Duo (19) 🔬 Deep Specialist (13) 💎 Century Club (23) ❓ The Questioner 📈 Trend Setter ⚡ Prolific Year (8) 🗃️ Keyword Collector (126) 🔥 Unstoppable (6)

Conferences

CVPR (13) NIPS (3) ECCV (2) ICCV (2) EMNLP (1) ICLR (1) ICML (1)

Top co-authors

Sergey Tulyakov (19) Aliaksandr Siarohin (18) Ivan Skorokhodov (12) Elisa Ricci (8) Hsin-Ying Lee (6) Jian Ren (5) Yuwei Fang (5) Tsai-Shien Chen (4) Ming-Hsuan Yang (3) Vladislav Golyanik (3)

Research topics

Architectures (1) Core AI (1)

Keywords

video generation (12) diffusion model (7) diffusion transformer (3) video diffusion (3) text-to-video generation (3) multimodal learning (3) video synthesis (2) transformer architecture (2) video diffusion transformer (2) large language model (2) novel view synthesis (2) transfer learning (1) data annotation (1) model architecture (1) self-supervised learning (1) image generation (1) 3d reconstruction (1) video captioning (1) visual grounding (1) zero-shot learning (1)

Papers

Improving the Diffusability of Autoencoders · ICML 2025
Can Text-to-Video Generation help Video-Language Alignment? · CVPR 2025
4Real-Video: Learning Generalizable Photo-Realistic 4D Video Diffusion · CVPR 2025
Multi-subject Open-set Personalization in Video Generation · CVPR 2025
AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers · CVPR 2025
Mind the Time: Temporally-Controlled Multi-Event Video Generation · CVPR 2025
AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation · ICCV 2025
VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control · ICLR 2025
Harnessing Large Language Models for Training-free Video Anomaly Detection · CVPR 2024
Hierarchical Patch Diffusion Models for High-Resolution Video Generation · CVPR 2024
Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers · CVPR 2024
VIMI: Grounding Video Generation through Multi-modal Instruction · EMNLP 2024
4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models · NIPS 2024
AsCAN: Asymmetric Convolution-Attention Networks for Efficient Recognition and Generation · NIPS 2024
SF-V: Single Forward Video Generation Model · NIPS 2024
Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis · CVPR 2024
Unsupervised Volumetric Animation · CVPR 2023
Quantum Multi-Model Fitting · CVPR 2023
InfiniCity: Infinite-Scale City Synthesis · ICCV 2023
Quantum Motion Segmentation · ECCV 2022
Playable Environments: Video Manipulation in Space and Time · CVPR 2022
Playable Video Generation · CVPR 2021
Learning to Cluster under Domain Shift · ECCV 2020