Willi Menapace
23 papers · 2020–2025 · 7 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+11 more ↓ Show less ↑
🏃 Academic Marathon (5) 🌍 Conference Polyglot (7) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐝 Cross-Pollinator (5)
🌈
Renaissance Researcher
(8)
🌍
Conference Polyglot
(7)
🏃
Academic Marathon
(5)
🤝
Dynamic Duo
(19)
🔬
Deep Specialist
(13)
💎
Century Club
(23)
❓
The Questioner
📈
Trend Setter
⚡
Prolific Year
(8)
🗃️
Keyword Collector
(126)
🔥
Unstoppable
(6)
Conferences
CVPR (13)
NIPS (3)
ECCV (2)
ICCV (2)
EMNLP (1)
ICLR (1)
ICML (1)
Top co-authors
Research topics
Keywords
video generation
(12)
diffusion model
(7)
diffusion transformer
(3)
video diffusion
(3)
text-to-video generation
(3)
multimodal learning
(3)
video synthesis
(2)
transformer architecture
(2)
video diffusion transformer
(2)
large language model
(2)
novel view synthesis
(2)
transfer learning
(1)
data annotation
(1)
model architecture
(1)
self-supervised learning
(1)
image generation
(1)
3d reconstruction
(1)
video captioning
(1)
visual grounding
(1)
zero-shot learning
(1)
Papers
Improving the Diffusability of Autoencoders
ICML 2025
Can Text-to-Video Generation help Video-Language Alignment?
CVPR 2025
4Real-Video: Learning Generalizable Photo-Realistic 4D Video Diffusion
CVPR 2025
Multi-subject Open-set Personalization in Video Generation
CVPR 2025
AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers
CVPR 2025
Mind the Time: Temporally-Controlled Multi-Event Video Generation
CVPR 2025
AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation
ICCV 2025
VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control
ICLR 2025
Harnessing Large Language Models for Training-free Video Anomaly Detection
CVPR 2024
Hierarchical Patch Diffusion Models for High-Resolution Video Generation
CVPR 2024
Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers
CVPR 2024
VIMI: Grounding Video Generation through Multi-modal Instruction
EMNLP 2024
4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models
NIPS 2024
AsCAN: Asymmetric Convolution-Attention Networks for Efficient Recognition and Generation
NIPS 2024
SF-V: Single Forward Video Generation Model
NIPS 2024
Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis
CVPR 2024
Unsupervised Volumetric Animation
CVPR 2023
Quantum Multi-Model Fitting
CVPR 2023
InfiniCity: Infinite-Scale City Synthesis
ICCV 2023
Quantum Motion Segmentation
ECCV 2022
Playable Environments: Video Manipulation in Space and Time
CVPR 2022
Playable Video Generation
CVPR 2021
Learning to Cluster under Domain Shift
ECCV 2020