conftrace_

Adam Polyak

21 papers · 2017–2025 · 9 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓
+11 more ↓ 🌍 Conference Polyglot (9) πŸƒ Academic Marathon (8) 🧭 Keyword Pioneer πŸŒ‰ Interdisciplinary Bridge 🐝 Cross-Pollinator (5)
🐝 Cross-Pollinator (5) 🌈 Renaissance Researcher (7) πŸ—ΊοΈ Taxonomy Completionist (33) 🀝 Dynamic Duo (15) πŸ‘‘ Triple Crown 🧬 Topic Evolution πŸ”₯ Unstoppable (9) πŸ’Ž Century Club (21) πŸ—ƒοΈ Keyword Collector (67) ⚑ Prolific Year (5) πŸš€ Conference Pioneer

Conferences

ICLR (5) ICML (3) INTERSPEECH (3) ACL (2) CVPR (2) ECCV (2) EMNLP (2) ICCV (1) NIPS (1)

Papers

Through-The-Mask: Mask-based Motion Trajectories for Image-to-Video Generation CVPR 2025 VideoJAM: Joint Appearance-Motion Representations for Enhanced Motion Generation in Video Models ICML 2025 Video Editing via Factorized Diffusion Distillation ECCV 2024 Emu Edit: Precise Image Editing via Recognition and Generation Tasks CVPR 2024 Make-A-Video: Text-to-Video Generation without Text-Video Data ICLR 2023 kNN-Diffusion: Image Generation via Large-Scale Retrieval ICLR 2023 Pick-a-Pic: An Open Dataset of User Preferences for Text-to-Image Generation NIPS 2023 Text-To-4D Dynamic Scene Generation ICML 2023 AudioGen: Textually Guided Audio Generation ICLR 2023 Text-Free Prosody-Aware Generative Spoken Language Modeling ACL 2022 Make-a-Scene: Scene-Based Text-to-Image Generation with Human Priors ECCV 2022 Direct Speech-to-Speech Translation With Discrete Units ACL 2022 Textless Speech Emotion Conversion using Discrete & Decomposed Representations EMNLP 2022 Speech Resynthesis from Discrete Disentangled Self-Supervised Representations INTERSPEECH 2021 fairseq Sˆ2: A Scalable and Integrable Speech Synthesis Toolkit EMNLP 2021 TTS Skins: Speaker Conversion via ASR INTERSPEECH 2020 Unsupervised Cross-Domain Singing Voice Conversion INTERSPEECH 2020 A Universal Music Translation Network ICLR 2019 Fitting New Speakers Based on a Short Untranscribed Sample ICML 2018 VoiceLoop: Voice Fitting and Synthesis via a Phonological Loop ICLR 2018 Unsupervised Creation of Parameterized Avatars ICCV 2017