Andrew Owens

51 papers · 2013–2026 · 9 conferences · across top CS/AI conferences

Achievements

+16 more ↓

🌍 Conference Polyglot (9) 🏃 Academic Marathon (13) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🐝 Cross-Pollinator (5)

🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌈 Renaissance Researcher (7) 🌟 Keyword Trendsetter Combo (3) 🏠 Conference Loyalist (24) 🤝 Dynamic Duo (12) 🌱 Topic Pioneer 🔬 Deep Specialist (15) 🏆 Keyword Champion (3) ⚡ Prolific Year (9) 📈 Trend Setter 🚀 Conference Pioneer ❓ The Questioner 🔥 Unstoppable (11) 🗃️ Keyword Collector (206) 💎 Century Club (51)

Conferences

CVPR (24) ECCV (8) ICCV (7) NIPS (4) CORL (3) WACV (2) COLING (1) EMNLP (1) ICLR (1)

Top co-authors

Ziyang Chen (12) Alexei A. Efros (7) Daniel Geng (7) Fengyu Yang (4) Yiming Dou (4) Ayush Shrivastava (4) Chao Feng (4) Oliver Wang (3) Antonio Torralba (3) Richard Zhang (2)

Keywords

multimodal learning (10) self-supervised learning (10) contrastive learning (6) image generation (6) tactile sensing (5) diffusion model (5) audio-visual learning (5) 3d reconstruction (4) depth estimation (4) representation learning (4) audio generation (4) sound localization (3) optical flow (3) random walk (3) video generation (3) sound generation (3) image classification (2) scene understanding (2) image synthesis (2) zero-shot learning (2)

Papers

Fine-grained Defocus Blur Control for Generative Image Models WACV 2026 Hearing Hands: Generating Sounds from Physical Interactions in 3D Scenes CVPR 2025 Masked Diffusion Captioning for Visual Feature Learning EMNLP 2025 Motion Prompting: Controlling Video Generation with Motion Trajectories CVPR 2025 Community Forensics: Using Thousands of Generators to Train Fake Image Detectors CVPR 2025 Cross-Sensor Touch Generation CORL 2025 Video-Guided Foley Sound Generation with Multimodal Controls CVPR 2025 Self-Supervised Spatial Correspondence Across Modalities CVPR 2025 Supervising Sound Localization by In-the-wild Egomotion CVPR 2025 GPS as a Control Signal for Image Generation CVPR 2025 Factorized Diffusion: Perceptual Illusions by Noise Decomposition ECCV 2024 Images that Sound: Composing Images and Sounds on a Single Canvas NIPS 2024 Visual Anagrams: Generating Multi-View Optical Illusions with Diffusion Models CVPR 2024 Binding Touch to Everything: Learning Unified Multimodal Tactile Representations CVPR 2024 Tactile-Augmented Radiance Fields CVPR 2024 Efficient Vision-Language Pre-training by Cluster Masking CVPR 2024 Real Acoustic Fields: An Audio-Visual Room Acoustics Dataset and Benchmark CVPR 2024 Self-Supervised Any-Point Tracking by Contrastive Random Walks ECCV 2024 Self-Supervised Audio-Visual Soundscape Stylization ECCV 2024 Motion Guidance: Diffusion-Based Image Editing with Differentiable Motion Estimators ICLR 2024 Conditional Generation of Audio From Video via Foley Analogies CVPR 2023 Self-Supervised Motion Magnification by Backpropagating Through Optical Flow NIPS 2023 Generating Visual Scenes from Touch ICCV 2023 Sound Localization from Motion: Jointly Learning Sound Direction and Camera Rotation ICCV 2023 Text2Room: Extracting Textured 3D Meshes from 2D Text-to-Image Models ICCV 2023 Sound to Visual Scene Generation by Audio-to-Visual Latent Alignment CVPR 2023 EXIF As Language: Learning Cross-Modal Associations Between Images and Camera Metadata CVPR 2023 GANmouflage: 3D Object Nondetection With Texture Fields CVPR 2023 Self-Supervised Video Forensics by Audio-Visual Anomaly Detection CVPR 2023 Sound Localization by Self-Supervised Time Delay Estimation ECCV 2022 Towards Understanding the Relation between Gestures and Language COLING 2022 Learning Visual Styles from Audio-Visual Associations ECCV 2022 Touch and Go: Learning from Human-Collected Vision and Touch NIPS 2022 Strumming to the Beat: Audio-Conditioned Contrastive Video Textures WACV 2022 Comparing Correspondences: Video Prediction With Correspondence-Wise Losses CVPR 2022 Learning Pixel Trajectories With Multiscale Contrastive Random Walks CVPR 2022 Mix and Localize: Localizing Sound Sources in Mixtures CVPR 2022 Planar Surface Reconstruction From Sparse Views ICCV 2021 Structure from Silence: Learning Scene Structure from Ambient Sound CORL 2021 CNN-Generated Images Are Surprisingly Easy to Spot... for Now CVPR 2020 Space-Time Correspondence as a Contrastive Random Walk NIPS 2020 Self-Supervised Learning of Audio-Visual Objects from Video ECCV 2020 Learning Individual Styles of Conversational Gesture CVPR 2019 Detecting Photoshopped Faces by Scripting Photoshop ICCV 2019 Audio-Visual Scene Analysis with Self-Supervised Multisensory Features ECCV 2018 Fighting Fake News: Image Splice Detection via Learned Self-Consistency ECCV 2018 The Feeling of Success: Does Touch Sensing Help Predict Grasp Outcomes? CORL 2017 Visually Indicated Sounds CVPR 2016 Camouflaging an Object from Many Viewpoints CVPR 2014 Shape Anchors for Data-Driven Multi-view Reconstruction ICCV 2013 SUN3D: A Database of Big Spaces Reconstructed Using SfM and Object Labels ICCV 2013