Andrew Owens
51 papers · 2013–2026 · 9 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+16 more ↓ Show less ↑
🌍 Conference Polyglot (9) 🏃 Academic Marathon (13) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🐝 Cross-Pollinator (5)
🌉
Interdisciplinary Bridge
🧭
Keyword Pioneer
🌈
Renaissance Researcher
(7)
🌟
Keyword Trendsetter Combo
(3)
🏠
Conference Loyalist
(24)
🤝
Dynamic Duo
(12)
🌱
Topic Pioneer
🔬
Deep Specialist
(15)
🏆
Keyword Champion
(3)
⚡
Prolific Year
(9)
📈
Trend Setter
🚀
Conference Pioneer
❓
The Questioner
🔥
Unstoppable
(11)
🗃️
Keyword Collector
(206)
💎
Century Club
(51)
Conferences
CVPR (24)
ECCV (8)
ICCV (7)
NIPS (4)
CORL (3)
WACV (2)
COLING (1)
EMNLP (1)
ICLR (1)
Top co-authors
Keywords
multimodal learning
(10)
self-supervised learning
(10)
contrastive learning
(6)
image generation
(6)
tactile sensing
(5)
diffusion model
(5)
audio-visual learning
(5)
3d reconstruction
(4)
depth estimation
(4)
representation learning
(4)
audio generation
(4)
sound localization
(3)
optical flow
(3)
random walk
(3)
video generation
(3)
sound generation
(3)
image classification
(2)
scene understanding
(2)
image synthesis
(2)
zero-shot learning
(2)
Papers
Fine-grained Defocus Blur Control for Generative Image Models
WACV 2026
Hearing Hands: Generating Sounds from Physical Interactions in 3D Scenes
CVPR 2025
Masked Diffusion Captioning for Visual Feature Learning
EMNLP 2025
Motion Prompting: Controlling Video Generation with Motion Trajectories
CVPR 2025
Community Forensics: Using Thousands of Generators to Train Fake Image Detectors
CVPR 2025
Cross-Sensor Touch Generation
CORL 2025
Video-Guided Foley Sound Generation with Multimodal Controls
CVPR 2025
Self-Supervised Spatial Correspondence Across Modalities
CVPR 2025
Supervising Sound Localization by In-the-wild Egomotion
CVPR 2025
GPS as a Control Signal for Image Generation
CVPR 2025
Factorized Diffusion: Perceptual Illusions by Noise Decomposition
ECCV 2024
Images that Sound: Composing Images and Sounds on a Single Canvas
NIPS 2024
Visual Anagrams: Generating Multi-View Optical Illusions with Diffusion Models
CVPR 2024
Binding Touch to Everything: Learning Unified Multimodal Tactile Representations
CVPR 2024
Tactile-Augmented Radiance Fields
CVPR 2024
Efficient Vision-Language Pre-training by Cluster Masking
CVPR 2024
Real Acoustic Fields: An Audio-Visual Room Acoustics Dataset and Benchmark
CVPR 2024
Self-Supervised Any-Point Tracking by Contrastive Random Walks
ECCV 2024
Self-Supervised Audio-Visual Soundscape Stylization
ECCV 2024
Motion Guidance: Diffusion-Based Image Editing with Differentiable Motion Estimators
ICLR 2024
Conditional Generation of Audio From Video via Foley Analogies
CVPR 2023
Self-Supervised Motion Magnification by Backpropagating Through Optical Flow
NIPS 2023
Generating Visual Scenes from Touch
ICCV 2023
Sound Localization from Motion: Jointly Learning Sound Direction and Camera Rotation
ICCV 2023
Text2Room: Extracting Textured 3D Meshes from 2D Text-to-Image Models
ICCV 2023
Sound to Visual Scene Generation by Audio-to-Visual Latent Alignment
CVPR 2023
EXIF As Language: Learning Cross-Modal Associations Between Images and Camera Metadata
CVPR 2023
GANmouflage: 3D Object Nondetection With Texture Fields
CVPR 2023
Self-Supervised Video Forensics by Audio-Visual Anomaly Detection
CVPR 2023
Sound Localization by Self-Supervised Time Delay Estimation
ECCV 2022
Towards Understanding the Relation between Gestures and Language
COLING 2022
Learning Visual Styles from Audio-Visual Associations
ECCV 2022
Touch and Go: Learning from Human-Collected Vision and Touch
NIPS 2022
Strumming to the Beat: Audio-Conditioned Contrastive Video Textures
WACV 2022
Comparing Correspondences: Video Prediction With Correspondence-Wise Losses
CVPR 2022
Learning Pixel Trajectories With Multiscale Contrastive Random Walks
CVPR 2022
Mix and Localize: Localizing Sound Sources in Mixtures
CVPR 2022
Planar Surface Reconstruction From Sparse Views
ICCV 2021
Structure from Silence: Learning Scene Structure from Ambient Sound
CORL 2021
CNN-Generated Images Are Surprisingly Easy to Spot... for Now
CVPR 2020
Space-Time Correspondence as a Contrastive Random Walk
NIPS 2020
Self-Supervised Learning of Audio-Visual Objects from Video
ECCV 2020
Learning Individual Styles of Conversational Gesture
CVPR 2019
Detecting Photoshopped Faces by Scripting Photoshop
ICCV 2019
Audio-Visual Scene Analysis with Self-Supervised Multisensory Features
ECCV 2018
Fighting Fake News: Image Splice Detection via Learned Self-Consistency
ECCV 2018
The Feeling of Success: Does Touch Sensing Help Predict Grasp Outcomes?
CORL 2017
Visually Indicated Sounds
CVPR 2016
Camouflaging an Object from Many Viewpoints
CVPR 2014
Shape Anchors for Data-Driven Multi-view Reconstruction
ICCV 2013
SUN3D: A Database of Big Spaces Reconstructed Using SfM and Object Labels
ICCV 2013