Bryan Russell

31 papers · 2007–2025 · 6 conferences · across top CS/AI conferences

Achievements

+12 more ↓

🏃 Academic Marathon (18) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (6) 🐝 Cross-Pollinator (13)

🧭 Keyword Pioneer 🐝 Cross-Pollinator (13) 🌍 Conference Polyglot (6) 🌟 Keyword Trendsetter Combo (5) 🤝 Dynamic Duo (11) 🏆 Keyword Champion (2) 🧬 Topic Evolution 📈 Trend Setter 🔥 Unstoppable (10) 🗃️ Keyword Collector (169) 💎 Century Club (31) 🚀 Conference Pioneer

Conferences

CVPR (13) ICCV (7) NIPS (7) ECCV (2) EMNLP (1) ICLR (1)

Top co-authors

Josef Sivic (11) Justin Salamon (6) Fabian Caba Heilbron (4) Jimei Yang (3) Trevor Darrell (3) Abhinav Gupta (3) Reuben Tan (3) Kate Saenko (3) Antonio Torralba (3) Oliver Wang (3)

Keywords

video understanding (4) multimodal learning (4) self-supervised learning (4) cross-modal retrieval (3) audio-visual learning (3) view synthesis (2) convolutional neural network (2) surface normal prediction (2) video retrieval (2) neural rendering (2) contrastive learning (2) image matching (2) vision-language model (2) neural radiance field (2) pose estimation (2) 3d vision (2) music recommendation (2) depth estimation (2) scene understanding (2) video localization (2)

Papers

ResidualViT for Efficient Temporally Dense Video Encoding ICCV 2025 Improving Personalized Search with Regularized Low-Rank Parameter Updates CVPR 2025 Discovering Divergent Representations between Text-to-Image Models ICCV 2025 Video-Guided Foley Sound Generation with Multimodal Controls CVPR 2025 Koala: Key Frame-Conditioned Long Video-LLM CVPR 2024 Language-Guided Audio-Visual Source Separation via Trimodal Consistency CVPR 2023 Conditional Generation of Audio From Video via Foley Analogies CVPR 2023 Language-Guided Music Recommendation for Video via Prompt Analogies CVPR 2023 Meta-Personalizing Vision-Language Models To Find Named Instances in Video CVPR 2023 Neural Volumetric Object Selection CVPR 2022 It's Time for Artistic Correspondence in Music and Video CVPR 2022 Monocular Dynamic View Synthesis: A Reality Check NIPS 2022 Focal Length and Object Pose Estimation via Render and Compare CVPR 2022 Weakly Supervised Human-Object Interaction Detection in Video via Contrastive Spatiotemporal Regions ICCV 2021 Editing Conditional Radiance Fields ICCV 2021 Look at What I’m Doing: Self-Supervised Spatial Grounding of Narrations in Instructional Videos NIPS 2021 Contact and Human Dynamics from Monocular Video ECCV 2020 Telling Left From Right: Learning Spatial Correspondence of Sight and Sound CVPR 2020 Bounce and Learn: Modeling Scene Dynamics with Real-World Bounces ICLR 2019 Learning elementary structures for 3D shape generation and matching NIPS 2019 FreiHAND: A Dataset for Markerless Capture of Hand Pose and Shape From Single RGB Images ICCV 2019 Neural Re-Simulation for Generating Bounces in Single Images ICCV 2019 Localizing Moments in Video with Temporal Language EMNLP 2018 BodyNet: Volumetric Inference of 3D Human Body Shapes ECCV 2018 ActionVLAD: Learning Spatio-Temporal Aggregation for Action Classification CVPR 2017 Localizing Moments in Video With Natural Language ICCV 2017 Marr Revisited: 2D-3D Alignment via Surface Normal Prediction CVPR 2016 SURGE: Surface Regularized Geometry Estimation from a Single Image NIPS 2016 Localizing 3D cuboids in single-view images NIPS 2012 Segmenting Scenes by Matching Image Composites NIPS 2009 Object Recognition by Scene Alignment NIPS 2007