Bryan Russell
31 papers · 2007–2025 · 6 conferences · across top CS/AI conferences
Achievements
Academic Marathon (18)
Interdisciplinary Bridge
Keyword Pioneer
Cross-Pollinator (13)
Conference Polyglot (6)
Keyword Trendsetter Combo (5)
Dynamic Duo (11)
Keyword Champion (2)
Topic Evolution
Trend Setter
Unstoppable (10)
Keyword Collector (169)
Century Club (31)
Conference Pioneer
Conferences
CVPR (13)
ICCV (7)
NIPS (7)
ECCV (2)
EMNLP (1)
ICLR (1)
Keywords
video understanding (4)
multimodal learning (4)
self-supervised learning (4)
cross-modal retrieval (3)
audio-visual learning (3)
view synthesis (2)
convolutional neural network (2)
surface normal prediction (2)
video retrieval (2)
neural rendering (2)
contrastive learning (2)
image matching (2)
vision-language model (2)
neural radiance field (2)
pose estimation (2)
3d vision (2)
music recommendation (2)
depth estimation (2)
scene understanding (2)
video localization (2)
Papers
ResidualViT for Efficient Temporally Dense Video Encoding
ICCV 2025
Improving Personalized Search with Regularized Low-Rank Parameter Updates
CVPR 2025
Discovering Divergent Representations between Text-to-Image Models
ICCV 2025
Video-Guided Foley Sound Generation with Multimodal Controls
CVPR 2025
Koala: Key Frame-Conditioned Long Video-LLM
CVPR 2024
Language-Guided Audio-Visual Source Separation via Trimodal Consistency
CVPR 2023
Conditional Generation of Audio From Video via Foley Analogies
CVPR 2023
Language-Guided Music Recommendation for Video via Prompt Analogies
CVPR 2023
Meta-Personalizing Vision-Language Models To Find Named Instances in Video
CVPR 2023
Neural Volumetric Object Selection
CVPR 2022
It's Time for Artistic Correspondence in Music and Video
CVPR 2022
Monocular Dynamic View Synthesis: A Reality Check
NIPS 2022
Focal Length and Object Pose Estimation via Render and Compare
CVPR 2022
Weakly Supervised Human-Object Interaction Detection in Video via Contrastive Spatiotemporal Regions
ICCV 2021
Editing Conditional Radiance Fields
ICCV 2021
Look at What I'm Doing: Self-Supervised Spatial Grounding of Narrations in Instructional Videos
NIPS 2021
Contact and Human Dynamics from Monocular Video
ECCV 2020
Telling Left From Right: Learning Spatial Correspondence of Sight and Sound
CVPR 2020
Bounce and Learn: Modeling Scene Dynamics with Real-World Bounces
ICLR 2019
Learning elementary structures for 3D shape generation and matching
NIPS 2019
FreiHAND: A Dataset for Markerless Capture of Hand Pose and Shape From Single RGB Images
ICCV 2019
Neural Re-Simulation for Generating Bounces in Single Images
ICCV 2019
Localizing Moments in Video with Temporal Language
EMNLP 2018
BodyNet: Volumetric Inference of 3D Human Body Shapes
ECCV 2018
ActionVLAD: Learning Spatio-Temporal Aggregation for Action Classification
CVPR 2017
Localizing Moments in Video With Natural Language
ICCV 2017
Marr Revisited: 2D-3D Alignment via Surface Normal Prediction
CVPR 2016
SURGE: Surface Regularized Geometry Estimation from a Single Image
NIPS 2016
Localizing 3D cuboids in single-view images
NIPS 2012
Segmenting Scenes by Matching Image Composites
NIPS 2009
Object Recognition by Scene Alignment
NIPS 2007