Andrew Rouditchenko
12 papers · 2018–2025 · 6 top CS/AI conferences
Achievements
Academic Marathon (7)
Interdisciplinary Bridge
Keyword Pioneer
Conference Polyglot (6)
Cross-Pollinator (12)
Taxonomy Completionist (28)
Conference Pioneer
Keyword Collector (54)
Unstoppable (5)
Century Club (12)
The Questioner
Conferences
INTERSPEECH (5)
CVPR (3)
ACL (1)
ECCV (1)
ICCV (1)
ICLR (1)
Top co-authors
Keywords
self-supervised learning (7)
video retrieval (4)
multimodal learning (3)
zero-shot retrieval (3)
transfer learning (2)
contrastive learning (2)
image retrieval (2)
zero-shot learning (2)
instructional video (2)
audio-visual learning (2)
video understanding (1)
cross-lingual transfer (1)
weakly supervised learning (1)
action recognition (1)
vector quantization (1)
cross-modal retrieval (1)
visual grounding (1)
weakly-supervised learning (1)
temporal alignment (1)
video captioning (1)
Papers
CAV-MAE Sync: Improving Contrastive Audio-Visual Mask Autoencoders via Fine-Grained Alignment
CVPR 2025
Whisper-Flamingo: Integrating Visual Features into Whisper for Audio-Visual Speech Recognition and Translation
INTERSPEECH 2024
What When and Where? Self-Supervised Spatio-Temporal Grounding in Untrimmed Multi-Action Videos from Narrated Instructions
CVPR 2024
Contrastive Audio-Visual Masked Autoencoder
ICLR 2023
Comparison of Multilingual Self-Supervised and Weakly-Supervised Speech Pre-Training for Adaptation to Unseen Languages
INTERSPEECH 2023
Cross-Modal Discrete Representation Learning
ACL 2022
Everything at Once - Multi-Modal Fusion Transformer for Video Retrieval
CVPR 2022
Multimodal Clustering Networks for Self-Supervised Learning From Unlabeled Videos
ICCV 2021
Spoken ObjectNet: A Bias-Controlled Spoken Caption Dataset
INTERSPEECH 2021
Cascaded Multilingual Audio-Visual Learning from Videos
INTERSPEECH 2021
AVLnet: Learning Audio-Visual Language Representations from Instructional Videos
INTERSPEECH 2021
The Sound of Pixels
ECCV 2018