Arda Senocak
10 papers · 2018–2025 · 5 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+5 more ↓ Show less ↑
π Cross-Pollinator (15) πΊοΈ Taxonomy Completionist (18) π§ Keyword Pioneer π£ Hot Topic Early Bird π Conference Polyglot (5)
π
Academic Marathon
(7)
π
Interdisciplinary Bridge
π
Keyword Champion
(4)
π
Century Club
(10)
β
The Questioner
Conferences
CVPR (3)
WACV (3)
INTERSPEECH (2)
ICCV (1)
ICLR (1)
Top co-authors
Keywords
sound source localization
(4)
sound localization
(3)
multimodal learning
(2)
audio classification
(2)
audio-visual learning
(2)
cross-modal retrieval
(2)
audio spectrogram transformer
(2)
cross-modal learning
(2)
attention mechanism
(1)
video understanding
(1)
contrastive learning
(1)
visual grounding
(1)
model architecture
(1)
semantic matching
(1)
multisensory integration
(1)
semi-supervised learning
(1)
audio-visual fusion
(1)
audio-visual correspondence
(1)
semantic segmentation
(1)
representation learning
(1)
Papers
AVHBench: A Cross-Modal Hallucination Benchmark for Audio-Visual Large Language Models
ICLR 2025
Seeing Speech and Sound: Distinguishing and Locating Audio Sources in Visual Scenes
CVPR 2025
Can CLIP Help Sound Source Localization?
WACV 2024
ElasticAST: An Audio Spectrogram Transformer for All Length and Resolutions
INTERSPEECH 2024
FlexiAST: Flexibility is What AST Needs
INTERSPEECH 2023
Sound to Visual Scene Generation by Audio-to-Visual Latent Alignment
CVPR 2023
Sound Source Localization is All about Cross-Modal Alignment
ICCV 2023
Event-Specific Audio-Visual Fusion Layers: A Simple and New Perspective on Video Understanding
WACV 2023
Less Can Be More: Sound Source Localization With a Classification Model
WACV 2022
Learning to Localize Sound Source in Visual Scenes
CVPR 2018