John R. Hershey

20 papers · 2006–2025 · 9 conferences · across top CS/AI conferences

Achievements

+12 more ↓

🐣 Hot Topic Early Bird 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🗺️ Taxonomy Completionist (12) 🌍 Conference Polyglot (9)

🌈 Renaissance Researcher (5) 🗺️ Taxonomy Completionist (12) 🧭 Keyword Pioneer 🌟 Keyword Trendsetter Combo (3) 🏆 Keyword Champion 🧬 Topic Evolution 🌱 Topic Pioneer 🗃️ Keyword Collector (76) 📈 Trend Setter 💎 Century Club (20) 🔥 Unstoppable (10) 🚀 Conference Pioneer

Conferences

INTERSPEECH (10) ICLR (2) NIPS (2) ACL (1) CVPR (1) ECCV (1) ICCV (1) ICML (1) IJCNLP (1)

Top co-authors

Shinji Watanabe (8) Scott Wisdom (6) Takaaki Hori (5) Jonathan Le Roux (5) Hakan Erdogan (5) Efthymios Tzinis (3) Kevin Wilson (3) Hannah Muckenhirn (2) Mark Hamilton (2) Hiroshi Seki (2)

Keywords

speech recognition (6) neural network (5) speech separation (3) source separation (3) speaker separation (3) speech enhancement (3) sound separation (3) attention mechanism (2) multi-speaker recognition (2) end-to-end learning (2) automatic speech recognition (2) speaker embedding (2) speech synthesis (1) sequence labeling (1) self-supervised learning (1) domain adaptation (1) contrastive learning (1) factorial dynamics (1) spoken language understanding (1) semi-supervised learning (1)

Papers

I-Con: A Unifying Framework for Representation Learning ICLR 2025 Unsupervised Improved MVDR Beamforming for Sound Enhancement INTERSPEECH 2024 Separating the "Chirp" from the "Chat": Self-supervised Visual Grounding of Sound and Language CVPR 2024 TokenSplit: Using Discrete Speech Representations for Direct, Refined, and Transcript-Conditioned Speech Separation and Recognition INTERSPEECH 2023 Distance-Based Sound Separation INTERSPEECH 2022 AudioScopeV2: Audio-Visual Attention Architectures for Calibrated Open-Domain On-Screen Sound Separation ECCV 2022 CycleGAN-based Unpaired Speech Dereverberation INTERSPEECH 2022 Continuous Speech Separation Using Speaker Inventory for Long Recording INTERSPEECH 2021 Into the Wild with AudioScope: Unsupervised Audio-Visual Separation of On-Screen Sounds ICLR 2021 Unsupervised Sound Separation Using Mixture Invariant Training NIPS 2020 VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking INTERSPEECH 2019 End-to-End Multilingual Multi-Speaker Speech Recognition INTERSPEECH 2019 A Purely End-to-End System for Multi-speaker Speech Recognition ACL 2018 Multichannel End-to-end Speech Recognition ICML 2017 Attention-Based Multimodal Fusion for Video Description ICCV 2017 Single-Channel Multi-Speaker Separation Using Deep Clustering INTERSPEECH 2016 Improved MVDR Beamforming Using Single-Channel Mask Prediction Networks INTERSPEECH 2016 Context-Sensitive and Role-Dependent Spoken Language Understanding Using Bidirectional and Attention LSTMs INTERSPEECH 2016 Statistical Dialogue Management using Intention Dependency Graph IJCNLP 2013 Single Channel Speech Separation Using Factorial Dynamics NIPS 2006