John R. Hershey
20 papers · 2006–2025 · 9 top CS/AI conferences
Achievements
Hot Topic Early Bird
Interdisciplinary Bridge
Keyword Pioneer
Taxonomy Completionist (12)
Conference Polyglot (9)
Renaissance Researcher (5)
Keyword Trendsetter Combo (3)
Keyword Champion
Topic Evolution
Topic Pioneer
Keyword Collector (76)
Trend Setter
Century Club (20)
Unstoppable (10)
Conference Pioneer
Conferences
INTERSPEECH (10)
ICLR (2)
NIPS (2)
ACL (1)
CVPR (1)
ECCV (1)
ICCV (1)
ICML (1)
IJCNLP (1)
Keywords
speech recognition (6)
neural network (5)
speech separation (3)
source separation (3)
speaker separation (3)
speech enhancement (3)
sound separation (3)
attention mechanism (2)
multi-speaker recognition (2)
end-to-end learning (2)
automatic speech recognition (2)
speaker embedding (2)
speech synthesis (1)
sequence labeling (1)
self-supervised learning (1)
domain adaptation (1)
contrastive learning (1)
factorial dynamics (1)
spoken language understanding (1)
semi-supervised learning (1)
Papers
I-Con: A Unifying Framework for Representation Learning
ICLR 2025
Unsupervised Improved MVDR Beamforming for Sound Enhancement
INTERSPEECH 2024
Separating the "Chirp" from the "Chat": Self-supervised Visual Grounding of Sound and Language
CVPR 2024
TokenSplit: Using Discrete Speech Representations for Direct, Refined, and Transcript-Conditioned Speech Separation and Recognition
INTERSPEECH 2023
Distance-Based Sound Separation
INTERSPEECH 2022
AudioScopeV2: Audio-Visual Attention Architectures for Calibrated Open-Domain On-Screen Sound Separation
ECCV 2022
CycleGAN-based Unpaired Speech Dereverberation
INTERSPEECH 2022
Continuous Speech Separation Using Speaker Inventory for Long Recording
INTERSPEECH 2021
Into the Wild with AudioScope: Unsupervised Audio-Visual Separation of On-Screen Sounds
ICLR 2021
Unsupervised Sound Separation Using Mixture Invariant Training
NIPS 2020
VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking
INTERSPEECH 2019
End-to-End Multilingual Multi-Speaker Speech Recognition
INTERSPEECH 2019
A Purely End-to-End System for Multi-speaker Speech Recognition
ACL 2018
Multichannel End-to-end Speech Recognition
ICML 2017
Attention-Based Multimodal Fusion for Video Description
ICCV 2017
Single-Channel Multi-Speaker Separation Using Deep Clustering
INTERSPEECH 2016
Improved MVDR Beamforming Using Single-Channel Mask Prediction Networks
INTERSPEECH 2016
Context-Sensitive and Role-Dependent Spoken Language Understanding Using Bidirectional and Attention LSTMs
INTERSPEECH 2016
Statistical Dialogue Management using Intention Dependency Graph
IJCNLP 2013
Single Channel Speech Separation Using Factorial Dynamics
NIPS 2006