Jonathan Le Roux
30 papers · 2013–2024 · 7 top CS/AI conferences
Achievements
Keyword Pioneer
Interdisciplinary Bridge
Taxonomy Completionist (21)
Renaissance Researcher (5)
Conference Polyglot (7)
Keyword Trendsetter Combo (5)
Conference Loyalist (23)
Keyword Champion (2)
Dynamic Duo (11)
Topic Pioneer
Deep Specialist (10)
Topic Evolution
Prolific Year (6)
Unstoppable (9)
Keyword Collector (63)
Century Club (30)
Conference Pioneer
Trend Setter
Conferences
INTERSPEECH (23)
AAAI (2)
ACL (1)
CVPR (1)
ICCV (1)
IJCNLP (1)
NIPS (1)
Keywords
automatic speech recognition (8)
source separation (6)
neural network (5)
end-to-end speech recognition (4)
speech separation (4)
multimodal learning (4)
speech recognition (3)
connectionist temporal classification (3)
speech enhancement (3)
multi-speaker recognition (2)
multi-task learning (2)
speaker separation (2)
video captioning (2)
end-to-end system (2)
long-context speech recognition (2)
multi-modal learning (2)
video question answering (2)
scene graph (2)
zero-shot learning (2)
video understanding (2)
Papers
Enhanced Reverberation as Supervision for Unsupervised Speech Separation
INTERSPEECH 2024
ZeroST: Zero-Shot Speech Translation
INTERSPEECH 2024
Sound Event Bounding Boxes
INTERSPEECH 2024
Speech dereverberation constrained on room impulse response characteristics
INTERSPEECH 2024
RILA: Reflective and Imaginative Language Agent for Zero-Shot Semantic Audio-Visual Navigation
CVPR 2024
PARIS: Pseudo-AutoRegressIve Siamese Training for Online Speech Separation
INTERSPEECH 2024
Style-transfer based Speech and Audio-visual Scene understanding for Robot Action Sequence Acquisition from Videos
INTERSPEECH 2023
(2.5+1)D Spatio-Temporal Scene Graphs for Video Question Answering
AAAI 2022
Heterogeneous Target Speech Separation
INTERSPEECH 2022
Low-Latency Online Streaming VideoQA Using Audio-Visual Transformers
INTERSPEECH 2022
Momentum Pseudo-Labeling for Semi-Supervised Speech Recognition
INTERSPEECH 2021
Optimizing Latency for Online Video Captioning Using Audio-Visual Transformers
INTERSPEECH 2021
Visual Scene Graphs for Audio Source Separation
ICCV 2021
Advanced Long-Context End-to-End Speech Recognition Using Context-Expanded Transformers
INTERSPEECH 2021
Dual Causal/Non-Causal Self-Attention for Streaming End-to-End Speech Recognition
INTERSPEECH 2021
Dynamic Graph Representation Learning for Video Dialog via Multi-Modal Shuffled Transformers
AAAI 2021
All-in-One Transformer: Unifying Speech Recognition, Audio Tagging, and Event Detection
INTERSPEECH 2020
Detecting Audio Attacks on ASR Systems with Dropout Uncertainty
INTERSPEECH 2020
Transformer-Based Long-Context End-to-End Speech Recognition
INTERSPEECH 2020
Unidirectional Neural Network Architectures for End-to-End Automatic Speech Recognition
INTERSPEECH 2019
Vectorized Beam Search for CTC-Attention-Based Speech Recognition
INTERSPEECH 2019
End-to-End Multilingual Multi-Speaker Speech Recognition
INTERSPEECH 2019
WHAM!: Extending Speech Separation to Noisy Environments
INTERSPEECH 2019
A Purely End-to-End System for Multi-speaker Speech Recognition
ACL 2018
End-to-End Speech Separation with Unfolded Iterative Phase Reconstruction
INTERSPEECH 2018
Coupled Initialization of Multi-Channel Non-Negative Matrix Factorization Based on Spatial and Spectral Information
INTERSPEECH 2017
Improved MVDR Beamforming Using Single-Channel Mask Prediction Networks
INTERSPEECH 2016
Single-Channel Multi-Speaker Separation Using Deep Clustering
INTERSPEECH 2016
Full-Capacity Unitary Recurrent Neural Networks
NIPS 2016
Statistical Dialogue Management using Intention Dependency Graph
IJCNLP 2013