Marc Delcroix
47 papers · 2016–2024 · 1 conference · across top CS/AI conferences
Achievements
Academic Marathon (8)
Taxonomy Completionist (17)
Keyword Pioneer
Interdisciplinary Bridge
Hot Topic Early Bird
Renaissance Researcher (6)
Conference Loyalist (47)
Dynamic Duo (22)
Topic Evolution
Deep Specialist (16)
Keyword Champion (7)
Unstoppable (9)
Conference Pioneer
Prolific Year (5)
The Questioner (4)
Trend Setter
Century Club (47)
Keyword Collector (50)
Conferences
INTERSPEECH (47)
Keywords
speech enhancement (10)
automatic speech recognition (9)
source separation (7)
neural network (7)
word error rate (6)
speech recognition (6)
acoustic model (5)
speaker separation (5)
target speech extraction (5)
speech separation (5)
end-to-end speech recognition (4)
attention mechanism (4)
recurrent neural network (3)
deep neural network (3)
noise robustness (3)
acoustic modeling (3)
speaker adaptation (3)
knowledge distillation (3)
speaker embedding (3)
speaker diarization (3)
Papers
Array Geometry-Robust Attention-Based Neural Beamformer for Moving Speakers
INTERSPEECH 2024
Sentence-wise Speech Summarization: Task, Datasets, and End-to-End Modeling with LM Knowledge Distillation
INTERSPEECH 2024
Lightweight Zero-shot Text-to-Speech with Mixture of Adapters
INTERSPEECH 2024
SpeakerBeam-SS: Real-time Target Speaker Extraction with Lightweight Conv-TasNet and State Space Modeling
INTERSPEECH 2024
Knowledge Distillation for Neural Transducer-based Target-Speaker ASR: Exploiting Parallel Mixture/Single-Talker Speech Data
INTERSPEECH 2023
Target Speech Extraction with Conditional Diffusion Model
INTERSPEECH 2023
Downstream Task Agnostic Speech Enhancement with Self-Supervised Representation Loss
INTERSPEECH 2023
Multi-Stream Extension of Variational Bayesian HMM Clustering (MS-VBx) for Combined End-to-End and Vector Clustering-based Diarization
INTERSPEECH 2023
Transfer Learning from Pre-trained Language Models Improves End-to-End Speech Summarization
INTERSPEECH 2023
SpeechGLUE: How Well Can Self-Supervised Speech Models Capture Linguistic Knowledge?
INTERSPEECH 2023
Strategies to Improve Robustness of Target Speech Extraction to Enrollment Variations
INTERSPEECH 2022
How bad are artifacts?: Analyzing the impact of speech enhancement errors on ASR
INTERSPEECH 2022
Revisiting joint decoding based multi-talker speech recognition with DNN acoustic model
INTERSPEECH 2022
Streaming Target-Speaker ASR with Neural Transducer
INTERSPEECH 2022
Utterance-by-utterance overlap-aware neural diarization with Graph-PIT
INTERSPEECH 2022
Listen only to me! How well can target speech extraction handle false alarms?
INTERSPEECH 2022
Graph-PIT: Generalized Permutation Invariant Training for Continuous Separation of Arbitrary Numbers of Speakers
INTERSPEECH 2021
Should We Always Separate?: Switching Between Enhanced and Observed Signals for Overlapping Speech Recognition
INTERSPEECH 2021
Auxiliary Loss Function for Target Speech Extraction and Recognition with Weak Supervision Based on Speaker Characteristics
INTERSPEECH 2021
Streaming End-to-End Speech Recognition for Hybrid RNN-T/Attention Architecture
INTERSPEECH 2021
PILOT: Introducing Transformers for Probabilistic Sound Event Localization
INTERSPEECH 2021
Continuous Speech Separation Using Speaker Inventory for Long Recording
INTERSPEECH 2021
Few-Shot Learning of New Sound Classes for Target Sound Extraction
INTERSPEECH 2021
Advances in Integration of End-to-End Neural and Clustering-Based Diarization for Real Conversational Speech
INTERSPEECH 2021
Listen to What You Want: Neural Network-Based Universal Sound Selector
INTERSPEECH 2020
Multi-Path RNN for Hierarchical Modeling of Long Sequential Data and its Application to Speaker Stream Separation
INTERSPEECH 2020
Multi-Talker ASR for an Unknown Number of Sources: Joint Training of Source Counting, Separation and ASR
INTERSPEECH 2020
Language Model Data Augmentation Based on Text Domain Transfer
INTERSPEECH 2020
Self-Distillation for Improving CTC-Transformer-Based ASR Systems
INTERSPEECH 2020
Improving Transformer-Based End-to-End Speech Recognition with Connectionist Temporal Classification and Language Model Integration
INTERSPEECH 2019
Improved Deep Duel Model for Rescoring N-Best Speech Recognition List Using Backward LSTMLM and Ensemble Encoders
INTERSPEECH 2019
Multimodal SpeakerBeam: Single Channel Target Speech Extraction with Audio-Visual Speaker Clues
INTERSPEECH 2019
End-to-End SpeakerBeam for Single Channel Target Speech Recognition
INTERSPEECH 2019
Multi-task Learning with Augmentation Strategy for Acoustic-to-word Attention-based Encoder-decoder Speech Recognition
INTERSPEECH 2018
Integrating Neural Network Based Beamforming and Weighted Prediction Error Dereverberation
INTERSPEECH 2018
Auxiliary Feature Based Adaptation of End-to-end ASR Systems
INTERSPEECH 2018
Semi-Supervised End-to-End Speech Recognition
INTERSPEECH 2018
Improved Example-Based Speech Enhancement by Using Deep Neural Network Acoustic Model for Noise Robust Example Search
INTERSPEECH 2017
Unfolded Deep Recurrent Convolutional Neural Network with Jump Ahead Connections for Acoustic Modeling
INTERSPEECH 2017
Deep Clustering-Based Beamforming for Separation with Unknown Number of Sources
INTERSPEECH 2017
Neural Network-Based Spectrum Estimation for Online WPE Dereverberation
INTERSPEECH 2017
Uncertainty Decoding with Adaptive Sampling for Noise Robust DNN-Based Acoustic Modeling
INTERSPEECH 2017
Speaker-Aware Neural Network Based Beamformer for Speaker Extraction in Speech Mixtures
INTERSPEECH 2017
Forward-Backward Convolutional LSTM for Acoustic Modeling
INTERSPEECH 2017
Robust Example Search Using Bottleneck Features for Example-Based Speech Enhancement
INTERSPEECH 2016
Context Adaptive Neural Network for Rapid Adaptation of Deep CNN Based Acoustic Models
INTERSPEECH 2016
Data Selection by Sequence Summarizing Neural Network in Mismatch Condition Training
INTERSPEECH 2016