Marc Delcroix

47 papers · 2016–2024 · 1 conference · across top CS/AI conferences

Achievements

+14 more ↓

🏃 Academic Marathon (8) 🗺️ Taxonomy Completionist (17) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🐣 Hot Topic Early Bird

🌈 Renaissance Researcher (6) 🗺️ Taxonomy Completionist (17) 🏠 Conference Loyalist (47) 🤝 Dynamic Duo (22) 🧬 Topic Evolution 🔬 Deep Specialist (16) 🏆 Keyword Champion (7) 🔥 Unstoppable (9) 🚀 Conference Pioneer ⚡ Prolific Year (5) ❓ The Questioner (4) 📈 Trend Setter 💎 Century Club (47) 🗃️ Keyword Collector (50)

Conferences

INTERSPEECH (47)

Top co-authors

Tomohiro Nakatani (22) Keisuke Kinoshita (21) Atsunori Ogawa (18) Tsubasa Ochiai (17) Takafumi Moriya (12) Hiroshi Sato (10) Takanori Ashihara (9) Shigeki Karita (8) Shinji Watanabe (7) Tomohiro Tanaka (7)

Keywords

speech enhancement (10) automatic speech recognition (9) source separation (7) neural network (7) word error rate (6) speech recognition (6) acoustic model (5) speaker separation (5) target speech extraction (5) speech separation (5) end-to-end speech recognition (4) attention mechanism (4) recurrent neural network (3) deep neural network (3) noise robustness (3) acoustic modeling (3) speaker adaptation (3) knowledge distillation (3) speaker embedding (3) speaker diarization (3)

Papers

Array Geometry-Robust Attention-Based Neural Beamformer for Moving Speakers INTERSPEECH 2024 Sentence-wise Speech Summarization: Task, Datasets, and End-to-End Modeling with LM Knowledge Distillation INTERSPEECH 2024 Lightweight Zero-shot Text-to-Speech with Mixture of Adapters INTERSPEECH 2024 SpeakerBeam-SS: Real-time Target Speaker Extraction with Lightweight Conv-TasNet and State Space Modeling INTERSPEECH 2024 Knowledge Distillation for Neural Transducer-based Target-Speaker ASR: Exploiting Parallel Mixture/Single-Talker Speech Data INTERSPEECH 2023 Target Speech Extraction with Conditional Diffusion Model INTERSPEECH 2023 Downstream Task Agnostic Speech Enhancement with Self-Supervised Representation Loss INTERSPEECH 2023 Multi-Stream Extension of Variational Bayesian HMM Clustering (MS-VBx) for Combined End-to-End and Vector Clustering-based Diarization INTERSPEECH 2023 Transfer Learning from Pre-trained Language Models Improves End-to-End Speech Summarization INTERSPEECH 2023 SpeechGLUE: How Well Can Self-Supervised Speech Models Capture Linguistic Knowledge? INTERSPEECH 2023 Strategies to Improve Robustness of Target Speech Extraction to Enrollment Variations INTERSPEECH 2022 How bad are artifacts?: Analyzing the impact of speech enhancement errors on ASR INTERSPEECH 2022 Revisiting joint decoding based multi-talker speech recognition with DNN acoustic model INTERSPEECH 2022 Streaming Target-Speaker ASR with Neural Transducer INTERSPEECH 2022 Utterance-by-utterance overlap-aware neural diarization with Graph-PIT INTERSPEECH 2022 Listen only to me! How well can target speech extraction handle false alarms? INTERSPEECH 2022 Graph-PIT: Generalized Permutation Invariant Training for Continuous Separation of Arbitrary Numbers of Speakers INTERSPEECH 2021 Should We Always Separate?: Switching Between Enhanced and Observed Signals for Overlapping Speech Recognition INTERSPEECH 2021 Auxiliary Loss Function for Target Speech Extraction and Recognition with Weak Supervision Based on Speaker Characteristics INTERSPEECH 2021 Streaming End-to-End Speech Recognition for Hybrid RNN-T/Attention Architecture INTERSPEECH 2021 PILOT: Introducing Transformers for Probabilistic Sound Event Localization INTERSPEECH 2021 Continuous Speech Separation Using Speaker Inventory for Long Recording INTERSPEECH 2021 Few-Shot Learning of New Sound Classes for Target Sound Extraction INTERSPEECH 2021 Advances in Integration of End-to-End Neural and Clustering-Based Diarization for Real Conversational Speech INTERSPEECH 2021 Listen to What You Want: Neural Network-Based Universal Sound Selector INTERSPEECH 2020 Multi-Path RNN for Hierarchical Modeling of Long Sequential Data and its Application to Speaker Stream Separation INTERSPEECH 2020 Multi-Talker ASR for an Unknown Number of Sources: Joint Training of Source Counting, Separation and ASR INTERSPEECH 2020 Language Model Data Augmentation Based on Text Domain Transfer INTERSPEECH 2020 Self-Distillation for Improving CTC-Transformer-Based ASR Systems INTERSPEECH 2020 Improving Transformer-Based End-to-End Speech Recognition with Connectionist Temporal Classification and Language Model Integration INTERSPEECH 2019 Improved Deep Duel Model for Rescoring N-Best Speech Recognition List Using Backward LSTMLM and Ensemble Encoders INTERSPEECH 2019 Multimodal SpeakerBeam: Single Channel Target Speech Extraction with Audio-Visual Speaker Clues INTERSPEECH 2019 End-to-End SpeakerBeam for Single Channel Target Speech Recognition INTERSPEECH 2019 Multi-task Learning with Augmentation Strategy for Acoustic-to-word Attention-based Encoder-decoder Speech Recognition INTERSPEECH 2018 Integrating Neural Network Based Beamforming and Weighted Prediction Error Dereverberation INTERSPEECH 2018 Auxiliary Feature Based Adaptation of End-to-end ASR Systems INTERSPEECH 2018 Semi-Supervised End-to-End Speech Recognition INTERSPEECH 2018 Improved Example-Based Speech Enhancement by Using Deep Neural Network Acoustic Model for Noise Robust Example Search INTERSPEECH 2017 Unfolded Deep Recurrent Convolutional Neural Network with Jump Ahead Connections for Acoustic Modeling INTERSPEECH 2017 Deep Clustering-Based Beamforming for Separation with Unknown Number of Sources INTERSPEECH 2017 Neural Network-Based Spectrum Estimation for Online WPE Dereverberation INTERSPEECH 2017 Uncertainty Decoding with Adaptive Sampling for Noise Robust DNN-Based Acoustic Modeling INTERSPEECH 2017 Speaker-Aware Neural Network Based Beamformer for Speaker Extraction in Speech Mixtures INTERSPEECH 2017 Forward-Backward Convolutional LSTM for Acoustic Modeling INTERSPEECH 2017 Robust Example Search Using Bottleneck Features for Example-Based Speech Enhancement INTERSPEECH 2016 Context Adaptive Neural Network for Rapid Adaptation of Deep CNN Based Acoustic Models INTERSPEECH 2016 Data Selection by Sequence Summarizing Neural Network in Mismatch Condition Training INTERSPEECH 2016