Mirco Ravanelli

24 papers · 2016–2024 · 8 conferences · across top CS/AI conferences

Achievements

+13 more ↓

🌍 Conference Polyglot (8) 🧭 Keyword Pioneer 🗺️ Taxonomy Completionist (10) 🌉 Interdisciplinary Bridge 🏃 Academic Marathon (8)

🏃 Academic Marathon (8) 🐝 Cross-Pollinator (10) 🌈 Renaissance Researcher (5) 👑 Triple Crown 👥 Mega-Team (34) 🧬 Topic Evolution 🚀 Conference Pioneer 🔥 Unstoppable (9) 📈 Trend Setter 💎 Century Club (24) 🗃️ Keyword Collector (99) ❓ The Questioner (3) ⚡ Prolific Year (9)

Conferences

INTERSPEECH (16) ICLR (2) COLING (1) CVPR (1) EMNLP (1) ICML (1) JMLR (1) NIPS (1)

Top co-authors

Cem Subakan (7) Yoshua Bengio (6) Titouan Parcollet (5) Salah Zaiem (4) Francesco Paissan (4) Luca Della Libera (4) Artem Ploujnikov (3) Renato de Mori (3) Maurizio Omologo (2) Cheng Yu (2)

Keywords

recurrent neural network (3) speech enhancement (3) speech recognition (3) self-supervised learning (3) spoken language understanding (2) automatic speech recognition (2) large language model (2) long short-term memory (2) generative adversarial network (2) distant speech recognition (2) speaker embedding (2) acoustic model (2) representation learning (2) speech synthesis (1) neural network training (1) curriculum learning (1) zero-shot learning (1) one-shot learning (1) style transfer (1) neural network optimization (1)

Papers

Listenable Maps for Zero-Shot Audio Classifiers NIPS 2024 TARIC-SLU: A Tunisian Benchmark Dataset for Spoken Language Understanding COLING 2024 Adaptation Odyssey in LLMs: Why Does Additional Pretraining Sometimes Fail to Improve? EMNLP 2024 Towards Foundational Models for Molecular Learning on Large-Scale Multi-Task Datasets ICLR 2024 Listenable Maps for Audio Classifiers ICML 2024 How Should We Extract Discrete Audio Tokens from Self-Supervised Models? INTERSPEECH 2024 Audio Editing with Non-Rigid Text Prompts INTERSPEECH 2024 Phoneme Discretized Saliency Maps for Explainable Detection of AI-Generated Voice INTERSPEECH 2024 Open-Source Conversational AI with SpeechBrain 1.0 JMLR 2024 Simulated Annealing in Early Layers Leads to Better Generalization CVPR 2023 Speech Self-Supervised Representation Benchmarking: Are We Doing it Right? INTERSPEECH 2023 OSSEM: one-shot speaker adaptive speech enhancement using meta learning INTERSPEECH 2022 SoundChoice: Grapheme-to-Phoneme Models with Semantic Disambiguation INTERSPEECH 2022 MetricGAN+: An Improved Version of MetricGAN for Speech Enhancement INTERSPEECH 2021 ECAPA-TDNN Embeddings for Speaker Diarization INTERSPEECH 2021 The Energy and Carbon Footprint of Training End-to-End Speech Recognizers INTERSPEECH 2021 Quaternion Neural Networks for Multi-Channel Distant Speech Recognition INTERSPEECH 2020 Learning Speaker Representations with Mutual Information INTERSPEECH 2019 Speech Model Pre-Training for End-to-End Spoken Language Understanding INTERSPEECH 2019 Learning Problem-Agnostic Speech Representations from Multiple Self-Supervised Tasks INTERSPEECH 2019 Quaternion Recurrent Neural Networks ICLR 2019 Twin Regularization for Online Speech Recognition INTERSPEECH 2018 Improving Speech Recognition by Revising Gated Recurrent Units INTERSPEECH 2017 Realistic Multi-Microphone Data Simulation for Distant Speech Recognition INTERSPEECH 2016