Mirco Ravanelli
24 papers · 2016–2024 · 8 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+13 more ↓ Show less ↑
π Conference Polyglot (8) π§ Keyword Pioneer πΊοΈ Taxonomy Completionist (10) π Interdisciplinary Bridge π Academic Marathon (8)
π
Academic Marathon
(8)
π
Cross-Pollinator
(10)
π
Renaissance Researcher
(5)
π
Triple Crown
π₯
Mega-Team
(34)
π§¬
Topic Evolution
π
Conference Pioneer
π₯
Unstoppable
(9)
π
Trend Setter
π
Century Club
(24)
ποΈ
Keyword Collector
(99)
β
The Questioner
(3)
β‘
Prolific Year
(9)
Conferences
INTERSPEECH (16)
ICLR (2)
COLING (1)
CVPR (1)
EMNLP (1)
ICML (1)
JMLR (1)
NIPS (1)
Top co-authors
Keywords
recurrent neural network
(3)
speech enhancement
(3)
speech recognition
(3)
self-supervised learning
(3)
spoken language understanding
(2)
automatic speech recognition
(2)
large language model
(2)
long short-term memory
(2)
generative adversarial network
(2)
distant speech recognition
(2)
speaker embedding
(2)
acoustic model
(2)
representation learning
(2)
speech synthesis
(1)
neural network training
(1)
curriculum learning
(1)
zero-shot learning
(1)
one-shot learning
(1)
style transfer
(1)
neural network optimization
(1)
Papers
Listenable Maps for Zero-Shot Audio Classifiers
NIPS 2024
TARIC-SLU: A Tunisian Benchmark Dataset for Spoken Language Understanding
COLING 2024
Adaptation Odyssey in LLMs: Why Does Additional Pretraining Sometimes Fail to Improve?
EMNLP 2024
Towards Foundational Models for Molecular Learning on Large-Scale Multi-Task Datasets
ICLR 2024
Listenable Maps for Audio Classifiers
ICML 2024
How Should We Extract Discrete Audio Tokens from Self-Supervised Models?
INTERSPEECH 2024
Audio Editing with Non-Rigid Text Prompts
INTERSPEECH 2024
Phoneme Discretized Saliency Maps for Explainable Detection of AI-Generated Voice
INTERSPEECH 2024
Open-Source Conversational AI with SpeechBrain 1.0
JMLR 2024
Simulated Annealing in Early Layers Leads to Better Generalization
CVPR 2023
Speech Self-Supervised Representation Benchmarking: Are We Doing it Right?
INTERSPEECH 2023
OSSEM: one-shot speaker adaptive speech enhancement using meta learning
INTERSPEECH 2022
SoundChoice: Grapheme-to-Phoneme Models with Semantic Disambiguation
INTERSPEECH 2022
MetricGAN+: An Improved Version of MetricGAN for Speech Enhancement
INTERSPEECH 2021
ECAPA-TDNN Embeddings for Speaker Diarization
INTERSPEECH 2021
The Energy and Carbon Footprint of Training End-to-End Speech Recognizers
INTERSPEECH 2021
Quaternion Neural Networks for Multi-Channel Distant Speech Recognition
INTERSPEECH 2020
Learning Speaker Representations with Mutual Information
INTERSPEECH 2019
Speech Model Pre-Training for End-to-End Spoken Language Understanding
INTERSPEECH 2019
Learning Problem-Agnostic Speech Representations from Multiple Self-Supervised Tasks
INTERSPEECH 2019
Quaternion Recurrent Neural Networks
ICLR 2019
Twin Regularization for Online Speech Recognition
INTERSPEECH 2018
Improving Speech Recognition by Revising Gated Recurrent Units
INTERSPEECH 2017
Realistic Multi-Microphone Data Simulation for Distant Speech Recognition
INTERSPEECH 2016