Alexei Baevski

30 papers · 2019–2024 · 9 conferences · across top CS/AI conferences

Achievements

+12 more ↓

🐣 Hot Topic Early Bird 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (11) 🧭 Keyword Pioneer 🌍 Conference Polyglot (9)

🐣 Hot Topic Early Bird 🐝 Cross-Pollinator (10) 🌍 Conference Polyglot (9) 🤝 Dynamic Duo (29) 🌱 Topic Pioneer 🔬 Deep Specialist (16) 🧬 Topic Evolution ⚡ Prolific Year (8) 🗃️ Keyword Collector (101) 🔥 Unstoppable (6) 💎 Century Club (30) 📈 Trend Setter

Conferences

INTERSPEECH (9) ACL (5) ICLR (3) IJCNLP (3) NIPS (3) EMNLP (2) ICML (2) NAACL (2) JMLR (1)

Top co-authors

Michael Auli (29) Wei-Ning Hsu (9) Alexis CONNEAU (8) Changhan Wang (6) Arun Babu (5) Juan Pino (5) Sergey Edunov (5) Qiantong Xu (4) Yun Tang (3) Abdelrahman Mohamed (3)

Keywords

self-supervised learning (14) speech recognition (11) representation learning (5) multilingual model (5) wav2vec 2.0 (5) speech translation (4) machine translation (3) named entity recognition (3) language modeling (3) zero-shot learning (3) masked autoencoder (2) pretrained model (2) transfer learning (2) contrastive learning (2) phoneme recognition (2) language model (2) automatic speech recognition (2) neural machine translation (2) spoken language understanding (2) sequence modeling (2)

Papers

Scaling Speech Technology to 1,000+ Languages JMLR 2024 Toward Joint Language Modeling for Speech Units and Text EMNLP 2023 Introducing Semantics into Speech Encoders ACL 2023 Efficient Self-supervised Learning with Contextualized Target Representations for Vision, Speech and Language ICML 2023 Simple and Effective Unsupervised Speech Synthesis INTERSPEECH 2022 Masked Autoencoders that Listen NIPS 2022 Simple and Effective Zero-shot Cross-lingual Phoneme Recognition INTERSPEECH 2022 Unified Speech-Text Pre-training for Speech Translation and Recognition ACL 2022 XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale INTERSPEECH 2022 data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language ICML 2022 On-demand compute reduction with stochastic wav2vec 2.0 INTERSPEECH 2022 Wav2Vec-Aug: Improved self-supervised training with limited data INTERSPEECH 2022 Unsupervised Cross-Lingual Representation Learning for Speech Recognition INTERSPEECH 2021 Unsupervised Speech Recognition NIPS 2021 Multilingual Speech Translation from Efficient Finetuning of Pretrained Models ACL 2021 Reservoir Transformers ACL 2021 Multilingual Speech Translation from Efficient Finetuning of Pretrained Models IJCNLP 2021 Reservoir Transformers IJCNLP 2021 Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training INTERSPEECH 2021 Large-Scale Self- and Semi-Supervised Learning for Speech Translation INTERSPEECH 2021 vq-wav2vec: Self-Supervised Learning of Discrete Speech Representations ICLR 2020 wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations NIPS 2020 fairseq: A Fast, Extensible Toolkit for Sequence Modeling NAACL 2019 Facebook FAIR’s WMT19 News Translation Task Submission ACL 2019 Cloze-driven Pretraining of Self-attention Networks EMNLP 2019 Adaptive Input Representations for Neural Language Modeling ICLR 2019 Pay Less Attention with Lightweight and Dynamic Convolutions ICLR 2019 Cloze-driven Pretraining of Self-attention Networks IJCNLP 2019 wav2vec: Unsupervised Pre-Training for Speech Recognition INTERSPEECH 2019 Pre-trained language model representations for language generation NAACL 2019