Alexei Baevski
30 papers · 2019–2024 · 9 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+12 more ↓ Show less ↑
π£ Hot Topic Early Bird π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (11) π§ Keyword Pioneer π Conference Polyglot (9)
π£
Hot Topic Early Bird
π
Cross-Pollinator
(10)
π
Conference Polyglot
(9)
π€
Dynamic Duo
(29)
π±
Topic Pioneer
π¬
Deep Specialist
(16)
π§¬
Topic Evolution
β‘
Prolific Year
(8)
ποΈ
Keyword Collector
(101)
π₯
Unstoppable
(6)
π
Century Club
(30)
π
Trend Setter
Conferences
INTERSPEECH (9)
ACL (5)
ICLR (3)
IJCNLP (3)
NIPS (3)
EMNLP (2)
ICML (2)
NAACL (2)
JMLR (1)
Top co-authors
Keywords
self-supervised learning
(14)
speech recognition
(11)
representation learning
(5)
multilingual model
(5)
wav2vec 2.0
(5)
speech translation
(4)
machine translation
(3)
named entity recognition
(3)
language modeling
(3)
zero-shot learning
(3)
masked autoencoder
(2)
pretrained model
(2)
transfer learning
(2)
contrastive learning
(2)
phoneme recognition
(2)
language model
(2)
automatic speech recognition
(2)
neural machine translation
(2)
spoken language understanding
(2)
sequence modeling
(2)
Papers
Scaling Speech Technology to 1,000+ Languages
JMLR 2024
Toward Joint Language Modeling for Speech Units and Text
EMNLP 2023
Introducing Semantics into Speech Encoders
ACL 2023
Efficient Self-supervised Learning with Contextualized Target Representations for Vision, Speech and Language
ICML 2023
Simple and Effective Unsupervised Speech Synthesis
INTERSPEECH 2022
Masked Autoencoders that Listen
NIPS 2022
Simple and Effective Zero-shot Cross-lingual Phoneme Recognition
INTERSPEECH 2022
Unified Speech-Text Pre-training for Speech Translation and Recognition
ACL 2022
XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale
INTERSPEECH 2022
data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language
ICML 2022
On-demand compute reduction with stochastic wav2vec 2.0
INTERSPEECH 2022
Wav2Vec-Aug: Improved self-supervised training with limited data
INTERSPEECH 2022
Unsupervised Cross-Lingual Representation Learning for Speech Recognition
INTERSPEECH 2021
Unsupervised Speech Recognition
NIPS 2021
Multilingual Speech Translation from Efficient Finetuning of Pretrained Models
ACL 2021
Reservoir Transformers
ACL 2021
Multilingual Speech Translation from Efficient Finetuning of Pretrained Models
IJCNLP 2021
Reservoir Transformers
IJCNLP 2021
Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training
INTERSPEECH 2021
Large-Scale Self- and Semi-Supervised Learning for Speech Translation
INTERSPEECH 2021
vq-wav2vec: Self-Supervised Learning of Discrete Speech Representations
ICLR 2020
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations
NIPS 2020
fairseq: A Fast, Extensible Toolkit for Sequence Modeling
NAACL 2019
Facebook FAIRβs WMT19 News Translation Task Submission
ACL 2019
Cloze-driven Pretraining of Self-attention Networks
EMNLP 2019
Adaptive Input Representations for Neural Language Modeling
ICLR 2019
Pay Less Attention with Lightweight and Dynamic Convolutions
ICLR 2019
Cloze-driven Pretraining of Self-attention Networks
IJCNLP 2019
wav2vec: Unsupervised Pre-Training for Speech Recognition
INTERSPEECH 2019
Pre-trained language model representations for language generation
NAACL 2019