Papers
MAE-AST: Masked Autoencoding Audio Spectrogram Transformer
INTERSPEECH 2022
UTMOS: UTokyo-SaruLab System for VoiceMOS Challenge 2022
INTERSPEECH 2022
Wav2vec behind the Scenes: How end2end Models learn Phonetics
INTERSPEECH 2022
A BERT-based Language Modeling Framework
INTERSPEECH 2022