Papers
Event-related data conditioning for acoustic event classification
INTERSPEECH 2022
A compact transformer-based GAN vocoder
INTERSPEECH 2022
Streaming Align-Refine for Non-autoregressive Deliberation
INTERSPEECH 2022
Context-aware Multimodal Fusion for Emotion Recognition
INTERSPEECH 2022
On the Prediction Network Architecture in RNN-T for ASR
INTERSPEECH 2022
MAE-AST: Masked Autoencoding Audio Spectrogram Transformer
INTERSPEECH 2022