Andros Tjandra
14 papers · 2017–2024 · 5 conferences · across top CS/AI conferences
Achievements
Taxonomy Completionist (18)
Keyword Pioneer
Hot Topic Early Bird
Academic Marathon (7)
Cross-Pollinator (14)
Interdisciplinary Bridge
Conference Polyglot (5)
Topic Evolution
Century Club (14)
Unstoppable (6)
Conferences
INTERSPEECH (10)
ICLR (1)
ICML (1)
IJCNLP (1)
JMLR (1)
Keywords
automatic speech recognition (6)
speech recognition (4)
unsupervised learning (3)
text-to-speech synthesis (3)
speech synthesis (3)
attention mechanism (2)
self-supervised learning (2)
speaker recognition (2)
semi-supervised learning (2)
deep learning (1)
language identification (1)
knowledge distillation (1)
low-rank adaptation (1)
multimodal learning (1)
disentangled representation (1)
latent representation (1)
vector quantization (1)
sequence-to-sequence learning (1)
machine translation (1)
image generation (1)
Papers
Generative Pre-training for Speech with Flow Matching
ICLR 2024
MusicFlow: Cascaded Flow Matching for Text Guided Music Generation
ICML 2024
Learning Fine-Grained Controllability on Speech Generation via Efficient Fine-Tuning
INTERSPEECH 2024
Scaling Speech Technology to 1,000+ Languages
JMLR 2024
XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale
INTERSPEECH 2022
Unsupervised Learning of Disentangled Speech Content and Style Representation
INTERSPEECH 2021
Incremental Machine Speech Chain Towards Enabling Listening While Speaking in Real-Time
INTERSPEECH 2020
Transformer VQ-VAE for Unsupervised Unit Discovery and Speech Synthesis: ZeroSpeech 2020 Challenge
INTERSPEECH 2020
Augmenting Images for ASR and TTS Through Single-Loop and Dual-Loop Multimodal Chain Framework
INTERSPEECH 2020
Sequence-to-Sequence Learning via Attention Transfer for Incremental Speech Recognition
INTERSPEECH 2019
VQVAE Unsupervised Unit Discovery and Multi-Scale Code2Spec Inverter for Zerospeech Challenge 2019
INTERSPEECH 2019
Compressing End-to-end ASR Networks by Tensor-Train Decomposition
INTERSPEECH 2018
Machine Speech Chain with One-shot Speaker Adaptation
INTERSPEECH 2018
Local Monotonic Attention Mechanism for End-to-End Speech And Language Processing
IJCNLP 2017