Lirong Dai

22 papers · 2015–2025 · 8 conferences · across top CS/AI conferences

Achievements

🌍 Conference Polyglot (8) 🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🌉 Interdisciplinary Bridge 🏃 Academic Marathon (10) 🧬 Topic Evolution 🗃️ Keyword Collector (97) 💎 Century Club (22) 🔥 Unstoppable (6) 📈 Trend Setter ⚡ Prolific Year (5)

Conferences

INTERSPEECH (8) ACL (5) AAAI (3) IJCNLP (2) EMNLP (1) ICCV (1) ICML (1) JMLR (1)

Top co-authors

Jie Zhang (6) Ian McLoughlin (4) Yan Song (4) Weitai Zhang (3) Hui Jiang (3) Shiliang Zhang (3) Dan Liu (3) Xiaoxi Li (3) Shihao Chen (3) Yuchen Hu (3)

Research topics

Analysis (1)

Keywords

speech translation (5) automatic speech recognition (4) encoder-decoder architecture (3) data augmentation (3) convolutional neural network (3) speech recognition (3) end-to-end model (3) cascaded model (2) contrastive learning (2) speech emotion recognition (2) representation learning (2) simultaneous translation (2) multichannel speech (2) multimodal learning (2) semi-supervised learning (2) model ensemble (2) self-supervised learning (2) variational autoencoder (2) language modeling (1) embedding learning (1)

Papers

CSSinger: End-to-End Chunkwise Streaming Singing Voice Synthesis System Based on Conditional Variational Autoencoder · AAAI 2025
An Effective Local Prototypical Mapping Network for Speech Emotion Recognition · INTERSPEECH 2024
Multichannel AV-wav2vec2: A Framework for Learning Multichannel Multi-Modal Speech Representation · AAAI 2024
LDM-SVC: Latent Diffusion Model Based Zero-Shot Any-to-Any Singing Voice Conversion with Singer Guidance · INTERSPEECH 2024
Speech4Mesh: Speech-Assisted Monocular 3D Facial Reconstruction for Speech-Driven 3D Facial Animation · ICCV 2023
The USTC’s Dialect Speech Translation System for IWSLT 2023 · ACL 2023
Submission of USTC’s System for the IWSLT 2023 - Offline Speech Translation Track · ACL 2023
External Text Based Data Augmentation for Low-Resource Speech Recognition in the Constrained Condition of OpenASR21 Challenge · INTERSPEECH 2022
SpeechUT: Bridging Speech and Text with Hidden-Unit for Encoder-Decoder Based Speech-Text Pre-training · EMNLP 2022
A Complementary Joint Training Approach Using Unpaired Speech and Text · INTERSPEECH 2022
The USTC-NELSLIP Offline Speech Translation Systems for IWSLT 2022 · ACL 2022
Pre-Training Transformer Decoder for End-to-End ASR Model with Unpaired Speech Data · INTERSPEECH 2022
The USTC-NELSLIP Systems for Simultaneous Speech Translation Task at IWSLT 2021 · ACL 2021
TaLNet: Voice Reconstruction from Tongue and Lip Articulation with Transfer Learning from Text-to-Speech Synthesis · AAAI 2021
The USTC-NELSLIP Systems for Simultaneous Speech Translation Task at IWSLT 2021 · IJCNLP 2021
A Tree-Structured Decoder for Image-to-Markup Generation · ICML 2020
An Improved Deep Embedding Learning Method for Short Duration Speaker Verification · INTERSPEECH 2018
Acoustic Modeling with Densely Connected Residual Network for Multichannel Speech Recognition · INTERSPEECH 2018
An Attention Pooling Based Representation Learning Method for Speech Emotion Recognition · INTERSPEECH 2018
Hybrid Orthogonal Projection and Estimation (HOPE): A New Framework to Learn Neural Networks · JMLR 2016
The Fixed-Size Ordinally-Forgetting Encoding Method for Neural Network Language Models · IJCNLP 2015
The Fixed-Size Ordinally-Forgetting Encoding Method for Neural Network Language Models · ACL 2015