Lirong Dai
22 papers · 2015–2025 · 8 conferences · across top CS/AI conferences
Achievements
Conference Polyglot (8)
Keyword Pioneer
Hot Topic Early Bird
Interdisciplinary Bridge
Academic Marathon (10)
Topic Evolution
Keyword Collector (97)
Century Club (22)
Unstoppable (6)
Trend Setter
Prolific Year (5)
Conferences
INTERSPEECH (8)
ACL (5)
AAAI (3)
IJCNLP (2)
EMNLP (1)
ICCV (1)
ICML (1)
JMLR (1)
Research topics
Keywords
speech translation (5)
automatic speech recognition (4)
encoder-decoder architecture (3)
data augmentation (3)
convolutional neural network (3)
speech recognition (3)
end-to-end model (3)
cascaded model (2)
contrastive learning (2)
speech emotion recognition (2)
representation learning (2)
simultaneous translation (2)
multichannel speech (2)
multimodal learning (2)
semi-supervised learning (2)
model ensemble (2)
self-supervised learning (2)
variational autoencoder (2)
language modeling (1)
embedding learning (1)
Papers
CSSinger: End-to-End Chunkwise Streaming Singing Voice Synthesis System Based on Conditional Variational Autoencoder
AAAI 2025
An Effective Local Prototypical Mapping Network for Speech Emotion Recognition
INTERSPEECH 2024
Multichannel AV-wav2vec2: A Framework for Learning Multichannel Multi-Modal Speech Representation
AAAI 2024
LDM-SVC: Latent Diffusion Model Based Zero-Shot Any-to-Any Singing Voice Conversion with Singer Guidance
INTERSPEECH 2024
Speech4Mesh: Speech-Assisted Monocular 3D Facial Reconstruction for Speech-Driven 3D Facial Animation
ICCV 2023
The USTC's Dialect Speech Translation System for IWSLT 2023
ACL 2023
Submission of USTC's System for the IWSLT 2023 - Offline Speech Translation Track
ACL 2023
External Text Based Data Augmentation for Low-Resource Speech Recognition in the Constrained Condition of OpenASR21 Challenge
INTERSPEECH 2022
SpeechUT: Bridging Speech and Text with Hidden-Unit for Encoder-Decoder Based Speech-Text Pre-training
EMNLP 2022
A Complementary Joint Training Approach Using Unpaired Speech and Text
INTERSPEECH 2022
The USTC-NELSLIP Offline Speech Translation Systems for IWSLT 2022
ACL 2022
Pre-Training Transformer Decoder for End-to-End ASR Model with Unpaired Speech Data
INTERSPEECH 2022
The USTC-NELSLIP Systems for Simultaneous Speech Translation Task at IWSLT 2021
ACL 2021
TaLNet: Voice Reconstruction from Tongue and Lip Articulation with Transfer Learning from Text-to-Speech Synthesis
AAAI 2021
The USTC-NELSLIP Systems for Simultaneous Speech Translation Task at IWSLT 2021
IJCNLP 2021
A Tree-Structured Decoder for Image-to-Markup Generation
ICML 2020
An Improved Deep Embedding Learning Method for Short Duration Speaker Verification
INTERSPEECH 2018
Acoustic Modeling with Densely Connected Residual Network for Multichannel Speech Recognition
INTERSPEECH 2018
An Attention Pooling Based Representation Learning Method for Speech Emotion Recognition
INTERSPEECH 2018
Hybrid Orthogonal Projection and Estimation (HOPE): A New Framework to Learn Neural Networks
JMLR 2016
The Fixed-Size Ordinally-Forgetting Encoding Method for Neural Network Language Models
IJCNLP 2015
The Fixed-Size Ordinally-Forgetting Encoding Method for Neural Network Language Models
ACL 2015