Li-Rong Dai
35 papers · 2016–2023 · 1 conference · across top CS/AI conferences
Achievements
Jump to papers ↓+12 more ↓ Show less ↑
π§ Keyword Pioneer πΊοΈ Taxonomy Completionist (20) π Renaissance Researcher (5) π Interdisciplinary Bridge π£ Hot Topic Early Bird
π§
Keyword Pioneer
πΊοΈ
Taxonomy Completionist
(20)
π
Conference Loyalist
(35)
π¬
Deep Specialist
(13)
π
Keyword Champion
(3)
π€
Dynamic Duo
(11)
π
Century Club
(35)
π
Trend Setter
π
Conference Pioneer
π₯
Unstoppable
(8)
β‘
Prolific Year
(7)
ποΈ
Keyword Collector
(66)
Conferences
INTERSPEECH (35)
Top co-authors
Keywords
speaker verification
(8)
deep neural network
(6)
long short-term memory
(4)
speaker embedding
(4)
embedding learning
(4)
speech synthesis
(4)
semi-supervised learning
(3)
recurrent neural network
(3)
convolutional neural network
(3)
attention mechanism
(3)
sound event detection
(3)
voice conversion
(3)
neural network
(3)
audio classification
(2)
signal-to-noise ratio
(2)
acoustic model
(2)
domain adaptation
(2)
transfer learning
(2)
automatic speech recognition
(2)
language identification
(2)
Papers
Robust Prototype Learning for Anomalous Sound Detection
INTERSPEECH 2023
Real-Time Causal Spectro-Temporal Voice Activity Detection Based on Convolutional Encoding and Residual Decoding
INTERSPEECH 2023
Semantic VAD: Low-Latency Voice Activity Detection for Speech Interaction
INTERSPEECH 2023
CASA-ASR: Context-Aware Speaker-Attributed ASR
INTERSPEECH 2023
Fine-tuning Audio Spectrogram Transformer with Task-aware Adapters for Sound Event Detection
INTERSPEECH 2023
Class-Aware Distribution Alignment based Unsupervised Domain Adaptation for Speaker Verification
INTERSPEECH 2022
Differential Time-frequency Log-mel Spectrogram Features for Vision Transformer Based Infant Cry Recognition
INTERSPEECH 2022
UnitNet-Based Hybrid Speech Synthesis
INTERSPEECH 2021
Automatic Lip-Reading with Hierarchical Pyramidal Convolution and Self-Attention for Image Sequences with No Word Boundaries
INTERSPEECH 2021
A Weight Moving Average Based Alternate Decoupled Learning Algorithm for Long-Tailed Language Identification
INTERSPEECH 2021
An Improved Wav2Vec 2.0 Pre-Training Approach Using Enhanced Local Dependency Modeling for Speech Recognition
INTERSPEECH 2021
An Effective Mutual Mean Teaching Based Domain Adaptation Method for Sound Event Detection
INTERSPEECH 2021
An Effective Perturbation Based Semi-Supervised Learning Method for Sound Event Detection
INTERSPEECH 2020
Semi-Supervised End-to-End ASR via Teacher-Student Learning with Conditional Posterior Distribution
INTERSPEECH 2020
Recognition-Synthesis Based Non-Parallel Voice Conversion with Adversarial Learning
INTERSPEECH 2020
An Effective Speaker Recognition Method Based on Joint Identification and Verification Supervisions
INTERSPEECH 2020
An Effective Deep Embedding Learning Architecture for Speaker Verification
INTERSPEECH 2019
Improving Aggregation and Loss Function for Better Embedding Learning in End-to-End Speaker Verification System
INTERSPEECH 2019
Multi-Task Learning with High-Order Statistics for x-Vector Based Text-Independent Speaker Verification
INTERSPEECH 2019
Deep Neural Network Embeddings with Gating Mechanisms for Text-Independent Speaker Verification
INTERSPEECH 2019
A Chinese Dataset for Identifying Speakers in Novels
INTERSPEECH 2019
Singing Voice Synthesis Using Deep Autoregressive Neural Networks for Acoustic Modeling
INTERSPEECH 2019
Neural Text Clustering with Document-Level Attention Based on Dynamic Soft Labels
INTERSPEECH 2019
WaveNet Vocoder with Limited Training Data for Voice Conversion
INTERSPEECH 2018
Learning and Modeling Unit Embeddings for Improving HMM-based Unit Selection Speech Synthesis
INTERSPEECH 2018
Gaussian Prediction Based Attention for Online End-to-End Speech Recognition
INTERSPEECH 2017
End-to-End Language Identification Using High-Order Utterance Representation with Bilinear Pooling
INTERSPEECH 2017
A Maximum Likelihood Approach to Deep Neural Network Based Nonlinear Spectral Mapping for Single-Channel Speech Separation
INTERSPEECH 2017
The USTC System for Voice Conversion Challenge 2016: Neural Network Based Approaches for Spectrum, Aperiodicity and F0Conversion
INTERSPEECH 2016
Speech Bandwidth Extension Using Bottleneck Features and Deep Recurrent Neural Networks
INTERSPEECH 2016
Articulatory-to-Acoustic Conversion with Cascaded Prediction of Spectral and Excitation Features Using Neural Networks
INTERSPEECH 2016
SNR-Based Progressive Learning of Deep Neural Network for Speech Enhancement
INTERSPEECH 2016
Future Context Attention for Unidirectional LSTM Based Acoustic Model
INTERSPEECH 2016
Compact Feedforward Sequential Memory Networks for Large Vocabulary Continuous Speech Recognition
INTERSPEECH 2016
RNN-BLSTM Based Multi-Pitch Estimation
INTERSPEECH 2016