Li-Rong Dai

35 papers · 2016–2023 · 1 conference · across top CS/AI conferences

Achievements

🧭 Keyword Pioneer 🗺️ Taxonomy Completionist (20) 🌈 Renaissance Researcher (5) 🌉 Interdisciplinary Bridge 🐣 Hot Topic Early Bird 🏠 Conference Loyalist (35) 🔬 Deep Specialist (13) 🏆 Keyword Champion (3) 🤝 Dynamic Duo (11) 💎 Century Club (35) 📈 Trend Setter 🚀 Conference Pioneer 🔥 Unstoppable (8) ⚡ Prolific Year (7) 🗃️ Keyword Collector (66)

Conferences

INTERSPEECH (35)

Top co-authors

Yan Song (11) Ian McLoughlin (11) Zhen-Hua Ling (10) Lin Liu (7) Jun Du (6) Shiliang Zhang (5) Jie Zhang (5) Wu Guo (4) Chin-Hui Lee (3) Yiheng Jiang (3)

Keywords

speaker verification (8) deep neural network (6) long short-term memory (4) speaker embedding (4) embedding learning (4) speech synthesis (4) semi-supervised learning (3) recurrent neural network (3) convolutional neural network (3) attention mechanism (3) sound event detection (3) voice conversion (3) neural network (3) audio classification (2) signal-to-noise ratio (2) acoustic model (2) domain adaptation (2) transfer learning (2) automatic speech recognition (2) language identification (2)

Papers

Robust Prototype Learning for Anomalous Sound Detection INTERSPEECH 2023
Real-Time Causal Spectro-Temporal Voice Activity Detection Based on Convolutional Encoding and Residual Decoding INTERSPEECH 2023
Semantic VAD: Low-Latency Voice Activity Detection for Speech Interaction INTERSPEECH 2023
CASA-ASR: Context-Aware Speaker-Attributed ASR INTERSPEECH 2023
Fine-tuning Audio Spectrogram Transformer with Task-aware Adapters for Sound Event Detection INTERSPEECH 2023
Class-Aware Distribution Alignment based Unsupervised Domain Adaptation for Speaker Verification INTERSPEECH 2022
Differential Time-frequency Log-mel Spectrogram Features for Vision Transformer Based Infant Cry Recognition INTERSPEECH 2022
UnitNet-Based Hybrid Speech Synthesis INTERSPEECH 2021
Automatic Lip-Reading with Hierarchical Pyramidal Convolution and Self-Attention for Image Sequences with No Word Boundaries INTERSPEECH 2021
A Weight Moving Average Based Alternate Decoupled Learning Algorithm for Long-Tailed Language Identification INTERSPEECH 2021
An Improved Wav2Vec 2.0 Pre-Training Approach Using Enhanced Local Dependency Modeling for Speech Recognition INTERSPEECH 2021
An Effective Mutual Mean Teaching Based Domain Adaptation Method for Sound Event Detection INTERSPEECH 2021
An Effective Perturbation Based Semi-Supervised Learning Method for Sound Event Detection INTERSPEECH 2020
Semi-Supervised End-to-End ASR via Teacher-Student Learning with Conditional Posterior Distribution INTERSPEECH 2020
Recognition-Synthesis Based Non-Parallel Voice Conversion with Adversarial Learning INTERSPEECH 2020
An Effective Speaker Recognition Method Based on Joint Identification and Verification Supervisions INTERSPEECH 2020
An Effective Deep Embedding Learning Architecture for Speaker Verification INTERSPEECH 2019
Improving Aggregation and Loss Function for Better Embedding Learning in End-to-End Speaker Verification System INTERSPEECH 2019
Multi-Task Learning with High-Order Statistics for x-Vector Based Text-Independent Speaker Verification INTERSPEECH 2019
Deep Neural Network Embeddings with Gating Mechanisms for Text-Independent Speaker Verification INTERSPEECH 2019
A Chinese Dataset for Identifying Speakers in Novels INTERSPEECH 2019
Singing Voice Synthesis Using Deep Autoregressive Neural Networks for Acoustic Modeling INTERSPEECH 2019
Neural Text Clustering with Document-Level Attention Based on Dynamic Soft Labels INTERSPEECH 2019
WaveNet Vocoder with Limited Training Data for Voice Conversion INTERSPEECH 2018
Learning and Modeling Unit Embeddings for Improving HMM-based Unit Selection Speech Synthesis INTERSPEECH 2018
Gaussian Prediction Based Attention for Online End-to-End Speech Recognition INTERSPEECH 2017
End-to-End Language Identification Using High-Order Utterance Representation with Bilinear Pooling INTERSPEECH 2017
A Maximum Likelihood Approach to Deep Neural Network Based Nonlinear Spectral Mapping for Single-Channel Speech Separation INTERSPEECH 2017
The USTC System for Voice Conversion Challenge 2016: Neural Network Based Approaches for Spectrum, Aperiodicity and F0 Conversion INTERSPEECH 2016
Speech Bandwidth Extension Using Bottleneck Features and Deep Recurrent Neural Networks INTERSPEECH 2016
Articulatory-to-Acoustic Conversion with Cascaded Prediction of Spectral and Excitation Features Using Neural Networks INTERSPEECH 2016
SNR-Based Progressive Learning of Deep Neural Network for Speech Enhancement INTERSPEECH 2016
Future Context Attention for Unidirectional LSTM Based Acoustic Model INTERSPEECH 2016
Compact Feedforward Sequential Memory Networks for Large Vocabulary Continuous Speech Recognition INTERSPEECH 2016
RNN-BLSTM Based Multi-Pitch Estimation INTERSPEECH 2016