Papers
Domain Adaptation for Enhancing Speech-Based Depression Detection in Natural Environmental Conditions Using Dilated CNNs
Zhaocheng Huang, Julien Epps, Dale Joachim et al.
Domain Adaptation Using Class Similarity for Robust Speech Recognition
Han Zhu, Jiangjiang Zhao, Yuling Ren et al.
Domain Adversarial Neural Networks for Dysarthric Speech Recognition
Dominika Woszczyk, Stavros Petridis, David Millard
Domain Aware Training for Far-Field Small-Footprint Keyword Spotting
Haiwei Wu, Yan Jia, Yuanfei Nie et al.
Domain-Invariant Speaker Vector Projection by Model-Agnostic Meta-Learning
Jiawen Kang, Ruiqi Liu, Lantian Li et al.
Double Adversarial Network Based Monaural Speech Enhancement for Robust Speech Recognition
Zhihao Du, Jiqing Han, Xueliang Zhang
Dual-Adversarial Domain Adaptation for Generalized Replay Attack Detection
Hongji Wang, Heinrich Dinkel, Shuai Wang et al.
Dual Attention in Time and Frequency Domain for Voice Activity Detection
Joohyung Lee, Youngmoon Jung, Hoirin Kim
Dual-Path Transformer Network: Direct Context-Aware Modeling for End-to-End Monaural Speech Separation
Jingjing Chen, Qirong Mao, Dong Liu
Dual-Signal Transformation LSTM Network for Real-Time Noise Suppression
Nils L. Westhausen, Bernd T. Meyer
Dual Stage Learning Based Dynamic Time-Frequency Mask Generation for Audio Event Classification
Donghyeon Kim, Jaihyun Park, David K. Han et al.
DurIAN: Duration Informed Attention Network for Speech Synthesis
Chengzhu Yu, Heng Lu, Na Hu et al.
DurIAN-SC: Duration Informed Attention Network Based Singing Voice Conversion System
Liqiang Zhang, Chengzhu Yu, Heng Lu et al.
Dynamic Margin Softmax Loss for Speaker Verification
Dao Zhou, Longbiao Wang, Kong Aik Lee et al.
Dynamic Prosody Generation for Speech Synthesis Using Linguistics-Driven Acoustic Embedding Selection
Shubhi Tyagi, Marco Nicolis, Jonas Rohnke et al.
Dynamic Soft Windowing and Language Dependent Style Token for Code-Switching End-to-End Speech Synthesis
Ruibo Fu, Jianhua Tao, Zhengqi Wen et al.
Dynamic Speaker Representations Adjustment and Decoder Factorization for Speaker Adaptation in End-to-End Speech Synthesis
Ruibo Fu, Jianhua Tao, Zhengqi Wen et al.
Dysarthria Detection and Severity Assessment Using Rhythm-Based Metrics
Abner Hernandez, Eun Jung Yeo, Sunhee Kim et al.
Dysarthric Speech Recognition Based on Deep Metric Learning
Yuki Takashima, Ryoichi Takashima, Tetsuya Takiguchi et al.
Early Stage LM Integration Using Local and Global Log-Linear Combination
Wilfried Michel, Ralf Schlüter, Hermann Ney
ECAPA-TDNN: Emphasized Channel Attention, Propagation and Aggregation in TDNN Based Speaker Verification
Brecht Desplanques, Jenthe Thienpondt, Kris Demuynck
EEG-Based Short-Time Auditory Attention Detection Using Multi-Task Deep Learning
Zhuo Zhang, Gaoyan Zhang, Jianwu Dang et al.
Effect of Adding Positional Information on Convolutional Neural Networks for End-to-End Speech Recognition
Jinhwan Park, Wonyong Sung
Effect of Microphone Position Measurement Error on RIR and its Impact on Speech Intelligibility and Quality
Aditya Raikar, Karan Nathwani, Ashish Panda et al.