Papers
Data Augmentation for End-to-end Silent Speech Recognition for Laryngectomees
Beiming Cao, Kristin Teplansky, Nordine Sebkhi et al.
Data Augmentation for Low-Resource Quechua ASR Improvement
Rodolfo Zevallos, Núria Bel, Guillermo Cámbara et al.
Data Augmentation Using McAdams-Coefficient-Based Speaker Anonymization for Fake Audio Detection
Kai Li, Sheng Li, Xugang Lu et al.
Data-augmented cross-lingual synthesis in a teacher-student framework
Marcel de Korte, Jaebok Kim, Aki Kunikoshi et al.
Dataset Pruning for Resource-constrained Spoofed Audio Detection
Abdul Hameed Azeemi, Ihsan Ayyub Qazi, Agha Ali Raza
DAVIS: Driver’s Audio-Visual Speech recognition
Denis Ivanko, Dmitry Ryumin, Alexey Kashevnik et al.
DCTCN: Deep Complex Temporal Convolutional Network for Long Time Speech Enhancement
Ren Jigang, Mao Qirong
DDKtor: Automatic Diadochokinetic Speech Analysis
Yael Segal, Kasia Hitczenko, Matt Goldrick et al.
DDOS: A MOS Prediction Framework utilizing Domain Adaptive Pre-training and Distribution of Opinion Scores
Wei-Cheng Tseng, Wei-Tsung Kao, Hung-yi Lee
DDS: A new device-degraded speech dataset for speech enhancement
Haoyu Li, Junichi Yamagishi
Dealing with Unknowns in Continual Learning for End-to-end Automatic Speech Recognition
Martin Sustek, Samik Sadhu, Hynek Hermansky
Deciphering Speech: a Zero-Resource Approach to Cross-Lingual Transfer in ASR
Ondrej Klejch, Electra Wallington, Peter Bell
Decoupled Federated Learning for ASR with Non-IID Data
Han Zhu, Jindong Wang, Gaofeng Cheng et al.
Decoupled Pronunciation and Prosody Modeling in Meta-Learning-based Multilingual Speech Synthesis
Yukun Peng, Zhenhua Ling
Deep Audio Waveform Prior
Arnon Turetzky, Tzvi Michelson, Yossi Adi et al.
Deep CNN-based Inductive Transfer Learning for Sarcasm Detection in Speech
Xiyuan Gao, Shekhar Nayak, Matt Coler
DeepFry: Identifying Vocal Fry Using Deep Neural Networks
Bronya Roni Chernyak, Talia Ben Simon, Yael Segal et al.
Deep Learning Approaches for Detecting Alzheimer’s Dementia from Conversational Speech of ILSE Study
Ayimnisagul Ablimit, Karen Scholz, Tanja Schultz
Deep Learning for Prosody-Based Irony Classification in Spontaneous Speech
Helen Gent, Chase Adams, Yan Tang et al.
Deep LSTM Spoken Term Detection using Wav2Vec 2.0 Recognizer
Jan Švec, Jan Lehečka, Luboš Šmídl
Deep Neural Convolutive Matrix Factorization for Articulatory Representation Decomposition
Jiachen Lian, Alan W Black, Louis Goldstein et al.
Deep residual spiking neural network for keyword spotting in low-resource settings
Qu Yang, Qi Liu, Haizhou Li
Deep Segment Model for Acoustic Scene Classification
Yajian Wang, Jun Du, Hang Chen et al.
Deep Self-Supervised Learning of Speech Denoising from Noisy Speeches
Yutaro Sanada, Takumi Nakagawa, Yuichiro Wada et al.
Deep Sparse Conformer for Speech Recognition
Xianchao Wu