Papers
Coughing-Based Recognition of Covid-19 with Spatial Attentive ConvLSTM Recurrent Neural Networks
Tianhao Yan, Hao Meng, Emilia Parada-Cabaleiro et al.
COVID-19 Detection from Spectral Features on the DiCOVA Dataset
Kotra Venkata Sai Ritwik, Shareef Babu Kalluri, Deepu Vijayasenan
CoVoST 2 and Massively Multilingual Speech Translation
Changhan Wang, Anne Wu, Jiatao Gu et al.
Cramér-Rao Lower Bound for DOA Estimation with an Array of Directional Microphones in Reverberant Environments
Weiguang Chen, Cheng Xue, Xionghu Zhong
Cross-Database Replay Detection in Terminal-Dependent Speaker Verification
Xingliang Cheng, Mingxing Xu, Thomas Fang Zheng
Cross-Domain Speech Recognition with Unsupervised Character-Level Distribution Matching
Wenxin Hou, Jindong Wang, Xu Tan et al.
Crossfire Conditional Generative Adversarial Networks for Singing Voice Extraction
Weitao Yuan, Shengbei Wang, Xiangrui Li et al.
Cross-Lingual Low Resource Speaker Adaptation Using Phonological Features
Georgia Maniati, Nikolaos Ellinas, Konstantinos Markopoulos et al.
Cross-Lingual Speaker Adaptation Using Domain Adaptation and Speaker Consistency Loss for Text-To-Speech Synthesis
Detai Xin, Yuki Saito, Shinnosuke Takamichi et al.
Cross-Lingual Voice Conversion with a Cycle Consistency Loss on Linguistic Representation
Yi Zhou, Xiaohai Tian, Zhizheng Wu et al.
Cross-Lingual Voice Conversion with Disentangled Universal Linguistic Representations
Zhenchuan Yang, Weibin Zhang, Yufei Liu et al.
Cross-Linguistic Perception of the Japanese Singleton/Geminate Contrast: Korean, Mandarin and Mongolian Compared
Kimiko Tsukada, Yurong, Joo-Yeon Kim et al.
Cross-Modal Knowledge Distillation Method for Automatic Cued Speech Recognition
Jianrong Wang, Ziyue Tang, Xuewei Li et al.
Cross-Modal Learning for Audio-Visual Video Parsing
Jatin Lamba, Abhishek, Jayaprakash Akula et al.
Cross-Modal Transformer-Based Neural Correction Models for Automatic Speech Recognition
Tomohiro Tanaka, Ryo Masumura, Mana Ihori et al.
Cross-Speaker Style Transfer with Prosody Bottleneck in Neural Speech Synthesis
Shifeng Pan, Lei He
Ctrl-P: Temporal Control of Prosodic Variation for Speech Synthesis
Devang S. Ram Mohan, Vivian Hu, Tian Huey Teh et al.
Cue Interaction in the Perception of Prosodic Prominence: The Role of Voice Quality
Bogdan Ludusan, Petra Wagner, Marcin Włodarczak
CVC: Contrastive Learning for Non-Parallel Voice Conversion
Tingle Li, Yichen Liu, Chenxu Hu et al.
Data Augmentation for Spoken Language Understanding via Pretrained Language Models
Baolin Peng, Chenguang Zhu, Michael Zeng et al.
Data Augmentation Methods for End-to-End Speech Recognition on Distant-Talk Scenarios
Emiru Tsunoo, Kentaro Shibata, Chaitanya Narisetty et al.
Data Quality as Predictor of Voice Anti-Spoofing Generalization
Bhusan Chettri, Rosa González Hautamäki, Md. Sahidullah et al.
DBNet: A Dual-Branch Network Architecture Processing on Spectrum and Waveform for Single-Channel Speech Enhancement
Kanghao Zhang, Shulin He, Hao Li et al.
DCCRN+: Channel-Wise Subband DCCRN with SNR Estimation for Speech Enhancement
Shubo Lv, Yanxin Hu, Shimin Zhang et al.