Papers
Correlation Between Prosody and Pragmatics: Case Study of Discourse Markers in French and English
Lou Lee, Denis Jouvet, Katarina Bartkova et al.
Cortical Oscillatory Hierarchy for Natural Sentence Processing
Bin Zhao, Jianwu Dang, Gaoyan Zhang et al.
Cosine-Distance Virtual Adversarial Training for Semi-Supervised Speaker-Discriminative Acoustic Embeddings
Florian L. Kreyssig, Philip C. Woodland
Coswara — A Database of Breathing, Cough, and Voice Sounds for COVID-19 Diagnosis
Neeraj Sharma, Prashant Krishnan, Rohit Kumar et al.
Cotatron: Transcription-Guided Speech Encoder for Any-to-Many Voice Conversion Without Parallel Data
Seung-won Park, Doo-young Kim, Myun-chul Joe
Cross Attention with Monotonic Alignment for Speech Transformer
Yingzhu Zhao, Chongjia Ni, Cheung-Chi Leung et al.
Cross-Domain Adaptation of Spoken Language Identification for Related Languages: The Curious Case of Slavic Languages
Badr M. Abdullah, Tania Avgustinova, Bernd Möbius et al.
Cross-Domain Adaptation with Discrepancy Minimization for Text-Independent Forensic Speaker Verification
Zhenyu Wang, Wei Xia, John H.L. Hansen
Cross-Lingual Speaker Verification with Domain-Balanced Hard Prototype Mining and Language-Dependent Score Normalization
Jenthe Thienpondt, Brecht Desplanques, Kris Demuynck
Cross-Lingual Text-To-Speech Synthesis via Domain Adaptation and Perceptual Similarity Regression in Speaker Space
Detai Xin, Yuki Saito, Shinnosuke Takamichi et al.
Crossmodal Sound Retrieval Based on Specific Target Co-Occurrence Denoted with Weak Labels
Masahiro Yasuda, Yasunori Ohishi, Yuma Koizumi et al.
CSL-EMG_Array: An Open Access Corpus for EMG-to-Speech Conversion
Lorenz Diener, Mehrdad Roustay Vishkasougheh, Tanja Schultz
CTC-Synchronous Training for Monotonic Attention Model
Hirofumi Inaguma, Masato Mimura, Tatsuya Kawahara
CUCHILD: A Large-Scale Cantonese Corpus of Child Speech for Phonology and Articulation Assessment
Si-Ioi Ng, Cymie Wing-Yee Ng, Jiarui Wang et al.
Cues for Perception of Gender in Synthetic Voices and the Role of Identity
Maxwell Hope, Jason Lilley
CycleGAN-VC3: Examining and Improving CycleGAN-VCs for Mel-Spectrogram Conversion
Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka et al.
Cyclic Spectral Modeling for Unsupervised Unit Discovery into Voice Conversion with Excitation and Waveform Modeling
Patrick Lumban Tobing, Tomoki Hayashi, Yi-Chiao Wu et al.
DARTS-ASR: Differentiable Architecture Search for Multilingual Speech Recognition and Adaptation
Yi-Chen Chen, Jui-Yang Hsu, Cheng-Kuang Lee et al.
Data Augmentation for Code-Switch Language Modeling by Fusing Multiple Text Generation Methods
Xinhui Hu, Qi Zhang, Lei Yang et al.
Data Augmentation Using Prosody and False Starts to Recognize Non-Native Children’s Speech
Hemant Kathania, Mittul Singh, Tamás Grósz et al.
Data Balancing for Boosting Performance of Low-Frequency Classes in Spoken Language Understanding
Judith Gaspers, Quynh Do, Fabian Triefenbach
Data Efficient Voice Cloning from Noisy Samples with Domain Adversarial Training
Jian Cong, Shan Yang, Lei Xie et al.
Datasets and Benchmarks for Task-Oriented Log Dialogue Ranking Task
Xinnuo Xu, Yizhe Zhang, Lars Liden et al.