Papers
Comparing CTC and LFMMI for Out-of-Domain Adaptation of wav2vec 2.0 Acoustic Model
Apoorv Vyas, Srikanth Madikeri, Hervé Bourlard
Comparing Speech Enhancement Techniques for Voice Adaptation-Based Speech Synthesis
Nicholas Eng, C.T. Justine Hui, Yusuke Hioka et al.
Comparing Supervised Models and Learned Speech Representations for Classifying Intelligibility of Disordered Speech on Selected Phrases
Subhashini Venugopalan, Joel Shor, Manoj Plakal et al.
Comparison Between Lumped-Mass Modeling and Flow Simulation of the Reed-Type Artificial Vocal Fold
Rafia Inaam, Tsukasa Yoshinaga, Takayuki Arai et al.
Comparison of Remote Experiments Using Crowdsourcing and Laboratory Experiments on Speech Intelligibility
Ayako Yamamoto, Toshio Irino, Kenichi Arai et al.
Comparison of the Finite Element Method, the Multimodal Method and the Transmission-Line Model for the Computation of Vocal Tract Transfer Functions
Rémi Blandin, Marc Arnela, Simon Félix et al.
Compressing 1D Time-Channel Separable Convolutions Using Sparse Random Ternary Matrices
Gonçalo Mordido, Matthijs Van keirsbilck, Alexander Keller
Conditional Independence for Pretext Task Selection in Self-Supervised Speech Representation Learning
Salah Zaiem, Titouan Parcollet, Slim Essid
Confidence Intervals for ASR-Based TTS Evaluation
Jason Taylor, Korin Richmond
Configurable Privacy-Preserving Automatic Speech Recognition
Ranya Aloufi, Hamed Haddadi, David Boyle
Conformer Parrotron: A Faster and Stronger End-to-End Speech Conversion and Recognition Model for Atypical Speech
Zhehuai Chen, Bhuvana Ramabhadran, Fadi Biadsy et al.
Context and Co-Text Influence on the Accuracy Production of Italian L2 Non-Native Sounds
Sonia d'Apolito, Barbara Gili Fivela
Contextual Density Ratio for Language Model Biasing of Sequence to Sequence ASR Systems
Jesús Andrés-Ferrer, Dario Albesano, Puming Zhan et al.
Contextualized Attention-Based Knowledge Transfer for Spoken Conversational Question Answering
Chenyu You, Nuo Chen, Yuexian Zou
Contextualized Streaming End-to-End Speech Recognition with Trie-Based Deep Biasing and Shallow Fusion
Duc Le, Mahaveer Jain, Gil Keren et al.
Contextual Semi-Supervised Learning: An Approach to Leverage Air-Surveillance and Untranscribed ATC Data in ASR Systems
Juan Zuluaga-Gomez, Iuliia Nigmatulina, Amrutha Prasad et al.
Continual Learning for Fake Audio Detection
Haoxin Ma, Jiangyan Yi, Jianhua Tao et al.
Continuous Speech Separation Using Speaker Inventory for Long Recording
Cong Han, Yi Luo, Chenda Li et al.
Continuous Wavelet Vocoder-Based Decomposition of Parametric Speech Waveform Synthesis
Mohammed Salah Al-Radhi, Tamás Gábor Csapó, Csaba Zainkó et al.
Contrastive Learning of Cough Descriptors for Automatic COVID-19 Preliminary Diagnosis
Swapnil Bhosale, Upasana Tiwari, Rupayan Chakraborty et al.
Controllable Context-Aware Conversational Speech Synthesis
Jian Cong, Shan Yang, Na Hu et al.
Conversion of Airborne to Bone-Conducted Speech with Deep Neural Networks
Michael Pucher, Thomas Woltron
Coreference Augmentation for Multi-Domain Task-Oriented Dialogue State Tracking
Ting Han, Chongxuan Huang, Wei Peng
Correcting Automated and Manual Speech Transcription Errors Using Warped Language Models
Mahdi Namazifar, John Malik, Li Erran Li et al.
Cough-Based COVID-19 Detection with Contextual Attention Convolutional Neural Networks and Gender Information
Adria Mallol-Ragolta, Helena Cuesta, Emilia Gómez et al.