Papers
Conformer: Convolution-augmented Transformer for Speech Recognition
Anmol Gulati, James Qin, Chung-Cheng Chiu et al.
Constrained Ratio Mask for Speech Enhancement Using DNN
Hongjiang Yu, Wei-Ping Zhu, Yuhong Yang
Context-Aware Goodness of Pronunciation for Computer-Assisted Pronunciation Training
Jiatong Shi, Nan Huo, Qin Jin
Context-Dependent Acoustic Modeling Without Explicit Phone Clustering
Tina Raissi, Eugen Beck, Ralf Schlüter et al.
Context-Dependent Domain Adversarial Neural Network for Multimodal Emotion Recognition
Zheng Lian, Jianhua Tao, Bin Liu et al.
Context Dependent RNNLM for Automatic Transcription of Conversations
Srikanth Raj Chetupalli, Sriram Ganapathy
ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context
Wei Han, Zhengdong Zhang, Yu Zhang et al.
Contextualized Translation of Automatically Segmented Speech
Marco Gaido, Mattia A. Di Gangi, Matteo Negri et al.
Contextualizing ASR Lattice Rescoring with Hybrid Pointer Network Language Model
Da-Rong Liu, Chunxi Liu, Frank Zhang et al.
Contextual RNN-T for Open Domain ASR
Mahaveer Jain, Gil Keren, Jay Mahadeokar et al.
Continual Learning for Multi-Dialect Acoustic Models
Brady Houston, Katrin Kirchhoff
Continual Learning in Automatic Speech Recognition
Samik Sadhu, Hynek Hermansky
Contrastive Predictive Coding of Audio with an Adversary
Luyu Wang, Kazuya Kawakami, Aaron van den Oord
Contribution of RMS-Level-Based Speech Segments to Target Speech Decoding Under Noisy Conditions
Lei Wang, Ed X. Wu, Fei Chen
Controllable Neural Prosody Synthesis
Max Morrison, Zeyu Jin, Justin Salamon et al.
Controllable Neural Text-to-Speech Synthesis Using Intuitive Prosodic Features
Tuomo Raitio, Ramya Rasipuram, Dan Castellani
Controlling the Strength of Emotions in Speech-Like Emotional Sound Generated by WaveNet
Kento Matsumoto, Sunao Hara, Masanobu Abe
Conversational Emotion Recognition Using Self-Attention Mechanisms and Graph Neural Networks
Zheng Lian, Jianhua Tao, Bin Liu et al.
Converting Anyone’s Emotion: Towards Speaker-Independent Emotional Voice Conversion
Kun Zhou, Berrak Sisman, Mingyang Zhang et al.
Conv-TasSAN: Separative Adversarial Network Based on Conv-TasNet
Chengyun Deng, Yi Zhang, Shiqian Ma et al.
Conv-Transformer Transducer: Low Latency, Low Frame Rate, Streamable End-to-End Speech Recognition
Wenyong Huang, Wenchao Hu, Yu Ting Yeung et al.
CopyCat: Many-to-Many Fine-Grained Prosody Transfer for Neural Text-to-Speech
Sri Karlapati, Alexis Moinet, Arnaud Joly et al.
Correlating Cepstra with Formant Frequencies: Implications for Phonetically-Informed Forensic Voice Comparison
Vincent Hughes, Frantz Clermont, Philip Harrison