Papers
Nonlinear ISA with Auxiliary Variables for Learning Speech Representations
Amrith Setlur, Barnabás Póczos, Alan W. Black
Nonlinear Residual Echo Suppression Based on Multi-Stream Conv-TasNet
Hongsheng Chen, Teng Xiang, Kai Chen et al.
Nonlinear Residual Echo Suppression Using a Recurrent Neural Network
Lukas Pfeifenberger, Franz Pernkopf
Non-Native Children’s Automatic Speech Recognition: The INTERSPEECH 2020 Shared Task ALTA Systems
Kate M. Knill, Linlin Wang, Yu Wang et al.
Nonparallel Emotional Speech Conversion Using VAE-GAN
Yuexin Cao, Zhengchen Liu, Minchuan Chen et al.
Non-Parallel Emotion Conversion Using a Deep-Generative Hybrid Network and an Adversarial Pair Discriminator
Ravi Shankar, Jacob Sager, Archana Venkataraman
Non-Parallel Many-to-Many Voice Conversion with PSR-StarGAN
Yanping Li, Dongxiang Xu, Yan Zhang et al.
Nonparallel Training of Exemplar-Based Voice Conversion System Using INCA-Based Alignment Technique
Hitoshi Suda, Gaku Kotani, Daisuke Saito
Non-Parallel Voice Conversion with Fewer Labeled Data by Conditional Generative Adversarial Networks
Minchuan Chen, Weijian Hou, Jun Ma et al.
Now You’re Speaking My Language: Visual Language Identification
Triantafyllos Afouras, Joon Son Chung, Andrew Zisserman
NPU Speaker Verification System for INTERSPEECH 2020 Far-Field Speaker Verification Challenge
Li Zhang, Jian Wu, Lei Xie
One Model, Many Languages: Meta-Learning for Multilingual Text-to-Speech
Tomáš Nekvinda, Ondřej Dušek
On Front-End Gain Invariant Modeling for Wake Word Spotting
Yixin Gao, Noah D. Stein, Chieh-Chi Kao et al.
Ongoing Phonologization of Word-Final Voicing Alternations in Two Romance Languages: Romanian and French
Mathilde Hutin, Adèle Jatteau, Ioana Vasilescu et al.
On Improving Code Mixed Speech Synthesis with Mixlingual Grapheme-to-Phoneme Model
Shubham Bansal, Arijit Mukherjee, Sandeepkumar Satpal et al.
Online Blind Reverberation Time Estimation Using CRNNs
Shuwen Deng, Wolfgang Mack, Emanuël A.P. Habets
Online Directional Speech Enhancement Using Geometrically Constrained Independent Vector Analysis
Li Li, Kazuhito Koishida, Shoji Makino
Online Monaural Speech Enhancement Using Delayed Subband LSTM
Xiaofei Li, Radu Horaud
On Loss Functions and Recurrency Training for GAN-Based Speech Enhancement Systems
Zhuohuang Zhang, Chengyun Deng, Yi Shen et al.
On Parameter Adaptation in Softmax-Based Cross-Entropy Loss for Improved Convergence Speed and Accuracy in DNN-Based Speaker Recognition
Magdalena Rybicka, Konrad Kowalczyk
On Semi-Supervised LF-MMI Training of Acoustic Models with Limited Data
Imran Sheikh, Emmanuel Vincent, Irina Illina
On Synthesis for Supervised Monaural Speech Separation in Time Domain
Jingjing Chen, Qirong Mao, Dong Liu
On the Comparison of Popular End-to-End Models for Large Scale Speech Recognition
Jinyu Li, Yu Wu, Yashesh Gaur et al.
On the Robustness and Training Dynamics of Raw Waveform Models
Erfan Loweimi, Peter Bell, Steve Renals
On the Usage of Multi-Feature Integration for Speaker Verification and Language Identification
Zheng Li, Miao Zhao, Jing Li et al.