Papers
Neural Vocoder is All You Need for Speech Super-resolution
Haohe Liu, Woosung Choi, Xubo Liu et al.
Non-autoregressive Error Correction for CTC-based ASR with Phone-conditioned Masked LM
Hayato Futami, Hirofumi Inaguma, Sei Ueno et al.
Non-contrastive self-supervised learning of utterance-level speech representations
Jaejin Cho, Raghavendra Pappagari, Piotr Żelasko et al.
Non-intrusive Speech Intelligibility Metric Prediction for Hearing Impaired Individuals
George Close, Samuel Hollands, Stefan Goetze et al.
Non-intrusive Speech Quality Assessment with a Multi-Task Learning based Subband Adaptive Attention Temporal Convolutional Neural Network
Xiaofeng Shu, Yanjie Chen, Chuxiang Shang et al.
Non-Linear Pairwise Language Mappings for Low-Resource Multilingual Acoustic Model Fusion
Muhammad Umar Farooq, Darshan Adiga Haniya Narayana, Thomas Hain
Non-Parallel Voice Conversion for ASR Augmentation
Gary Wang, Andrew Rosenberg, Bhuvana Ramabhadran et al.
Nonwords Pronunciation Classification in Language Development Tests for Preschool Children
Ilja Baumann, Dominik Wagner, Sebastian Bayerl et al.
Normalization of code-switched text for speech synthesis
Sreeram Manghat, Sreeja Manghat, Tanja Schultz
Norm-constrained Score-level Ensemble for Spoofing Aware Speaker Verification
Peng Zhang, Peng Hu, Xueliang Zhang
Novel Augmentation Schemes for Device Robust Acoustic Scene Classification
Sukanya Sonowal, Anish Tamse
NRI-FGSM: An Efficient Transferable Adversarial Attack for Speaker Recognition Systems
Hao Tan, Junjian Zhang, Huan Zhang et al.
NTF of Spectral and Spatial Features for Tracking and Separation of Moving Sound Sources in Spherical Harmonic Domain
Mateusz Guzik, Konrad Kowalczyk
NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates
Seungu Han, Junhyeok Lee
Objective Metrics to Evaluate Residual-Echo Suppression During Double-Talk in the Stereophonic Case
Amir Ivry, Israel Cohen, Baruch Berdugo
OCTRA – An Innovative Approach to Orthographic Transcription
Christoph Draxler, Julian Pomp
Oktoechos Classification in Liturgical Music Using SBU-LSTM/GRU
Rajeev Rajan, Ananya Ayasi
On Adaptive Weight Interpolation of the Hybrid Autoregressive Transducer
Ehsan Variani, Michael Riley, David Rybach et al.
On Breathing Pattern Information in Synthetic Speech
Zohreh Mostaani, Mathew Magimai Doss
On Combining Global and Localized Self-Supervised Models of Speech
Sri Harsha Dumpala, Chandramouli Shama Sastry, Rudolf Uher et al.
On-demand compute reduction with stochastic wav2vec 2.0
Apoorv Vyas, Wei-Ning Hsu, Michael Auli et al.
One-Shot Speaker Adaptation Based on Initialization by Generative Adversarial Networks for TTS
Jaeuk Lee, Joon-Hyuk Chang
One-step models in pitch perception: Experimental evidence from Japanese
Takeshi Kishiyama, Chuyu Huang, Yuki Hirose
On joint training with interfaces for spoken language understanding
Anirudh Raju, Milind Rao, Gautam Tiwari et al.