Papers
Neural Spatial Filter: Target Speaker Speech Separation Assisted with Directional Information
Rongzhi Gu, Lianwu Chen, Shi-Xiong Zhang et al.
Neural Text Clustering with Document-Level Attention Based on Dynamic Soft Labels
Zhi Chen, Wu Guo, Li-Rong Dai et al.
Neural Transfer Learning for Cry-Based Diagnosis of Perinatal Asphyxia
Charles C. Onu, Jonathan Lebensold, William L. Hamilton et al.
Neural Transition Systems for Modeling Hierarchical Semantic Representations
Riyaz Bhat, John Chen, Rashmi Prasad et al.
Neural Whispered Speech Detection with Imbalanced Learning
Takanori Ashihara, Yusuke Shinohara, Hiroshi Sato et al.
NIESR: Nuisance Invariant End-to-End Speech Recognition
I-Hung Hsu, Ayush Jaiswal, Premkumar Natarajan
NITK Kids’ Speech Corpus
Pravin Bhaskar Ramteke, Sujata Supanekar, Pradyoth Hegde et al.
No Distributional Learning in Adults from Attended Listening to Non-Speech
Ellen Marklund, Johan Sjons, Lisa Gustavsson et al.
Noise Adaptive Speech Enhancement Using Domain Adversarial Training
Chien-Feng Liao, Yu Tsao, Hung-Yi Lee et al.
Noisy BiLSTM-Based Models for Disfluency Detection
Nguyen Bach, Fei Huang
Nonparallel Emotional Speech Conversion
Jian Gao, Deep Chakraborty, Hamidou Tembine et al.
Non-Parallel Voice Conversion Using Weighted Generative Adversarial Networks
Dipjyoti Paul, Yannis Pantazis, Yannis Stylianou
Non-Parallel Voice Conversion with Cyclic Variational Autoencoder
Patrick Lumban Tobing, Yi-Chiao Wu, Tomoki Hayashi et al.
NUS Speak-to-Sing: A Web Platform for Personalized Speech-to-Singing Conversion
Chitralekha Gupta, Karthika Vijayan, Bidisha Sharma et al.
Objective Assessment of Social Skills Using Automated Language Analysis for Identification of Schizophrenia and Bipolar Disorder
Rohit Voleti, Stephanie Woolridge, Julie M. Liss et al.
Off the Cuff: Exploring Extemporaneous Speech Delivery with TTS
Éva Székely, Gustav Eje Henter, Jonas Beskow et al.
One-Pass Single-Channel Noisy Speech Recognition Using a Combination of Noisy and Enhanced Features
Masakiyo Fujimoto, Hisashi Kawai
One-Shot Voice Conversion by Separating Speaker and Content Representations with Instance Normalization
Ju-chieh Chou, Hung-Yi Lee
One-Shot Voice Conversion with Disentangled Representations by Leveraging Phonetic Posteriorgrams
Seyed Hamidreza Mohammadi, Taehwan Kim
One-Shot Voice Conversion with Global Speaker Embeddings
Hui Lu, Zhiyong Wu, Dongyang Dai et al.
One-vs-All Models for Asynchronous Training: An Empirical Analysis
Rahul Gupta, Aman Alok, Shankar Ananthakrishnan
On Learning Interpretable CNNs with Parametric Modulated Kernel-Based Filters
Erfan Loweimi, Peter Bell, Steve Renals
Online Hybrid CTC/Attention Architecture for End-to-End Speech Recognition
Haoran Miao, Gaofeng Cheng, Pengyuan Zhang et al.
Online Speech Processing and Analysis Suite
Wikus Pienaar, Daan Wissing