Papers
8,761 papers found
Multitask Sequence-to-Sequence Models for Grapheme-to-Phoneme Conversion
Benjamin Milde, Christoph Schmidt, Joachim Köhler
Multiview Representation Learning via Deep CCA for Silent Speech Recognition
Myungjong Kim, Beiming Cao, Ted Mau et al.
Musical Speech: A New Methodology for Transcribing Speech Prosody
Alexsandro R. Meireles, Antônio R.M. Simões, Antonio Celso Ribeiro et al.
Music Tempo Estimation Using Sub-Band Synchrony
Shreyan Chowdhury, Tanaya Guha, Rajesh M. Hegde
Mylly — The Mill: A New Platform for Processing Speech and Text Corpora Easily and Efficiently
Mietta Lennes, Jussi Piitulainen, Martin Matthiesen
Nativization of Foreign Names in TTS for Automatic Reading of World News in Swahili
Joseph Mendelson, Pilar Oplustil, Oliver Watts et al.
Nature of Contrast and Coarticulation: Evidence from Mizo Tones and Assamese Vowel Harmony
Indranil Dutta, Irfan S., Pamir Gogoi et al.
Neural Network-Based Spectrum Estimation for Online WPE Dereverberation
Keisuke Kinoshita, Marc Delcroix, Haeyong Kwon et al.
Neural Speech Recognizer: Acoustic-to-Word LSTM Model for Large Vocabulary Speech Recognition
Hagen Soltau, Hank Liao, Haşim Sak
NMT-Based Segmentation and Punctuation Insertion for Real-Time Spoken Language Translation
Eunah Cho, Jan Niehues, Alex Waibel
Node Pruning Based on Entropy of Weights and Node Activity for Small-Footprint Acoustic Model Based on Deep Neural Networks
Ryu Takeda, Kazuhiro Nakadai, Kazunori Komatani
Non-Local Estimation of Speech Signal for Vowel Onset Point Detection in Varied Environments
Avinash Kumar, S. Shahnawazuddin, Gayadhar Pradhan
Nonparametrically Trained Probabilistic Linear Discriminant Analysis for i-Vector Speaker Verification
Abbas Khosravani, Mohammad Mehdi Homayounpour
Non-Uniform MCE Training of Deep Long Short-Term Memory Recurrent Neural Networks for Keyword Spotting
Zhong Meng, Biing-Hwang Juang
Nora the Empathetic Psychologist
Genta Indra Winata, Onno Kampman, Yang Yang et al.
Novel Shifted Real Spectrum for Exact Signal Reconstruction
Meet H. Soni, Rishabh Tak, Hemant A. Patil
Novel Variable Length Teager Energy Separation Based Instantaneous Frequency Features for Replay Detection
Hemant A. Patil, Madhu R. Kamble, Tanvina B. Patel et al.
NTCD-TIMIT: A New Database and Baseline for Noise-Robust Audio-Visual Speech Recognition
Ahmed Hussen Abdelaziz
Nuance - Politecnico di Torino’s 2016 NIST Speaker Recognition Evaluation System
Daniele Colibro, Claudio Vair, Emanuele Dalmasso et al.
Null-Hypothesis LLR: A Proposal for Forensic Automatic Speaker Recognition
Yosef A. Solewicz, Michael Jessen, David van der Vloed
Objective Severity Assessment from Disordered Voice Using Estimated Glottal Airflow
Yu-Ren Chien, Michal Borský, Jón Guðnason
Occupancy Detection in Commercial and Residential Environments Using Audio Signal
Shabnam Ghaffarzadegan, Attila Reiss, Mirko Ruhs et al.
Off-Topic Spoken Response Detection Using Siamese Convolutional Neural Networks
Chong Min Lee, Su-Youn Yoon, Xihao Wang et al.
Off-Topic Spoken Response Detection with Word Embeddings
Su-Youn Yoon, Chong Min Lee, Ikkyu Choi et al.
On Building Mixed Lingual Speech Synthesis Systems
SaiKrishna Rallabandi, Alan W. Black