Papers
VoxCeleb2: Deep Speaker Recognition
Joon Son Chung, Arsha Nagrani, Andrew Zisserman
Waveform-Based Speaker Representations for Speech Synthesis
Moquan Wan, Gilles Degottex, Mark J.F. Gales
Wavelet Analysis of Speaker Dependent and Independent Prosody for Voice Conversion
Berrak Sisman, Haizhou Li
Wavelet Transform Based Mel-scaled Features for Acoustic Scene Classification
Shefali Waldekar, Goutam Saha
WaveNet Vocoder with Limited Training Data for Voice Conversion
Li-Juan Liu, Zhen-Hua Ling, Yuan Jiang et al.
Weighting of Coda Voicing Cues: Glottalisation and Vowel Duration
Joshua Penney, Felicity Cox, Anita Szakay
Weighting Pitch Contour and Loudness Contour in Mandarin Tone Perception in Cochlear Implant Listeners
Qinglin Meng, Nengheng Zheng, Ambika Prasad Mishra et al.
Weighting Time-Frequency Representation of Speech Using Auditory Saliency for Automatic Speech Recognition
Cong-Thanh Do, Yannis Stylianou
What Do Classifiers Actually Learn? a Case Study on Emotion Recognition Datasets
Patrick Meyer, Eric Buschermöhle, Tim Fingscheidt
What to Expect from Expected Kneser-Ney Smoothing
Michael Levit, Sarangarajan Parthasarathy, Shuangyu Chang
Whispered Speech to Neutral Speech Conversion Using Bidirectional LSTMs
G. Nisha Meenakshi, Prasanta Kumar Ghosh
Whistle-blowing ASRs: Evaluating the Need for More Inclusive Speech Recognition Systems
Meredith Moore, Hemanth Venkateswara, Sethuraman Panchanathan
Who Are You Listening to? Towards a Dynamic Measure of Auditory Attention to Speech-on-speech.
Moïra-Phoebé Huet, Christophe Micheyl, Etienne Gaudrain et al.
Who Said That? a Comparative Study of Non-negative Matrix Factorization Techniques
Teun Krikke, Frank Broz, David Lane
Wide Learning for Auditory Comprehension
Elnaz Shafaei-Bajestan, R. Harald Baayen
Word Emphasis Prediction for Expressive Text to Speech
Yosi Mass, Slava Shechtman, Moran Mordechay et al.
Wuxi Speakers’ Production and Perception of Coda Nasals in Mandarin
Lei Wang, Jie Cui, Ying Chen
ZCU-NTIS Speaker Diarization System for the DIHARD 2018 Challenge
Zbyněk Zajíc, Marie Kunešová, Jan Zelinka et al.
2016 BUT Babel System: Multilingual BLSTM Acoustic Model with i-Vector Based Adaptation
Martin Karafiát, Murali Karthick Baskar, Pavel Matějka et al.
A Batch Noise Contrastive Estimation Approach for Training Large Vocabulary Language Models
Youssef Oualil, Dietrich Klakow
Accurate Synchronization of Speech and EGG Signal Using Phase Information
Sunil Kumar S.B., K. Sreenivasa Rao, Tanumay Mandal
A Comparative Evaluation of GMM-Free State Tying Methods for ASR
Tamás Grósz, Gábor Gosztolya, László Tóth
A Comparison of Danish Listeners’ Processing Cost in Judging the Truth Value of Norwegian, Swedish, and English Sentences
Ocke-Schwen Bohn, Trine Askjær-Jørgensen
A Comparison of Perceptually Motivated Loss Functions for Binary Mask Estimation in Speech Separation
Danny Websdale, Ben Milner