Papers
VESUS: A Crowd-Annotated Database to Study Emotion Production and Perception in Spoken English
Jacob Sager, Ravi Shankar, Jacob Reinhold et al.
Video-Driven Speech Reconstruction Using Generative Adversarial Networks
Konstantinos Vougioukas, Pingchuan Ma, Stavros Petridis et al.
Vietnamese Learners Tackling the German /ʃt/ in Perception
Anke Sennema, Silke Hamann
Visualization and Interpretation of Latent Spaces for Controlling Expressive Speech Synthesis Through Audio Analysis
Noé Tits, Fengna Wang, Kevin El Haddad et al.
ViVoLAB Speaker Diarization System for the DIHARD 2019 Challenge
Ignacio Viñals, Pablo Gimeno, Alfonso Ortega et al.
Vocal Biomarker Assessment Following Pediatric Traumatic Brain Injury: A Retrospective Cohort Study
Camille Noufi, Adam C. Lammert, Daryush D. Mehta et al.
Vocal Pitch Extraction in Polyphonic Music Using Convolutional Residual Network
Mingye Dong, Jie Wu, Jian Luan
VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking
Quan Wang, Hannah Muckenhirn, Kevin Wilson et al.
VoiceID Loss: Speech Enhancement for Speaker Verification
Suwon Shon, Hao Tang, James Glass
Voice Quality and Between-Frame Entropy for Sleepiness Estimation
Vijay Ravi, Soo Jin Park, Amber Afshan et al.
Voice Quality as a Turn-Taking Cue
Mattias Heldner, Marcin Włodarczak, Štefan Beňuš et al.
Vowels and Diphthongs in the Xupu Xiang Chinese Dialect
Zhenrui Zhang, Fang Hu
Vowel-Tone Interaction in Two Tibeto-Burman Languages
Wendy Lalhminghlui, Viyazonuo Terhiija, Priyankoo Sarmah
VQVAE Unsupervised Unit Discovery and Multi-Scale Code2Spec Inverter for Zerospeech Challenge 2019
Andros Tjandra, Berrak Sisman, Mingyang Zhang et al.
V-to-V Coarticulation Induced Acoustic and Articulatory Variability of Vowels: The Effect of Pitch-Accent
Andrea Deme, Márton Bartók, Tekla Etelka Gráczi et al.
wav2vec: Unsupervised Pre-Training for Speech Recognition
Steffen Schneider, Alexei Baevski, Ronan Collobert et al.
Weakly Supervised Syllable Segmentation by Vowel-Consonant Peak Classification
Ravi Shankar, Archana Venkataraman
Web-Based Speech Synthesis Editor
Martin Grůber, Jakub Vít, Jindřich Matoušek
WHAM!: Extending Speech Separation to Noisy Environments
Gordon Wichern, Joe Antognini, Michael Flynn et al.
Whether to Pretrain DNN or not?: An Empirical Analysis for Voice Conversion
Nirmesh J. Shah, Hardik B. Sailor, Hemant A. Patil
Which Ones Are Speaking? Speaker-Inferred Model for Multi-Talker Speech Separation
Jing Shi, Jiaming Xu, Bo Xu
Whisper to Neutral Mapping Using Cosine Similarity Maximization in i-Vector Space for Speaker Verification
Abinay Reddy Naini, Achuth Rao M.V., Prasanta Kumar Ghosh
Who Needs Words? Lexicon-Free Speech Recognition
Tatiana Likhomanenko, Gabriel Synnaeve, Ronan Collobert
Who Said That?: Audio-Visual Speaker Diarisation of Real-World Meetings
Joon Son Chung, Bong-Jin Lee, Icksang Han
x-Vector DNN Refinement with Full-Length Recordings for Speaker Recognition
Daniel Garcia-Romero, David Snyder, Gregory Sell et al.