Papers
Using Text and Acoustic Features in Predicting Glottal Excitation Waveforms for Parametric Speech Synthesis with Recurrent Neural Networks
Lauri Juvela, Xin Wang, Shinji Takaki et al.
Using Zero-Frequency Resonator to Extract Multilingual Intonation Structure
Jinfu Ni, Yoshinori Shiga, Hisashi Kawai
Utterance Verification for Text-Dependent Speaker Recognition: A Comparative Assessment Using the RedDots Corpus
Tomi Kinnunen, Md. Sahidullah, Ivan Kukanov et al.
Variation in Spoken North Sami Language
Kristiina Jokinen, Trung Ngo Trong, Ville Hautamäki
Velum Control for Oral Sounds
Reed Blaylock, Louis Goldstein, Shrikanth S. Narayanan
Virtual Adversarial Training Applied to Neural Higher-Order Factors for Phone Classification
Martin Ratajczak, Sebastian Tschiatschek, Franz Pernkopf
Virtual Machines and Containers as a Platform for Experimentation
Florian Metze, Eric Riebling, Anne S. Warlaumont et al.
Visual Speech Synthesis Using Dynamic Visemes, Contextual Features and DNNs
Ausdang Thangthai, Ben Milner, Sarah Taylor
Vocal Effort Modification for Singing Synthesis
Olivier Perrotin, Christophe d’Alessandro
Vocal Tract Length Normalization for Speaker Independent Acoustic-to-Articulatory Speech Inversion
Ganesh Sivaraman, Vikramjit Mitra, Hosung Nam et al.
Voice Conversion Based on Matrix Variate Gaussian Mixture Model Using Multiple Frame Features
Yi Yang, Hidetsugu Uchida, Daisuke Saito et al.
Voice Conversion Based on Trajectory Model Training of Neural Networks Considering Global Variance
Naoki Hosaka, Kei Hashimoto, Keiichiro Oura et al.
Voice Quality Control Using Perceptual Expressions for Statistical Parametric Speech Synthesis Based on Cluster Adaptive Training
Yamato Ohtani, Koichiro Mori, Masahiro Morita
Voice-Quality Difference Between the Vowels in Filled Pauses and Ordinary Lexical Items
Kikuo Maekawa, Hiroki Mori
Voting Detector: A Combination of Anomaly Detectors to Reveal Annotation Errors in TTS Corpora
Jindřich Matoušek, Daniel Tihelka
Vowel Characteristics in the Assessment of L2 English Pronunciation
Calbert Graham, Paula Buttery, Francis Nolan
Vowels and Diphthongs in Cangnan Southern Min Chinese Dialect
Fang Hu, Chunyu Ge
Vowels and Diphthongs in the Taiyuan Jin Chinese Dialect
Liping Xia, Fang Hu
Waveform Generation Based on Signal Reshaping for Statistical Parametric Speech Synthesis
Felipe Espic, Cassia Valentini-Botinhao, Zhizheng Wu et al.
webASR 2 — Improved Cloud Based Speech Technology
Thomas Hain, Jeremy Christian, Oscar Saz et al.
Web Data Selection Based on Word Embedding for Low-Resource Speech Recognition
Chuandong Xie, Wu Guo, Guoping Hu et al.
Who Do You Think Will Speak Next? Perception of Turn-Taking Cues in Slovak and Argentine Spanish
Agustín Gravano, Pablo Brusco, Štefan Beňuš
Why do ASR Systems Despite Neural Nets Still Depend on Robust Features
Angel Mario Castro Martinez, Marc René Schädler