Papers
An RNN-Based Quantized F0 Model with Multi-Tier Feedback Links for Text-to-Speech Synthesis
Xin Wang, Shinji Takaki, Junichi Yamagishi
An RNN Model of Text Normalization
Richard Sproat, Navdeep Jaitly
An Ultrasound Study of Alveolar and Retroflex Consonants in Arrernte: Stressed and Unstressed Syllables
Marija Tabain, Richard Beare
Apkinson — A Mobile Monitoring Solution for Parkinson’s Disease
Philipp Klumpp, Thomas Janu, Tomás Arias-Vergara et al.
A Post-Filtering Approach Based on Locally Linear Embedding Difference Compensation for Speech Enhancement
Yi-Chiao Wu, Hsin-Te Hwang, Syu-Siang Wang et al.
Applications of the BBN Sage Speech Processing Platform
Ralf Meermeier, Sean Colbath
Approaches for Neural-Network Language Model Adaptation
Min Ma, Michael Nirschl, Fadi Biadsy et al.
Approaching Human Performance in Behavior Estimation in Couples Therapy Using Deep Sentence Embeddings
Shao-Yen Tseng, Brian Baucom, Panayiotis Georgiou
Approximated and Domain-Adapted LSTM Language Models for First-Pass Decoding in Speech Recognition
Mittul Singh, Youssef Oualil, Dietrich Klakow
Approximating Phonotactic Input in Children’s Linguistic Environments from Orthographic Transcripts
Sofia Strömbergsson, Jens Edlund, Jana Götze et al.
A Preliminary Phonetic Investigation of Alphabetic Words in Mandarin Chinese
Hongwei Ding, Yuanyuan Zhang, Hongchao Liu et al.
A Preliminary Study of Prosodic Disambiguation by Chinese EFL Learners
Yuanyuan Zhang, Hongwei Ding
A Quantitative Measure of the Impact of Coarticulation on Phone Discriminability
Thomas Schatz, Rory Turnbull, Francis Bach et al.
Areal and Phylogenetic Features for Multilingual Speech Synthesis
Alexander Gutkin, Richard Sproat
A Relevance Score Estimation for Spoken Term Detection Based on RNN-Generated Pronunciation Embeddings
Jan Švec, Josef V. Psutka, Luboš Šmídl et al.
A Rescoring Approach for Keyword Search Using Lattice Context Information
Zhipeng Chen, Ji Wu
A Robust and Alternative Approach to Zero Frequency Filtering Method for Epoch Extraction
P. Gangamohan, B. Yegnanarayana
A Robust Medical Speech-to-Speech/Speech-to-Sign Phraselator
Farhia Ahmed, Pierrette Bouillon, Chelle Destefano et al.
A Robust Voiced/Unvoiced Phoneme Classification from Whispered Speech Using the ‘Color’ of Whispered Phonemes and Deep Neural Network
G. Nisha Meenakshi, Prasanta Kumar Ghosh
Articulation Rate in Swedish Child-Directed Speech Increases as a Function of the Age of the Child Even When Surprisal is Controlled for
Johan Sjons, Thomas Hörberg, Robert Östling et al.
Articulatory Text-to-Speech Synthesis Using the Digital Waveguide Mesh Driven by a Deep Neural Network
Amelia J. Gully, Takenori Yoshimura, Damian T. Murphy et al.
A Semi-Polar Grid Strategy for the Three-Dimensional Finite Element Simulation of Vowel-Vowel Sequences
Marc Arnela, Saeed Dabbaghchian, Oriol Guasch et al.
A Semi-Supervised Learning Approach for Acoustic-Prosodic Personality Perception in Under-Resourced Domains
Rubén Solera-Ureña, Helena Moniz, Fernando Batista et al.
A Signal Processing Approach for Speaker Separation Using SFF Analysis
Nivedita Chennupati, B.H.V.S. Narayana Murthy, B. Yegnanarayana