Papers
Generative Adversarial Network-Based Postfilter for STFT Spectrograms
Takuhiro Kaneko, Shinji Takaki, Hirokazu Kameoka et al.
Global SNR Estimation of Speech Signals for Unknown Noise Conditions Using Noise Adapted Non-Linear Regression
Pavlos Papadopoulos, Ruchir Travadi, Shrikanth S. Narayanan
Global Syllable Vectors for Building TTS Front-End with Deep Learning
Jinfu Ni, Yoshinori Shiga, Hisashi Kawai
Glottal Model Based Speech Beamforming for ad-hoc Microphone Arrays
Yang Zhang, Dinei Florêncio, Mark Hasegawa-Johnson
Glottal Opening and Strategies of Production of Fricatives
Benjamin Elie, Yves Laprie
Glottal Source Estimation from Coded Telephone Speech Using a Deep Neural Network
N.P. Narendra, Manu Airaksinen, Paavo Alku
Glottal Source Features for Automatic Speech-Based Depression Assessment
Olympia Simantiraki, Paulos Charonyktakis, Anastasia Pampouchidou et al.
Google’s Next-Generation Real-Time Unit-Selection Synthesizer Using Sequence-to-Sequence LSTM-Based Autoencoders
Vincent Wan, Yannis Agiomyrgiannakis, Hanna Silen et al.
Hidden Markov Model Variational Autoencoder for Acoustic Unit Discovery
Janek Ebbers, Jahn Heymann, Lukas Drude et al.
Hierarchical Constrained Bayesian Optimization for Feature, Acoustic Model and Decoder Parameter Optimization
Akshay Chandrashekaran, Ian Lane
Hierarchical LSTMs with Joint Learning for Estimating Customer Satisfaction from Contact Center Calls
Atsushi Ando, Ryo Masumura, Hosana Kamiyama et al.
Hierarchical Recurrent Neural Network for Story Segmentation
Emiru Tsunoo, Peter Bell, Steve Renals
Highway-LSTM and Recurrent Highway Networks for Speech Recognition
Golan Pundak, Tara N. Sainath
HomeBank: A Repository for Long-Form Real-World Audio Recordings of Children
Anne S. Warlaumont, Mark VanDam, Elika Bergelson et al.
Homogeneity Measure Impact on Target and Non-Target Trials in Forensic Voice Comparison
Moez Ajili, Jean-François Bonastre, Waad Ben Kheder et al.
How are Four-Level Length Distinctions Produced? Evidence from Moroccan Arabic
Giuseppina Turco, Karim Shoul, Rachid Ridouane
How Does the Absence of Shared Knowledge Between Interlocutors Affect the Production of French Prosodic Forms?
Amandine Michelas, Cecile Cau, Maud Champagne-Lavau
How Long is Too Long? How Pause Features After Requests Affect the Perceived Willingness of Affirmative Answers
Lea S. Kohtz, Oliver Niebuhr
Human and Automated Scoring of Fluency, Pronunciation and Intonation During Human–Machine Spoken Dialog Interactions
Vikram Ramanarayanan, Patrick L. Lange, Keelan Evanini et al.
Humans do not Maximize the Probability of Correct Decision When Recognizing DANTALE Words in Noise
Mohsen Zareian Jahromi, Jan Østergaard, Jesper Jensen
Hybrid Acoustic-Lexical Deep Learning Approach for Deception Detection
Gideon Mendels, Sarah Ita Levitan, Kai-Zhan Lee et al.
Hyperarticulation of Corrections in Multilingual Dialogue Systems
Ivan Kraljevski, Diane Hirschfeld
Hypernasality Severity Analysis in Cleft Lip and Palate Speech Using Vowel Space Area
Nikitha K., Sishir Kalita, C.M. Vikram et al.
Ideal Ratio Mask Estimation Using Deep Neural Networks for Monaural Speech Segregation in Noisy Reverberant Conditions
Xu Li, Junfeng Li, Yonghong Yan