Papers
Frame and Segment Level Recurrent Neural Networks for Phone Classification
Martin Ratajczak, Sebastian Tschiatschek, Franz Pernkopf
Frame-Wise Dynamic Threshold Based Polyphonic Acoustic Event Detection
Xianjun Xia, Roberto Togneri, Ferdous Sohel et al.
Gain Compensation for Fast i-Vector Extraction Over Short Duration
Kong Aik Lee, Haizhou Li
Gate Activation Signal Analysis for Gated Recurrent Neural Networks and its Correlation with Phoneme Boundaries
Yu-Hsuan Wang, Cheng-Tao Chung, Hung-Yi Lee
Gaussian Prediction Based Attention for Online End-to-End Speech Recognition
Junfeng Hou, Shiliang Zhang, Li-Rong Dai
Generalized Distillation Framework for Speaker Normalization
Neethu Mariam Joy, Sandeep Reddy Kothinti, S. Umesh et al.
Generation of Large-Scale Simulated Utterances in Virtual Rooms to Train Deep-Neural Networks for Far-Field Speech Recognition in Google Home
Chanwoo Kim, Ananya Misra, Kean Chin et al.
Generative Adversarial Network-Based Glottal Waveform Model for Statistical Parametric Speech Synthesis
Bajibabu Bollepalli, Lauri Juvela, Paavo Alku
Generative Adversarial Network-Based Postfilter for STFT Spectrograms
Takuhiro Kaneko, Shinji Takaki, Hirokazu Kameoka et al.
Global SNR Estimation of Speech Signals for Unknown Noise Conditions Using Noise Adapted Non-Linear Regression
Pavlos Papadopoulos, Ruchir Travadi, Shrikanth S. Narayanan
Global Syllable Vectors for Building TTS Front-End with Deep Learning
Jinfu Ni, Yoshinori Shiga, Hisashi Kawai
Glottal Model Based Speech Beamforming for ad-hoc Microphone Arrays
Yang Zhang, Dinei Florêncio, Mark Hasegawa-Johnson
Glottal Opening and Strategies of Production of Fricatives
Benjamin Elie, Yves Laprie
Glottal Source Estimation from Coded Telephone Speech Using a Deep Neural Network
N.P. Narendra, Manu Airaksinen, Paavo Alku
Glottal Source Features for Automatic Speech-Based Depression Assessment
Olympia Simantiraki, Paulos Charonyktakis, Anastasia Pampouchidou et al.
Google’s Next-Generation Real-Time Unit-Selection Synthesizer Using Sequence-to-Sequence LSTM-Based Autoencoders
Vincent Wan, Yannis Agiomyrgiannakis, Hanna Silen et al.
Hidden Markov Model Variational Autoencoder for Acoustic Unit Discovery
Janek Ebbers, Jahn Heymann, Lukas Drude et al.
Hierarchical Constrained Bayesian Optimization for Feature, Acoustic Model and Decoder Parameter Optimization
Akshay Chandrashekaran, Ian Lane
Hierarchical LSTMs with Joint Learning for Estimating Customer Satisfaction from Contact Center Calls
Atsushi Ando, Ryo Masumura, Hosana Kamiyama et al.
Hierarchical Recurrent Neural Network for Story Segmentation
Emiru Tsunoo, Peter Bell, Steve Renals
Highway-LSTM and Recurrent Highway Networks for Speech Recognition
Golan Pundak, Tara N. Sainath
HomeBank: A Repository for Long-Form Real-World Audio Recordings of Children
Anne S. Warlaumont, Mark VanDam, Elika Bergelson et al.
Homogeneity Measure Impact on Target and Non-Target Trials in Forensic Voice Comparison
Moez Ajili, Jean-François Bonastre, Waad Ben Kheder et al.