Research Explorer

Frame and Segment Level Recurrent Neural Networks for Phone Classification

Martin Ratajczak, Sebastian Tschiatschek, Franz Pernkopf

2017 INTERSPEECH

Frame-Wise Dynamic Threshold Based Polyphonic Acoustic Event Detection

Xianjun Xia, Roberto Togneri, Ferdous Sohel et al.

2017 INTERSPEECH

Functional Principal Component Analysis of Vocal Tract Area Functions

Jorge C. Lucero

2017 INTERSPEECH

Gain Compensation for Fast i-Vector Extraction Over Short Duration

Kong Aik Lee, Haizhou Li

2017 INTERSPEECH

Gate Activation Signal Analysis for Gated Recurrent Neural Networks and its Correlation with Phoneme Boundaries

Yu-Hsuan Wang, Cheng-Tao Chung, Hung-Yi Lee

2017 INTERSPEECH

Gaussian Prediction Based Attention for Online End-to-End Speech Recognition

Junfeng Hou, Shiliang Zhang, Li-Rong Dai

2017 INTERSPEECH

Generalized Distillation Framework for Speaker Normalization

Neethu Mariam Joy, Sandeep Reddy Kothinti, S. Umesh et al.

2017 INTERSPEECH

Generation of Large-Scale Simulated Utterances in Virtual Rooms to Train Deep-Neural Networks for Far-Field Speech Recognition in Google Home

Chanwoo Kim, Ananya Misra, Kean Chin et al.

2017 INTERSPEECH

Generative Adversarial Network-Based Glottal Waveform Model for Statistical Parametric Speech Synthesis

Bajibabu Bollepalli, Lauri Juvela, Paavo Alku

2017 INTERSPEECH

Generative Adversarial Network-Based Postfilter for STFT Spectrograms

Takuhiro Kaneko, Shinji Takaki, Hirokazu Kameoka et al.

2017 INTERSPEECH

Global SNR Estimation of Speech Signals for Unknown Noise Conditions Using Noise Adapted Non-Linear Regression

Pavlos Papadopoulos, Ruchir Travadi, Shrikanth S. Narayanan

2017 INTERSPEECH

Global Syllable Vectors for Building TTS Front-End with Deep Learning

Jinfu Ni, Yoshinori Shiga, Hisashi Kawai

2017 INTERSPEECH

Glottal Model Based Speech Beamforming for ad-hoc Microphone Arrays

Yang Zhang, Dinei Florêncio, Mark Hasegawa-Johnson

2017 INTERSPEECH

Glottal Opening and Strategies of Production of Fricatives

Benjamin Elie, Yves Laprie

2017 INTERSPEECH

Glottal Source Estimation from Coded Telephone Speech Using a Deep Neural Network

N.P. Narendra, Manu Airaksinen, Paavo Alku

2017 INTERSPEECH

Glottal Source Features for Automatic Speech-Based Depression Assessment

Olympia Simantiraki, Paulos Charonyktakis, Anastasia Pampouchidou et al.

2017 INTERSPEECH

Google’s Next-Generation Real-Time Unit-Selection Synthesizer Using Sequence-to-Sequence LSTM-Based Autoencoders

Vincent Wan, Yannis Agiomyrgiannakis, Hanna Silen et al.

2017 INTERSPEECH

Harvest: A High-Performance Fundamental Frequency Estimator from Speech Signals

Masanori Morise

2017 INTERSPEECH

Hidden Markov Model Variational Autoencoder for Acoustic Unit Discovery

Janek Ebbers, Jahn Heymann, Lukas Drude et al.

2017 INTERSPEECH

Hierarchical Constrained Bayesian Optimization for Feature, Acoustic Model and Decoder Parameter Optimization

Akshay Chandrashekaran, Ian Lane

2017 INTERSPEECH

Hierarchical LSTMs with Joint Learning for Estimating Customer Satisfaction from Contact Center Calls

Atsushi Ando, Ryo Masumura, Hosana Kamiyama et al.

2017 INTERSPEECH

Hierarchical Recurrent Neural Network for Story Segmentation

Emiru Tsunoo, Peter Bell, Steve Renals

2017 INTERSPEECH

Highway-LSTM and Recurrent Highway Networks for Speech Recognition

Golan Pundak, Tara N. Sainath

2017 INTERSPEECH

HomeBank: A Repository for Long-Form Real-World Audio Recordings of Children

Anne S. Warlaumont, Mark VanDam, Elika Bergelson et al.

2017 INTERSPEECH

Homogeneity Measure Impact on Target and Non-Target Trials in Forensic Voice Comparison

Moez Ajili, Jean-François Bonastre, Waad Ben Kheder et al.

2017 INTERSPEECH

Papers