Papers
Exploiting Phone Log-Likelihood Ratio Features for the Detection of the Native Language of Non-Native English Speakers
Alberto Abad, Eugénio Ribeiro, Fábio Kepler et al.
Exploring Collections of Multimedia Archives Through Innovative Interfaces in the Context of Digital Humanities
Géraldine Damnati, Delphine Charlet, Marc Denjean
Exploring Session Variability and Template Aging in Speaker Verification for Fixed Phrase Short Utterances
Rohan Kumar Das, Sarfaraz Jelil, S.R. Mahadeva Prasanna
Exploring the Correlation of Pitch Accents and Semantic Slots for Spoken Language Understanding
Sabrina Stehwien, Ngoc Thang Vu
Exploring Word Mover’s Distance and Semantic-Aware Embedding Techniques for Extractive Broadcast News Summarization
Shih-Hung Liu, Kuan-Yu Chen, Yu-Lun Hsieh et al.
Expressive Control of Singing Voice Synthesis Using Musical Contexts and a Parametric F0 Model
Luc Ardaillon, Celine Chabot-Canet, Axel Roebel
Expressive Singing Synthesis Based on Unit Selection for the Singing Synthesis Challenge 2016
Jordi Bonada, Martí Umbert, Merlijn Blaauw
Expressive Speech Driven Talking Avatar Synthesis with DBLSTM Using Limited Amount of Emotional Bimodal Data
Xu Li, Zhiyong Wu, Helen Meng et al.
F0 Contour Analysis Based on Empirical Mode Decomposition for DNN Acoustic Modeling in Mandarin Speech Recognition
Xiaoyun Wang, Xugang Lu, Hisashi Kawai et al.
Facing Realism in Spontaneous Emotion Recognition from Speech: Feature Enhancement by Autoencoder with LSTM Neural Networks
Zixing Zhang, Fabien Ringeval, Jing Han et al.
Factor Analysis Based Speaker Normalisation for Continuous Emotion Prediction
Ting Dang, Vidhyasaharan Sethu, Eliathamby Ambikairajah
Factor Analysis Based Speaker Verification Using ASR
Hang Su, Steven Wegmann
Factorized Linear Input Network for Acoustic Model Adaptation in Noisy Conditions
Dung T. Tran, Marc Delcroix, Atsunori Ogawa et al.
Factors Affecting the Intelligibility of Sine-Wave Speech
Fei Chen, Daniel Fogerty
Far-Field ASR Without Parallel Data
Vijayaditya Peddinti, Vimal Manohar, Yiming Wang et al.
Fast, Compact, and High Quality LSTM-RNN Based Statistical Parametric Speech Synthesizers for Mobile Devices
Heiga Zen, Yannis Agiomyrgiannakis, Niels Egberts et al.
Feature Learning and Automatic Segmentation for Dolphin Communication Analysis
Daniel Kohlsdorf, Denise Herzing, Thad Starner
Feature Learning with Raw-Waveform CLDNNs for Voice Activity Detection
Ruben Zazo, Tara N. Sainath, Gabor Simko et al.
First Step Towards End-to-End Parametric TTS Synthesis: Generating Spectral Parameters with Neural Attention
Wenfu Wang, Shuang Xu, Bo Xu
Flexible, Rapid Authoring of Goal-Orientated, Multi-Turn Dialogues Using the Task Completion Platform
Alex Marin, Paul Crook, Omar Zia Khan et al.
Formant Estimation and Tracking Using Deep Learning
Yehoshua Dissen, Joseph Keshet
Frequency Estimation from Waveforms Using Multi-Layered Neural Networks
Prateek Verma, Ronald W. Schafer
Fusing Acoustic Feature Representations for Computational Paralinguistics Tasks
Heysem Kaya, Alexey A. Karpov