Papers
Articulatory Synthesis Based on Real-Time Magnetic Resonance Imaging Data
Asterios Toutios, Tanner Sorensen, Krishna Somandepalli et al.
Articulatory-to-Acoustic Conversion with Cascaded Prediction of Spectral and Excitation Features Using Neural Networks
Zheng-Chen Liu, Zhen-Hua Ling, Li-Rong Dai
Artificial Neural Network-Based Feature Combination for Spatial Voice Activity Detection
Stefan Meier, Walter Kellermann
A Sequence-to-Sequence Model for User Simulation in Spoken Dialogue Systems
Layla El Asri, Jing He, Kaheer Suleman
A Sparse Spherical Harmonic-Based Model in Subbands for Head-Related Transfer Functions
Xiaoke Qi, Jianhua Tao
A Speaker Diarization System for Studying Peer-Led Team Learning Groups
Harishchandra Dubey, Lakshmish Kaushik, Abhijeet Sangwan et al.
A Speaker Recognition System for the SITW Challenge
Oleg Kudashev, Sergey Novoselov, Konstantin Simonchik et al.
A Spectral Modulation Sensitivity Weighted Pre-Emphasis Filter for Active Noise Control System
Kah-Meng Cheong, Yuh-Yuan Wang, Tai-Shih Chi
ASR Confidence Estimation with Speaker-Adapted Recurrent Neural Networks
Miguel Ángel del-Agua, Santiago Piqueras, Adrià Giménez et al.
ASR for South Slavic Languages Developed in Almost Automated Way
Jan Nouza, Radek Safarik, Petr Cerva
Assessing Idiosyncrasies in a Bayesian Model of Speech Communication
Marie-Lou Barnaud, Julien Diard, Pierre Bessière et al.
Assessing Level-Dependent Segmental Contribution to the Intelligibility of Speech Processed by Single-Channel Noise-Suppression Algorithms
Tian Guan, Guangxing Chu, Fei Chen et al.
Assessing Speech Quality in Speech-Aware Hearing Aids Based on Phoneme Posteriorgrams
Constantin Spille, Hendrik Kayser, Hynek Hermansky et al.
A Step Beyond Local Observations with a Dialog Aware Bidirectional GRU Network for Spoken Language Understanding
Vedran Vukotić, Christian Raymond, Guillaume Gravier
A Stochastic Model for Computer-Aided Human-Human Dialogue
Merwan Barlier, Romain Laroche, Olivier Pietquin
A Template-Based Approach for Speech Synthesis Intonation Generation Using LSTMs
Srikanth Ronanki, Gustav Eje Henter, Zhizheng Wu et al.
Attention Assisted Discovery of Sub-Utterance Structure in Speech Emotion Recognition
Che-Wei Huang, Shrikanth S. Narayanan
Attention-Based Convolutional Neural Networks for Sentence Classification
Zhiwei Zhao, Youzheng Wu
At the Border of Acoustics and Linguistics: Bag-of-Audio-Words for the Recognition of Emotions in Speech
Maximilian Schmitt, Fabien Ringeval, Björn Schuller
Audio-Based Distributional Representations of Meaning Using a Fusion of Feature Encodings
Giannis Karamanolakis, Elias Iosif, Athanasia Zlatintsi et al.
Audio-to-Visual Speech Conversion Using Deep Neural Networks
Sarah Taylor, Akihiro Kato, Iain Matthews et al.
Audio-Visual Speech Recognition Using Bimodal-Trained Bottleneck Features for a Person with Severe Hearing Loss
Yuki Takashima, Ryo Aihara, Tetsuya Takiguchi et al.
Audiovisual Speech Scene Analysis in the Context of Competing Sources
Attigodu C. Ganesh, Frédéric Berthommier, Jean-Luc Schwartz