Papers
Novel Shifted Real Spectrum for Exact Signal Reconstruction
Meet H. Soni, Rishabh Tak, Hemant A. Patil
Novel Variable Length Teager Energy Separation Based Instantaneous Frequency Features for Replay Detection
Hemant A. Patil, Madhu R. Kamble, Tanvina B. Patel et al.
NTCD-TIMIT: A New Database and Baseline for Noise-Robust Audio-Visual Speech Recognition
Ahmed Hussen Abdelaziz
Nuance - Politecnico di Torino’s 2016 NIST Speaker Recognition Evaluation System
Daniele Colibro, Claudio Vair, Emanuele Dalmasso et al.
Null-Hypothesis LLR: A Proposal for Forensic Automatic Speaker Recognition
Yosef A. Solewicz, Michael Jessen, David van der Vloed
Objective Severity Assessment from Disordered Voice Using Estimated Glottal Airflow
Yu-Ren Chien, Michal Borský, Jón Guðnason
Occupancy Detection in Commercial and Residential Environments Using Audio Signal
Shabnam Ghaffarzadegan, Attila Reiss, Mirko Ruhs et al.
Off-Topic Spoken Response Detection Using Siamese Convolutional Neural Networks
Chong Min Lee, Su-Youn Yoon, Xihao Wang et al.
Off-Topic Spoken Response Detection with Word Embeddings
Su-Youn Yoon, Chong Min Lee, Ikkyu Choi et al.
On Building Mixed Lingual Speech Synthesis Systems
SaiKrishna Rallabandi, Alan W. Black
On Design of Robust Deep Models for CHiME-4 Multi-Channel Speech Recognition with Multiple Configurations of Array Microphones
Yan-Hui Tu, Jun Du, Lei Sun et al.
On Improving Acoustic Models for TORGO Dysarthric Speech Database
Neethu Mariam Joy, S. Umesh, Basil Abraham
Online Adaptation of an Attention-Based Neural Network for Natural Language Generation
Matthieu Riou, Bassam Jabaian, Stéphane Huet et al.
Online End-of-Turn Detection from Speech Based on Stacked Time-Asynchronous Sequential Networks
Ryo Masumura, Taichi Asami, Hirokazu Masataki et al.
On Multi-Domain Training and Adaptation of End-to-End RNN Acoustic Models for Distant Speech Recognition
Seyedmahdad Mirsamadi, John H.L. Hansen
On the Duration of Mandarin Tones
Jing Yang, Yu Zhang, Aijun Li et al.
On the Influence of Modifying Magnitude and Phase Spectrum to Enhance Noisy Speech Signals
Hans-Günter Hirsch, Michael Gref
On the Quality and Intelligibility of Noisy Speech Processed for Near-End Listening Enhancement
Tudor-Cătălin Zorilă, Yannis Stylianou
On the Use of Band Importance Weighting in the Short-Time Objective Intelligibility Measure
Asger Heidemann Andersen, Jan Mark de Haan, Zheng-Hua Tan et al.
OpenMM: An Open-Source Multimodal Feature Extraction Tool
Michelle Renee Morales, Stefan Scherer, Rivka Levitan
Opinion Dynamics Modeling for Movie Review Transcripts Classification with Hidden Conditional Random Fields
Valentin Barriere, Chloé Clavel, Slim Essid
Optimizing DNN Adaptation for Recognition of Enhanced Speech
Marco Matassoni, Alessio Brutti, Daniele Falavigna