Papers
Articulatory-to-speech Conversion Using Bi-directional Long Short-term Memory
Fumiaki Taguchi, Tokihiko Kaburagi
Artificial Bandwidth Extension with Memory Inclusion Using Semi-supervised Stacked Auto-encoders
Pramod Bachhav, Massimiliano Todisco, Nicholas Evans
ASe: Acoustic Scene Embedding Using Deep Archetypal Analysis and GMM
Pulkit Sharma, Vinayak Abrol, Anshul Thakur
A Shifted Delta Coefficient Objective for Monaural Speech Separation Using Multi-task Learning
Chenglin Xu, Wei Rao, Eng Siong Chng et al.
A Simple Model for Detection of Rare Sound Events
Weiran Wang, Chieh-Chi Kao, Chao Wang
Assessing Speaker Engagement in 2-Person Debates: Overlap Detection in United States Presidential Debates
Midia Yousefi, Navid Shokouhi, John H.L. Hansen
A Study of Enhancement, Augmentation and Autoencoder Methods for Domain Adaptation in Distant Speech Recognition
Hao Tang, Wei-Ning Hsu, François Grondin et al.
A Study of Lexical and Prosodic Cues to Segmentation in a Hindi-English Code-switched Discourse
Preeti Rao, Mugdha Pandya, Kamini Sabu et al.
A Study of Objective Measurement of Comprehensibility through Native Speakers' Shadowing of Learners' Utterances
Yusuke Inoue, Suguru Kabashima, Daisuke Saito et al.
A Three-Layer Emotion Perception Model for Valence and Arousal-Based Detection from Multilingual Speech
Xingfeng Li, Masato Akagi
Attention-based End-to-End Models for Small-Footprint Keyword Spotting
Changhao Shan, Junbo Zhang, Yujun Wang et al.
Attention-based Sequence Classification for Affect Detection
Cristina Gorrostieta, Richard Brutti, Kye Taylor et al.
Attentive Statistics Pooling for Deep Speaker Embedding
Koji Okabe, Takafumi Koshinaka, Koichi Shinoda
Audio-Visual Prediction of Head-Nod and Turn-Taking Events in Dyadic Interactions
Bekir Berker Türker, Engin Erzin, Yücel Yemez et al.
Audiovisual Speech Activity Detection with Advanced Long Short-Term Memory
Fei Tao, Carlos Busso
Audio-visual Voice Conversion Using Deep Canonical Correlation Analysis for Deep Bottleneck Features
Satoshi Tamura, Kento Horio, Hajime Endo et al.
Auditory Filterbank Learning for Temporal Modulation Features in Replay Spoof Speech Detection
Hardik Sailor, Madhu Kamble, Hemant Patil
Auditory Filterbank Learning Using ConvRBM for Infant Cry Classification
Hardik B. Sailor, Hemant Patil
auMina™ - Enterprise Speech Analytics
Umesh Sachdev, Rajagopal Jayaraman, Zainab Millwala
A Unified Framework for the Generation of Glottal Signals in Deep Learning-based Parametric Speech Synthesis Systems
Min-Jae Hwang, Eunwoo Song, Jin-Seob Kim et al.
Automated Classification of Children’s Linguistic versus Non-Linguistic Vocalisations
Zixing Zhang, Alejandrina Cristia, Anne Warlaumont et al.
Automated Classification of Vowel-Gesture Parameters Using External Broadband Excitation
Balamurali B T, Jer-Ming Chen
Automatically Measuring L2 Speech Fluency without the Need of ASR: A Proof-of-concept Study with Japanese Learners of French
Lionel Fontan, Maxime Le Coz, Sylvain Detey