Papers
Improved End-to-End Speech Emotion Recognition Using Self Attention Mechanism and Multitask Learning
Yuanchao Li, Tianyu Zhao, Tatsuya Kawahara
Improved Low-Resource Somali Speech Recognition by Semi-Supervised Acoustic and Language Model Training
Astik Biswas, Raghav Menon, Ewald van der Westhuizen et al.
Improved Speaker-Dependent Separation for CHiME-5 Challenge
Jian Wu, Yong Xu, Shi-Xiong Zhang et al.
Improved Speech Separation with Time-and-Frequency Cross-Domain Joint Embedding and Clustering
Gene-Ping Yang, Chao-I Tuan, Hung-Yi Lee et al.
Improved Vocal Tract Length Perturbation for a State-of-the-Art End-to-End Speech Recognition System
Chanwoo Kim, Minkyu Shin, Abhinav Garg et al.
Improvement and Assessment of Spectro-Temporal Modulation Analysis for Speech Intelligibility Estimation
Amin Edraki, Wai-Yip Chan, Jesper Jensen et al.
Improving Aggregation and Loss Function for Better Embedding Learning in End-to-End Speaker Verification System
Zhifu Gao, Yan Song, Ian McLoughlin et al.
Improving ASR Confidence Scores for Alexa Using Acoustic and Hypothesis Embeddings
Prakhar Swarup, Roland Maas, Sri Garimella et al.
Improving ASR Systems for Children with Autism and Language Impairment Using Domain-Focused DNN Transfer Techniques
Robert Gale, Liu Chen, Jill Dolata et al.
Improving Automatically Induced Lexicons for Highly Agglutinating Languages Using Data-Driven Morphological Segmentation
Wiehan Agenbag, Thomas Niesler
Improving Code-Switched Language Modeling Performance Using Cognate Features
Victor Soto, Julia Hirschberg
Improving Conversation-Context Language Models with Multiple Spoken Language Understanding Models
Ryo Masumura, Tomohiro Tanaka, Atsushi Ando et al.
Improving Emotion Identification Using Phone Posteriors in Raw Speech Waveform Based DNN
Mousmita Sarma, Pegah Ghahremani, Daniel Povey et al.
Improving Keyword Spotting and Language Identification via Neural Architecture Search at Scale
Hanna Mazzawi, Xavi Gonzalvo, Aleks Kracun et al.
Improving Large Vocabulary Urdu Speech Recognition System Using Deep Neural Networks
Muhammad Umar Farooq, Farah Adeeba, Sahar Rauf et al.
Improving Performance of End-to-End ASR on Numeric Sequences
Cal Peyser, Hao Zhang, Tara N. Sainath et al.
Improving Speech Synthesis with Discourse Relations
Adèle Aubin, Alessandra Cervone, Oliver Watts et al.
Improving Transformer-Based End-to-End Speech Recognition with Connectionist Temporal Classification and Language Model Integration
Shigeki Karita, Nelson Enrique Yalta Soplin, Shinji Watanabe et al.
Improving Transformer-Based Speech Recognition Systems with Compressed Structure and Speech Attributes Augmentation
Sheng Li, Dabre Raj, Xugang Lu et al.
Incorporating Symbolic Sequential Modeling for Speech Enhancement
Chien-Feng Liao, Yu Tsao, Xugang Lu et al.
Individual Difference of Relative Tongue Size and its Acoustic Effects
Xiaohan Zhang, Chongke Bi, Kiyoshi Honda et al.
Individual Differences in Implicit Attention to Phonetic Detail in Speech Perception
Natalie Lewandowski, Daniel Duran
Individual Differences of Airflow and Sound Generation in the Vocal Tract of Sibilant /s/
Tsukasa Yoshinaga, Kazunori Nozaki, Shigeo Wada
Individual Variation in Cognitive Processing Style Predicts Differences in Phonetic Imitation of Device and Human Voices
Cathryn Snyder, Michelle Cohn, Georgia Zellou