Papers
Improving DNNs Trained with Non-Native Transcriptions Using Knowledge Distillation and Target Interpolation
Amit Das, Mark Hasegawa-Johnson
Improving Gender Identification in Movie Audio Using Cross-Domain Data
Rajat Hebbar, Krishna Somandepalli, Shrikanth Narayanan
Improving Language Modeling with an Adversarial Critic for Automatic Speech Recognition
Yike Zhang, Pengyuan Zhang, Yonghong Yan
Improving Mandarin Tone Recognition Using Convolutional Bidirectional Long Short-Term Memory with Attention
Longfei Yang, Yanlu Xie, Jinsong Zhang
Improving Mongolian Phrase Break Prediction by Using Syllable and Morphological Embeddings with BiLSTM Model
Rui Liu, Feilong Bao, Guanglai Gao et al.
Improving Response Time of Active Speaker Detection Using Visual Prosody Information Prior to Articulation
Fasih Haider, Saturnino Luz, Carl Vogel et al.
Improving Sparse Representations in Exemplar-Based Voice Conversion with a Phoneme-Selective Objective Function
Shaojin Ding, Guanlong Zhao, Christopher Liberatore et al.
Incremental TTS for Japanese Language
Tomoya Yanagita, Sakriani Sakti, Satoshi Nakamura
Indian Languages ASR: A Multilingual Phone Recognition Framework with IPA Based Common Phone-set, Predicted Articulatory Features and Feature fusion
Manjunath K E, K. Sreenivasa Rao, Dinesh Babu Jayagopi et al.
Infant Emotional Outbursts Detection in Infant-parent Spoken Interactions
Yijia Xu, Mark Hasegawa-Johnson, Nancy McElwain
Inference-Invariant Transformation of Batch Normalization for Domain Adaptation of Acoustic Models
Masayuki Suzuki, Tohru Nagano, Gakuto Kurata et al.
Influences of Fundamental Oscillation on Speaker Identification in Vocalic Utterances by Humans and Computers
Volker Dellwo, Thayabaran Kathiresan, Elisa Pellegrino et al.
Information Bottleneck Based Percussion Instrument Diarization System for Taniavartanam Segments of Carnatic Music Concerts
Nauman Dawalatabad, Jom Kuriakose, Chandra Sekhar Chellu et al.
Information Encoding by Deep Neural Networks: What Can We Learn?
Louis ten Bosch, Lou Boves
Information Structure, Affect and Prenuclear Prominence in American English
Eleanor Chodroff, Jennifer Cole
Integrated Presentation Attack Detection and Automatic Speaker Verification: Common Features and Gaussian Back-end Fusion
Massimiliano Todisco, Héctor Delgado, Kong Aik Lee et al.
Integrating Neural Network Based Beamforming and Weighted Prediction Error Dereverberation
Lukas Drude, Christoph Boeddeker, Jahn Heymann et al.
Integrating Recurrence Dynamics for Speech Emotion Recognition
Efthymios Tzinis, Georgios Paraskevopoulos, Christos Baziotis et al.
Integrating Spectral and Spatial Features for Multi-Channel Speaker Separation
Zhong-Qiu Wang, DeLiang Wang
Intent Discovery Through Unsupervised Semantic Text Clustering
A Padmasundari, Srinivas Bangalore
Interaction Mechanisms between Glottal Source and Vocal Tract in Pitch Glides
Tiina Murtola, Jarmo Malinen
Interactions between Vowels and Nasal Codas in Mandarin Speakers’ Perception of Nasal Finals
Chong Cao, Wei Wei, Wei Wang et al.
Intonation tutor by SPIRE (In-SPIRE): An Online Tool for an Automatic Feedback to the Second Language Learners in Learning Intonation
Anand P A, Chiranjeevi Yarra, Kausthubha N K et al.
Investigating Accuracy of Pitch-accent Annotations in Neural Network-based Speech Synthesis and Denoising Effects
Hieu-Thi Luong, Xin Wang, Junichi Yamagishi et al.
Investigating Generative Adversarial Networks Based Speech Dereverberation for Robust Speech Recognition
Ke Wang, Junbo Zhang, Sining Sun et al.