Papers
Improving Response Time of Active Speaker Detection Using Visual Prosody Information Prior to Articulation
Fasih Haider, Saturnino Luz, Carl Vogel et al.
Improving Sparse Representations in Exemplar-Based Voice Conversion with a Phoneme-Selective Objective Function
Shaojin Ding, Guanlong Zhao, Christopher Liberatore et al.
Incremental TTS for Japanese Language
Tomoya Yanagita, Sakriani Sakti, Satoshi Nakamura
Indian Languages ASR: A Multilingual Phone Recognition Framework with IPA Based Common Phone-set, Predicted Articulatory Features and Feature fusion
Manjunath K E, K. Sreenivasa Rao, Dinesh Babu Jayagopi et al.
Infant Emotional Outbursts Detection in Infant-parent Spoken Interactions
Yijia Xu, Mark Hasegawa-Johnson, Nancy McElwain
Inference-Invariant Transformation of Batch Normalization for Domain Adaptation of Acoustic Models
Masayuki Suzuki, Tohru Nagano, Gakuto Kurata et al.
Influences of Fundamental Oscillation on Speaker Identification in Vocalic Utterances by Humans and Computers
Volker Dellwo, Thayabaran Kathiresan, Elisa Pellegrino et al.
Information Bottleneck Based Percussion Instrument Diarization System for Taniavartanam Segments of Carnatic Music Concerts
Nauman Dawalatabad, Jom Kuriakose, Chandra Sekhar Chellu et al.
Information Encoding by Deep Neural Networks: What Can We Learn?
Louis ten Bosch, Lou Boves
Information Structure, Affect and Prenuclear Prominence in American English
Eleanor Chodroff, Jennifer Cole
Integrated Presentation Attack Detection and Automatic Speaker Verification: Common Features and Gaussian Back-end Fusion
Massimiliano Todisco, Héctor Delgado, Kong Aik Lee et al.
Integrating Neural Network Based Beamforming and Weighted Prediction Error Dereverberation
Lukas Drude, Christoph Boeddeker, Jahn Heymann et al.
Integrating Recurrence Dynamics for Speech Emotion Recognition
Efthymios Tzinis, Georgios Paraskevopoulos, Christos Baziotis et al.
Integrating Spectral and Spatial Features for Multi-Channel Speaker Separation
Zhong-Qiu Wang, DeLiang Wang
Intent Discovery Through Unsupervised Semantic Text Clustering
A Padmasundari, Srinivas Bangalore
Interaction Mechanisms between Glottal Source and Vocal Tract in Pitch Glides
Tiina Murtola, Jarmo Malinen
Interactions between Vowels and Nasal Codas in Mandarin Speakers’ Perception of Nasal Finals
Chong Cao, Wei Wei, Wei Wang et al.
Intonation tutor by SPIRE (In-SPIRE): An Online Tool for an Automatic Feedback to the Second Language Learners in Learning Intonation
Anand P A, Chiranjeevi Yarra, Kausthubha N K et al.
Investigating Accuracy of Pitch-accent Annotations in Neural Network-based Speech Synthesis and Denoising Effects
Hieu-Thi Luong, Xin Wang, Junichi Yamagishi et al.
Investigating Generative Adversarial Networks Based Speech Dereverberation for Robust Speech Recognition
Ke Wang, Junbo Zhang, Sining Sun et al.
Investigating Objective Intelligibility in Real-Time EMG-to-Speech Conversion
Lorenz Diener, Tanja Schultz
Investigating Speech Enhancement and Perceptual Quality for Speech Emotion Recognition
Anderson R. Avila, Md Jahangir Alam, Douglas O'Shaughnessy et al.
Investigating Speech Features for Continuous Turn-Taking Prediction Using LSTMs
Matthew Roddy, Gabriel Skantze, Naomi Harte
Investigating the Effect of Audio Duration on Dementia Detection Using Acoustic Features
Jochen Weiner, Miguel Angrick, Srinivasan Umesh et al.
Investigating the Role of Familiar Face and Voice Cues in Speech Processing in Noise
Jeesun Kim, Sonya Karisma, Vincent Aubanel et al.