Papers
A Storyteller’s Tale: Literature Audiobooks Genre Classification Using CNN and RNN Architectures
Nehory Carmi, Azaria Cohen, Mireille Avigal et al.
A Strategy for Improved Phone-Level Lyrics-to-Audio Alignment for Speech-to-Singing Synthesis
David Ayllón, Fernando Villavicencio, Pierre Lanchantin
A Study for Improving Device-Directed Speech Detection Toward Frictionless Human-Machine Interaction
Che-Wei Huang, Roland Maas, Sri Harish Mallidi et al.
A Study of a Cross-Language Perception Based on Cortical Analysis Using Biomimetic STRFs
Sangwook Park, David K. Han, Mounya Elhilali
A Study of Soprano Singing in Light of the Source-Filter Interaction
Tokihiko Kaburagi
A Study of x-Vector Based Speaker Recognition on Short Utterances
A. Kanagasundaram, S. Sridharan, G. Sriram et al.
ASVspoof 2019: Future Horizons in Spoofed and Fake Audio Detection
Massimiliano Todisco, Xin Wang, Ville Vestman et al.
A System for Real-Time Privacy Preserving Data Collection for Ambient Assisted Living
Fasih Haider, Saturnino Luz
A Time Delay Neural Network with Shared Weight Self-Attention for Small-Footprint Keyword Spotting
Ye Bai, Jiangyan Yi, Jianhua Tao et al.
Attention Based Hybrid i-Vector BLSTM Model for Language Recognition
Bharat Padi, Anand Mohan, Sriram Ganapathy
Attention-Based Word Vector Prediction with LSTMs and its Application to the OOV Problem in ASR
Alejandro Coucheiro-Limeres, Fernando Fernández-Martínez, Rubén San-Segundo et al.
Attention-Enhanced Connectionist Temporal Classification for Discrete Speech Emotion Recognition
Ziping Zhao, Zhongtian Bao, Zixing Zhang et al.
Attention Model for Articulatory Features Detection
Ievgen Karaulov, Dmytro Tkanov
Attentive to Individual: A Multimodal Emotion Recognition Network with Personalized Attention Profile
Jeng-Lin Li, Chi-Chun Lee
Audio Classification of Bit-Representation Waveform
Masaki Okawa, Takuya Saito, Naoki Sawada et al.
Audio Tagging with Compact Feedforward Sequential Memory Network and Audio-to-Audio Ratio Based Data Augmentation
Zhiying Huang, Shiliang Zhang, Ming Lei
Augmented CycleGANs for Continuous Scale Normal-to-Lombard Speaking Style Conversion
Shreyas Seshadri, Lauri Juvela, Paavo Alku et al.
A Unified Bayesian Source Modelling for Determined Blind Source Separation
Chaitanya Narisetty
A Unified Framework for Speaker and Utterance Verification
Tianchi Liu, Maulik Madhavi, Rohan Kumar Das et al.
Autoencoder-Based Semi-Supervised Curriculum Learning for Out-of-Domain Speaker Verification
Siqi Zheng, Gang Liu, Hongbin Suo et al.
Auto-Encoding Nearest Neighbor i-Vectors for Speaker Verification
Umair Khan, Miquel India, Javier Hernando
Automated Emotion Morphing in Speech Based on Diffeomorphic Curve Registration and Highway Networks
Ravi Shankar, Hsi-Wei Hsieh, Nicolas Charon et al.
Automated Estimation of Oral Reading Fluency During Summer Camp e-Book Reading with MyTurnToRead
Anastassia Loukina, Beata Beigman Klebanov, Patrick Lange et al.
Automatic Assessment of Language Impairment Based on Raw ASR Output
Ying Qin, Tan Lee, Anthony Pak Hin Kong