Papers
Assessing Parkinson’s Disease from Speech Using Fisher Vectors
José Vicente Egas López, Juan Rafael Orozco-Arroyave, Gábor Gosztolya
Assessing the Semantic Space Bias Caused by ASR Error Propagation and its Effect on Spoken Document Summarization
Máté Ákos Tündik, Valér Kaszás, György Szaszák
A Statistically Principled and Computationally Efficient Approach to Speech Enhancement Using Variational Autoencoders
Manuel Pariente, Antoine Deleforge, Emmanuel Vincent
A Storyteller’s Tale: Literature Audiobooks Genre Classification Using CNN and RNN Architectures
Nehory Carmi, Azaria Cohen, Mireille Avigal et al.
A Strategy for Improved Phone-Level Lyrics-to-Audio Alignment for Speech-to-Singing Synthesis
David Ayllón, Fernando Villavicencio, Pierre Lanchantin
A Study for Improving Device-Directed Speech Detection Toward Frictionless Human-Machine Interaction
Che-Wei Huang, Roland Maas, Sri Harish Mallidi et al.
A Study of a Cross-Language Perception Based on Cortical Analysis Using Biomimetic STRFs
Sangwook Park, David K. Han, Mounya Elhilali
A Study of Soprano Singing in Light of the Source-Filter Interaction
Tokihiko Kaburagi
A Study of x-Vector Based Speaker Recognition on Short Utterances
A. Kanagasundaram, S. Sridharan, G. Sriram et al.
ASVspoof 2019: Future Horizons in Spoofed and Fake Audio Detection
Massimiliano Todisco, Xin Wang, Ville Vestman et al.
A System for Real-Time Privacy Preserving Data Collection for Ambient Assisted Living
Fasih Haider, Saturnino Luz
A Time Delay Neural Network with Shared Weight Self-Attention for Small-Footprint Keyword Spotting
Ye Bai, Jiangyan Yi, Jianhua Tao et al.
Attention Based Hybrid i-Vector BLSTM Model for Language Recognition
Bharat Padi, Anand Mohan, Sriram Ganapathy
Attention-Based Word Vector Prediction with LSTMs and its Application to the OOV Problem in ASR
Alejandro Coucheiro-Limeres, Fernando Fernández-Martínez, Rubén San-Segundo et al.
Attention-Enhanced Connectionist Temporal Classification for Discrete Speech Emotion Recognition
Ziping Zhao, Zhongtian Bao, Zixing Zhang et al.
Attention Model for Articulatory Features Detection
Ievgen Karaulov, Dmytro Tkanov
Attentive to Individual: A Multimodal Emotion Recognition Network with Personalized Attention Profile
Jeng-Lin Li, Chi-Chun Lee
Audio Classification of Bit-Representation Waveform
Masaki Okawa, Takuya Saito, Naoki Sawada et al.
Audio Tagging with Compact Feedforward Sequential Memory Network and Audio-to-Audio Ratio Based Data Augmentation
Zhiying Huang, Shiliang Zhang, Ming Lei
Augmented CycleGANs for Continuous Scale Normal-to-Lombard Speaking Style Conversion
Shreyas Seshadri, Lauri Juvela, Paavo Alku et al.
A Unified Bayesian Source Modelling for Determined Blind Source Separation
Chaitanya Narisetty
A Unified Framework for Speaker and Utterance Verification
Tianchi Liu, Maulik Madhavi, Rohan Kumar Das et al.
Autoencoder-Based Semi-Supervised Curriculum Learning for Out-of-Domain Speaker Verification
Siqi Zheng, Gang Liu, Hongbin Suo et al.
Auto-Encoding Nearest Neighbor i-Vectors for Speaker Verification
Umair Khan, Miquel India, Javier Hernando