Papers
Exploiting Cross-Domain Visual Feature Generation for Disordered Speech Recognition
Shansong Liu, Xurong Xie, Jianwei Yu et al.
Exploiting Deep Sentential Context for Expressive End-to-End Speech Synthesis
Fengyu Yang, Shan Yang, Qinghua Wu et al.
Exploiting Multi-Modal Features from Pre-Trained Networks for Alzheimer’s Dementia Recognition
Junghyun Koo, Jie Hwan Lee, Jaewoo Pyo et al.
Exploration of Acoustic and Lexical Cues for the INTERSPEECH 2020 Computational Paralinguistic Challenge
Ziqing Yang, Zifan An, Zehao Fan et al.
Exploration of Audio Quality Assessment and Anomaly Localisation Using Attention Models
Qiang Huang, Thomas Hain
Exploration of End-to-End Synthesisers for Zero Resource Speech Challenge 2020
Karthik Pandia D.S., Anusha Prakash, Mano Ranjith Kumar M. et al.
Exploring Deep Hybrid Tensor-to-Vector Network Architectures for Regression Based Speech Enhancement
Jun Qi, Hu Hu, Yannan Wang et al.
Exploring Lexicon-Free Modeling Units for End-to-End Korean and Korean-English Code-Switching Speech Recognition
Jisung Wang, Jihwan Kim, Sangki Kim et al.
Exploring Listeners’ Speech Rate Preferences
Olympia Simantiraki, Martin Cooke
Exploring MMSE Score Prediction Using Verbal and Non-Verbal Cues
Shahla Farzana, Natalie Parde
Exploring Text and Audio Embeddings for Multi-Dimension Elderly Emotion Recognition
Mariana Julião, Alberto Abad, Helena Moniz
Exploring the Use of an Artificial Accent of English to Assess Phonetic Learning in Monolingual and Bilingual Speakers
Laura Spinu, Jiwon Hwang, Nadya Pincus et al.
Exploring the Use of an Unsupervised Autoregressive Model as a Shared Encoder for Text-Dependent Speaker Verification
Vijay Ravi, Ruchao Fan, Amber Afshan et al.
Exploring Transformers for Large-Scale Speech Recognition
Liang Lu, Changliang Liu, Jinyu Li et al.
Exploring TTS Without T Using Biologically/Psychologically Motivated Neural Network Modules (ZeroSpeech 2020)
Takashi Morita, Hiroki Koda
Extended Study on the Use of Vocal Tract Variables to Quantify Neuromotor Coordination in Depression
Nadee Seneviratne, James R. Williamson, Adam C. Lammert et al.
Extrapolating False Alarm Rates in Automatic Speaker Verification
Alexey Sholokhov, Tomi Kinnunen, Ville Vestman et al.
F0 Patterns in Mandarin Statements of Mandarin and Cantonese Speakers
Yike Yang, Si Chen, Xi Chen
F0 Slope and Mean: Cues to Speech Segmentation in French
Maria del Mar Cordero, Fanny Meunier, Nicolas Grimault et al.
Face2Speech: Towards Multi-Speaker Text-to-Speech Synthesis Using an Embedding Vector Predicted from a Face Image
Shunsuke Goto, Kotaro Onishi, Yuki Saito et al.
FaceFilter: Audio-Visual Speech Separation Using Still Images
Soo-Whan Chung, Soyeon Choe, Joon Son Chung et al.
Fast and Lightweight On-Device TTS with Tacotron2 and LPCNet
Vadim Popov, Stanislav Kamenev, Mikhail Kudinov et al.
Fast and Slow Acoustic Model
Kshitiz Kumar, Emilian Stoimenov, Hosam Khalil et al.
Faster, Simpler and More Accurate Hybrid ASR Systems Using Wordpieces
Frank Zhang, Yongqiang Wang, Xiaohui Zhang et al.
FEARLESS STEPS Challenge (FS-2): Supervised Learning with Massive Naturalistic Apollo Data
Aditya Joglekar, John H.L. Hansen, Meena Chandra Shekar et al.