Papers
Attentron: Few-Shot Text-to-Speech Utilizing Attention-Based Variable-Length Embedding
Seungwoo Choi, Seungju Han, Dongyoung Kim et al.
Audio Dequantization for High Fidelity Audio Generation in Flow-Based Neural Vocoder
Hyun-Wook Yoon, Sang-Hoon Lee, Hyeong-Rae Noh et al.
Audiovisual Correspondence Learning in Humans and Machines
Venkat Krishnamohan, Akshara Soman, Anshul Gupta et al.
Audio-Visual Multi-Channel Recognition of Overlapped Speech
Jianwei Yu, Bo Wu, Rongzhi Gu et al.
Audio-Visual Multi-Speaker Tracking Based on the GLMB Framework
Shoufeng Lin, Xinyuan Qian
Audio-Visual Speaker Recognition with a Cross-Modal Discriminative Network
Ruijie Tao, Rohan Kumar Das, Haizhou Li
Augmenting Generative Adversarial Networks for Speech Emotion Recognition
Siddique Latif, Muhammad Asim, Rajib Rana et al.
Augmenting Images for ASR and TTS Through Single-Loop and Dual-Loop Multimodal Chain Framework
Johanes Effendi, Andros Tjandra, Sakriani Sakti et al.
Augmenting Turn-Taking Prediction with Wearable Eye Activity During Conversation
Hang Li, Siyuan Chen, Julien Epps
A Unified Framework for Low-Latency Speaker Extraction in Cocktail Party Environments
Yunzhe Hao, Jiaming Xu, Jing Shi et al.
Autoencoder Bottleneck Features with Multi-Task Optimisation for Improved Continuous Dysarthric Speech Recognition
Zhengjun Yue, Heidi Christensen, Jon Barker
Automated Screening for Alzheimer’s Dementia Through Spontaneous Speech
Muhammad Shehram Shah Syed, Zafi Sherhan Syed, Margaret Lech et al.
Automatic Analysis of Speech Prosody in Dutch
Na Hu, Berit Janssen, Judith Hanssen et al.
Automatic Assessment of Dysarthric Severity Level Using Audio-Video Cross-Modal Approach in Deep Learning
Han Tong, Hamid Sharifzadeh, Ian McLoughlin
Automatic Detection of Accent and Lexical Pronunciation Errors in Spontaneous Non-Native English Speech
Konstantinos Kyriakopoulos, Kate M. Knill, Mark J.F. Gales
Automatic Discrimination of Apraxia of Speech and Dysarthria Using a Minimalistic Set of Handcrafted Features
Ina Kodrasi, Michaela Pernon, Marina Laganaro et al.
Automatic Estimation of Intelligibility Measure for Consonants in Speech
Ali Abavisani, Mark Hasegawa-Johnson
Automatic Estimation of Pathological Voice Quality Based on Recurrent Neural Network Using Amplitude and Phase Spectrogram
Shunsuke Hidaka, Yogaku Lee, Kohei Wakamiya et al.
Automatic Glottis Detection and Segmentation in Stroboscopic Videos Using Convolutional Networks
Divya Degala, Achuth Rao M.V., Rahul Krishnamurthy et al.
Automatic Prediction of Confidence Level from Children’s Oral Reading Recordings
Kamini Sabu, Preeti Rao
Automatic Prediction of Speech Intelligibility Based on X-Vectors in the Context of Head and Neck Cancer
Sebastião Quintas, Julie Mauclair, Virginie Woisard et al.
Automatic Quality Assessment for Audio-Visual Verification Systems. The LOVe Submission to NIST SRE Challenge 2019
Grigory Antipov, Nicolas Gengembre, Olivier Le Blouch et al.
Automatic Scoring at Multi-Granularity for L2 Pronunciation
Binghuai Lin, Liyuan Wang, Xiaoli Feng et al.
Automatic Speech Recognition Benchmark for Air-Traffic Communications
Juan Zuluaga-Gomez, Petr Motlicek, Qingran Zhan et al.