Papers
Depression State Assessment: Application for Detection of Depression by Speech
Gábor Kiss, Dávid Sztahó, Klára Vicsi
Design and Development of a Multi-Lingual Speech Corpora (TaMaR-EmoDB) for Emotion Analysis
Rajeev Rajan, Haritha U.G., Sujitha A.C. et al.
Detecting Depression with Word-Level Multimodal Fusion
Morteza Rohanian, Julian Hough, Matthew Purver
Detecting Mismatch Between Speech and Transcription Using Cross-Modal Attention
Qiang Huang, Thomas Hain
Detecting Spoofing Attacks Using VGG and SincNet: BUT-Omilia Submission to ASVspoof 2019 Challenge
Hossein Zeinali, Themos Stafylakis, Georgia Athanasopoulou et al.
Detecting Topic-Oriented Speaker Stance in Conversational Speech
Catherine Lai, Beatrice Alex, Johanna D. Moore et al.
Detection and Recovery of OOVs for Improved English Broadcast News Captioning
Samuel Thomas, Kartik Audhkhasi, Zoltán Tüske et al.
Detection of Glottal Closure Instants from Raw Speech Using Convolutional Neural Networks
Mohit Goyal, Varun Srivastava, Prathosh A.P.
Developing Pronunciation Models in New Languages Faster by Exploiting Common Grapheme-to-Phoneme Correspondences Across Languages
Harry Bleyan, Sandy Ritchie, Jonas Fromseier Mortensen et al.
Development of Emotion Rankers Based on Intended and Perceived Emotion Labels
Zhenghao Jin, Houwei Cao
Development of Robust Automated Scoring Models Using Adversarial Input for Oral Proficiency Assessment
Su-Youn Yoon, Chong Min Lee, Klaus Zechner et al.
Device Feature Extractor for Replay Spoofing Detection
Chang Huai You, Jichen Yang, Huy Dat Tran
Diagnosing Dysarthria with Long Short-Term Memory Networks
Alex Mayle, Zhiwei Mou, Razvan Bunescu et al.
Dimensions of Prosodic Prominence in an Attractor Model
Simon Roessig, Doris Mücke, Lena Pagel
Direct F0 Estimation with Neural-Network-Based Regression
Shuzhuang Xu, Hiroshi Shimodaira
Directional Audio Rendering Using a Neural Network Based Personalized HRTF
Geon Woo Lee, Jung Hyuk Lee, Seong Ju Kim et al.
Direction-Aware Speaker Beam for Multi-Channel Speaker Extraction
Guanjun Li, Shan Liang, Shuai Nie et al.
Direct Modelling of Speech Emotion from Raw Speech
Siddique Latif, Rajib Rana, Sara Khalifa et al.
Direct Neuron-Wise Fusion of Cognate Neural Networks
Takashi Fukuda, Masayuki Suzuki, Gakuto Kurata
Direct-Path Signal Cross-Correlation Estimation for Sound Source Localization in Reverberation
Wei Xue, Ying Tong, Guohong Ding et al.
Direct Speech-to-Speech Translation with a Sequence-to-Sequence Model
Ye Jia, Ron J. Weiss, Fadi Biadsy et al.
Disambiguation of Chinese Polyphones in an End-to-End Framework with Semantic Features Extracted by Pre-Trained BERT
Dongyang Dai, Zhiyong Wu, Shiyin Kang et al.
Discovering Dialog Rules by Means of an Evolutionary Approach
David Griol, Zoraida Callejas
Discriminative Learning for Monaural Speech Separation Using Deep Embedding Features
Cunhang Fan, Bin Liu, Jianhua Tao et al.
Disentangling Style Factors from Speaker Representations
Jennifer Williams, Simon King