Papers
An Integrated Framework for Two-Pass Personalized Voice Trigger
Dexin Liao, Jing Li, Yiming Zhi et al.
Annotation Confidence vs. Training Sample Size: Trade-Off Solution for Partially-Continuous Categorical Emotion Recognition
Elena Ryumina, Oxana Verkholyak, Alexey Karpov
A Noise Robust Method for Word-Level Pronunciation Assessment
Binghuai Lin, Liyuan Wang
Anonymous Speaker Clusters: Making Distinctions Between Anonymised Speech Recordings with Clustering Interface
Benjamin O’Brien, Natalia Tomashenko, Anaïs Chanclu et al.
AntVoice Neural Speaker Embedding System for FFSVC 2020
Zhiming Wang, Furong Xu, Kaisheng Yao et al.
A Partitioned-Block Frequency-Domain Adaptive Kalman Filter for Stereophonic Acoustic Echo Cancellation
Rui Zhu, Feiran Yang, Yuepeng Li et al.
Application for Detecting Depression, Parkinson’s Disease and Dysphonic Speech
Gábor Kiss, Dávid Sztahó, Miklós Gábriel Tulics
Applying TDNN Architectures for Analyzing Duration Dependencies on Speech Emotion Recognition
Pooja Kumawat, Aurobinda Routray
Applying the Information Bottleneck Principle to Prosodic Representation Learning
Guangyan Zhang, Ying Qin, Daxin Tan et al.
A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker Identity in Dysarthric Voice Conversion
Wen-Chin Huang, Kazuhiro Kobayashi, Yu-Huai Peng et al.
A Preliminary Study on Discourse Prosody Encoding in L1 and L2 English Spontaneous Narratives
Yuqing Zhang, Zhu Li, Binghuai Lin et al.
A Prototypical Network Approach for Evaluating Generated Emotional Speech
Alice Baird, Silvan Mertes, Manuel Milling et al.
A Psychology-Driven Computational Analysis of Political Interviews
Darren Cook, Miri Zilka, Simon Maskell et al.
Arabic Code-Switching Speech Recognition Using Monolingual Data
Ahmed Ali, Shammur Absar Chowdhury, Amir Hussein et al.
Articulatory Characteristics of Icelandic Voiced Fricative Lenition: Gradience, Categoricity, and Speaker/Gesture-Specific Effects
Brynhildur Stefansdottir, Francesco Burroni, Sam Tilsen
Articulatory Coordination for Speech Motor Tracking in Huntington Disease
Matthew Perez, Amrit Romana, Angela Roberts et al.
Articulatory Data Recorder: A Framework for Real-Time Articulatory Data Recording
Alexander Wilbrandt, Simon Stone, Peter Birkholz
A Simplified Model for the Vocal Tract of [s] with Inclined Incisors
Tsukasa Yoshinaga, Kohei Tada, Kazunori Nozaki et al.
A Simultaneous Denoising and Dereverberation Framework with Target Decoupling
Andong Li, Wenzhe Liu, Xiaoxue Luo et al.
A Spectro-Temporal Glimpsing Index (STGI) for Speech Intelligibility Prediction
Amin Edraki, Wai-Yip Chan, Jesper Jensen et al.
A Speech Emotion Recognition Framework for Better Discrimination of Confusions
Jiawang Liu, Haoxiang Wang
ASR Posterior-Based Loss for Multi-Task End-to-End Speech Translation
Yuka Ko, Katsuhito Sudoh, Sakriani Sakti et al.
Assessing Posterior-Based Mispronunciation Detection on Field-Collected Recordings from Child Speech Therapy Sessions
Adam Hair, Guanlong Zhao, Beena Ahmed et al.
Assessing the Use of Prosody in Constituency Parsing of Imperfect Transcripts
Trang Tran, Mari Ostendorf
Assessment of von Mises-Bernoulli Deep Neural Network in Sound Source Localization
Katsutoshi Itoyama, Yoshiya Morimoto, Shungo Masaki et al.