Papers
Open-Set Short Utterance Forensic Speaker Verification Using Teacher-Student Network with Explicit Inductive Bias
Mufan Sang, Wei Xia, John H.L. Hansen
ORCA-CLEAN: A Deep Denoising Toolkit for Killer Whale Communication
Christian Bergler, Manuel Schmitt, Andreas Maier et al.
Overview of the Interspeech TLT2020 Shared Task on ASR for Non-Native Children’s Speech
Roberto Gretter, Marco Matassoni, Daniele Falavigna et al.
Pair Expansion for Learning Multilingual Semantic Embeddings Using Disjoint Visually-Grounded Speech Audio Datasets
Yasunori Ohishi, Akisato Kimura, Takahito Kawanishi et al.
Paralinguistic Classification of Mask Wearing by Image Classifiers and Fusion
Jeno Szep, Salim Hariri
Parallel Rescoring with Transformer for Streaming On-Device Speech Recognition
Wei Li, James Qin, Chung-Cheng Chiu et al.
Pardon the Interruption: An Analysis of Gender and Turn-Taking in U.S. Supreme Court Oral Arguments
Haley Lepp, Gina-Anne Levow
Parkinson’s Disease Detection from Speech Using Single Frequency Filtering Cepstral Coefficients
Sudarsana Reddy Kadiri, Rashmi Kethireddy, Paavo Alku
Partial AUC Optimisation Using Recurrent Neural Networks for Music Detection with Limited Training Data
Pablo Gimeno, Victoria Mingote, Alfonso Ortega et al.
Peking Opera Synthesis via Duration Informed Attention Network
Yusong Wu, Shengchen Li, Chengzhu Yu et al.
Perceptimatic: A Human Speech Perception Benchmark for Unsupervised Subword Modelling
Juliette Millet, Ewan Dunbar
Perception and Production of Mandarin Initial Stops by Native Urdu Speakers
Dan Du, Xianjin Zhu, Zhu Li et al.
Perception of Japanese Consonant Length by Native Speakers of Korean Differing in Japanese Learning Experience
Kimiko Tsukada, Joo-Yeon Kim, Jeong-Im Han
Perception of Privacy Measured in the Crowd — Paired Comparison on the Effect of Background Noises
Anna Leschanowsky, Sneha Das, Tom Bäckström et al.
Phase-Aware Music Super-Resolution Using Generative Adversarial Networks
Shichao Hu, Bin Zhang, Beici Liang et al.
Phase Based Spectro-Temporal Features for Building a Robust ASR System
Anirban Dutta, G. Ashishkumar, Ch.V. Rama Rao
Phoneme-to-Grapheme Conversion Based Large-Scale Pre-Training for End-to-End Automatic Speech Recognition
Ryo Masumura, Naoki Makishima, Mana Ihori et al.
Phonetic Accommodation of L2 German Speakers to the Virtual Language Learning Tutor Mirabella
Iona Gessinger, Bernd Möbius, Bistra Andreeva et al.
Phonetically-Aware Coupled Network for Short Duration Text-Independent Speaker Verification
Siqi Zheng, Yun Lei, Hongbin Suo
Phonetic Entrainment in Cooperative Dialogues: A Case of Russian
Alla Menshikova, Daniil Kocharov, Tatiana Kachkovskaia
Phonetic, Frame Clustering and Intelligibility Analyses for the INTERSPEECH 2020 ComParE Challenge
Claude Montacié, Marie-José Caraty
Phonological Features for 0-Shot Multilingual Speech Synthesis
Marlene Staib, Tian Huey Teh, Alexandra Torresquintero et al.