Papers
Pitch Declination and Final Lowering in Northeastern Mandarin
Ping Cui, Jianjing Kuang
POCO: A Voice Spoofing and Liveness Detection Corpus Based on Pop Noise
Kosuke Akimoto, Seng Pei Liew, Sakiko Mishima et al.
PoCoNet: Better Speech Enhancement with Frequency-Positional Embeddings, Semi-Supervised Conversational Data, and Biased Loss
Umut Isik, Ritwik Giri, Neerad Phansalkar et al.
Poetic Meter Classification Using i-Vector-MTF Fusion
Rajeev Rajan, Aiswarya Vinod Kumar, Ben P. Babu
Polishing the Classical Likelihood Ratio Test by Supervised Learning for Voice Activity Detection
Tianjiao Xu, Hui Zhang, Xueliang Zhang
Predicting Collaborative Task Performance Using Graph Interlocutor Acoustic Network in Small Group Interaction
Shun-Chang Zhong, Bo-Hao Su, Wei Huang et al.
Predicting Detection Filters for Small Footprint Open-Vocabulary Keyword Spotting
Théodore Bluche, Thibault Gisselbrecht
Predicting Intelligibility of Enhanced Speech Using Posteriors Derived from DNN-Based ASR System
Kenichi Arai, Shoko Araki, Atsunori Ogawa et al.
Prediction of Head Motion from Speech Waveforms with a Canonical-Correlation-Constrained Autoencoder
JinHong Lu, Hiroshi Shimodaira
Prediction of Sleepiness Ratings from Voice by Man and Machine
Mark Huckvale, András Beke, Mirei Ikushima
Principal Style Components: Expressive Style Control and Cross-Speaker Transfer in Neural TTS
Alexander Sorin, Slava Shechtman, Ron Hoory
Privacy Guarantees for De-Identifying Text Transformations
David Ifeoluwa Adelani, Ali Davody, Thomas Kleinbauer et al.
Pronunciation Erroneous Tendency Detection with Language Adversarial Represent Learning
Longfei Yang, Kaiqi Fu, Jinsong Zhang et al.
Prosodic Characteristics of Genuine and Mock (Im)polite Mandarin Utterances
Chengwei Xu, Wentao Gu
Prosody and Breathing: A Comparison Between Rhetorical and Information-Seeking Questions in German and Brazilian Portuguese
Jana Neitsch, Plinio A. Barbosa, Oliver Niebuhr
Prosody Learning Mechanism for Speech Synthesis System Without Text Length Limit
Zhen Zeng, Jianzong Wang, Ning Cheng et al.
Prototypical Q Networks for Automatic Conversational Diagnosis and Few-Shot New Disease Adaption
Hongyin Luo, Shang-Wen Li, James Glass
Punctuation Prediction in Spontaneous Conversations: Can We Mitigate ASR Errors with Retrofitted Word Embeddings?
Łukasz Augustyniak, Piotr Szymański, Mikołaj Morzy et al.
PyChain: A Fully Parallelized PyTorch Implementation of LF-MMI for End-to-End ASR
Yiwen Shao, Yiming Wang, Daniel Povey et al.
Quantification of Transducer Misalignment in Ultrasound Tongue Imaging
Tamás Gábor Csapó, Kele Xu
Quantization Aware Training with Absolute-Cosine Regularization for Automatic Speech Recognition
Hieu Duy Nguyen, Anastasios Alexandridis, Athanasios Mouchtaris
Quasi-Periodic Parallel WaveGAN Vocoder: A Non-Autoregressive Pitch-Dependent Dilated Convolution Model for Parametric Speech Generation
Yi-Chiao Wu, Tomoki Hayashi, Takuma Okamoto et al.
Quaternion Neural Networks for Multi-Channel Distant Speech Recognition
Xinchi Qiu, Titouan Parcollet, Mirco Ravanelli et al.