Papers
OverFlow: Putting flows on top of neural transducers for better TTS
Shivam Mehta, Ambika Kirkland, Harm Lameris et al.
Overlap Aware Continuous Speech Separation without Permutation Invariant Training
Linfeng Yu, Wangyou Zhang, Chenda Li et al.
Parameter-Efficient Learning for Text-to-Speech Accent Adaptation
Li-Jen Yang, Chao-Han Huck Yang, Jen-Tzung Chien
Parameter-Efficient Low-Resource Dialogue State Tracking by Prompt Tuning
Mingyu Derek Ma, Jiun-Yu Kao, Shuyang Gao et al.
Parameter Selection for Analyzing Conversations with Autism Spectrum Disorder
Tahiya Chowdhury, Veronica Romero, Amanda Stent
Pardon my disfluency: The impact of disfluency effects on the perception of speaker competence and confidence
Ambika Kirkland, Joakim Gustafson, Éva Székely
Parsing dialog turns with prosodic features in English
Elizabeth Nielsen, Mark Steedman, Sharon Goldwater
Patch-Mix Contrastive Learning with Audio Spectrogram Transformer on Respiratory Sound Classification
Sangmin Bae, June-Woo Kim, Won-Yang Cho et al.
PATCorrect: Non-autoregressive Phoneme-augmented Transformer for ASR Error Correction
Ziji Zhang, Zhehui Wang, Rajesh Kamma et al.
PCNN: A Lightweight Parallel Conformer Neural Network for Efficient Monaural Speech Enhancement
Xinmeng Xu, Weiping Tu, Yuhong Yang
Perception of Incomplete Voicing Neutralization of Obstruents in Tohoku Japanese
Mafuyu Kitahara, Naoya Watabe, Hiroto Noguchi et al.
Perceptual and Task-Oriented Assessment of a Semantic Metric for ASR Evaluation
Janine Rugayan, Giampiero Salvi, Torbjørn Svendsen
Perceptual Improvement of Deep Neural Network (DNN) Speech Coder Using Parametric and Non-parametric Density Models
Joon Byun, Seungmin Shin, Jongmo Sung et al.
Personality-aware Training based Speaker Adaptation for End-to-end Speech Recognition
Yue Gu, Zhihao Du, Shiliang Zhang et al.
Personalization for BERT-based Discriminative Speech Recognition Rescoring
Jari Kolehmainen, Yile Gu, Aditya Gourav et al.
Personalization for Robust Voice Pathology Detection in Sound Waves
Khanh-Tung Tran, Truong Hoang, Duy Khuong Nguyen et al.
Personalized Acoustic Scene Classification in Ultra-low Power Embedded Devices Using Privacy-preserving Data Augmentation
Timm Koppelmann, Semih Agcaer, Rainer Martin
Personalized Adaptation with Pre-trained Speech Encoders for Continuous Emotion Recognition
Minh Tran, Yufeng Yin, Mohammad Soleymani
Personalized Dereverberation of Speech
Ruilin Xu, Gurunandan Krishnan, Changxi Zheng et al.
Personalized Predictive ASR for Latency Reduction in Voice Assistants
Andreas Schwarz, Di He, Maarten Van Segbroeck et al.
Personal Primer Prototype 1: Invitation to Make Your Own Embooked Speech-Based Educational Artifact
Daniel D. Hromada, Hyungjoong Kim
Phase perturbation improves channel robustness for speech spoofing countermeasures
Yongyi Zang, You Zhang, Zhiyao Duan
Phonemic competition in end-to-end ASR models
Louis ten Bosch, Martijn Bentum, Lou Boves
Phonetic and Prosody-aware Self-supervised Learning Approach for Non-native Fluency Scoring
Kaiqi Fu, Shaojun Gao, Shuju Shi et al.