Papers
Phonetically Induced Subwords for End-to-End Speech Recognition
Vasileios Papadourakis, Markus Müller, Jing Liu et al.
Phonetically Motivated Self-Supervised Speech Representation Learning
Xianghu Yue, Haizhou Li
Phonetic and Prosodic Information Estimation from Texts for Genuine Japanese End-to-End Text-to-Speech
Naoto Kakegawa, Sunao Hara, Masanobu Abe et al.
Phonetic Complexity, Speech Accuracy and Intelligibility Assessment of Italian Dysarthric Speech
Barbara Gili Fivela, Vincenzo Sallustio, Silvia Pede et al.
Phonetic Distance and Surprisal in Multilingual Priming: Evidence from Slavic
Jacek Kudera, Philip Georgis, Bernd Möbius et al.
Phrase Break Prediction with Bidirectional Encoder Representations in Japanese Text-to-Speech Synthesis
Kosuke Futamata, Byeongseon Park, Ryuichi Yamamoto et al.
PILOT: Introducing Transformers for Probabilistic Sound Event Localization
Christopher Schymura, Benedikt Bönninghoff, Tsubasa Ochiai et al.
Pitch Contour Separation from Overlapping Speech
Hiroki Mori
PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS
Ye Jia, Heiga Zen, Jonathan Shen et al.
Polyphone Disambiguation in Mandarin Chinese with Semi-Supervised Learning
Yi Shi, Congyi Wang, Yu Chen et al.
PQK: Model Compression via Pruning, Quantization, and Knowledge Distillation
Jangho Kim, Simyung Chang, Nojun Kwak
Predicting Temporal Performance Drop of Deployed Production Spoken Language Understanding Models
Quynh Do, Judith Gaspers, Daniil Sorokin et al.
Presentation Matters: Evaluating Speaker Identification Tasks
Benjamin O’Brien, Christine Meunier, Alain Ghio
Pre-Training for Spoken Language Understanding with Joint Textual and Phonetic Representation Learning
Qian Chen, Wen Wang, Qinglin Zhang
Primacy of Mouth over Eyes: Eye Movement Evidence from Audiovisual Mandarin Lexical Tones and Vowels
Biao Zeng, Rui Wang, Guoxing Yu et al.
Privacy-Preserving Feature Extraction for Cloud-Based Wake Word Verification
Timm Koppelmann, Alexandru Nelus, Lea Schönherr et al.
Privacy-Preserving Voice Anti-Spoofing Using Secure Multi-Party Computation
Oubaïda Chouchane, Baptiste Brossier, Jorge Esteban Gamboa Gamboa et al.
ProsoBeast Prosody Annotation Tool
Branislav Gerazov, Michael Wagner
Prosodic Accommodation in Face-to-Face and Telephone Dialogues
Pavel Šturm, Radek Skarnitzl, Tomáš Nechanský
Prosodic Boundary Prediction Model for Vietnamese Text-To-Speech
Nguyen Thi Thu Trang, Nguyen Hoang Ky, Albert Rilliard et al.
Prosodic Disambiguation Using Chironomic Stylization of Intonation with Native and Non-Native Speakers
Xiao Xiao, Nicolas Audibert, Grégoire Locqueville et al.
Prosody of Case Markers in Urdu
Benazir Mumtaz, Massimiliano Canzi, Miriam Butt
Protecting Gender and Identity with Disentangled Speech Representations
Dimitrios Stoidis, Andrea Cavallaro
Pushing the Limits of Non-Autoregressive Speech Recognition
Edwin G. Ng, Chung-Cheng Chiu, Yu Zhang et al.
QISTA-Net-Audio: Audio Super-Resolution via Non-Convex ℓ_q-Norm Minimization
Gang-Xuan Lin, Shih-Wei Hu, Yen-Ju Lu et al.