Papers
Pay More Attention to History: A Context Modeling Strategy for Conversational Text-to-SQL
Yuntao Li, Hanchu Zhang, Yutian Li et al.
PEAF: Learnable Power Efficient Analog Acoustic Features for Audio Recognition
Boris Bergsma, Minhao Yang, Milos Cernak
Perceived prominence and downstep in Japanese
Hyun Kyung Hwang, Manami Hirayama, Takaomi Kato
PercepNet+: A Phase and SNR Aware PercepNet for Real-Time Speech Enhancement
Xiaofeng Ge, Jiangyu Han, Yanhua Long et al.
PERCEPT-R: An Open-Access American English Child/Clinical Speech Corpus Specialized for the Audio Classification of /ɹ/
Nina Benway, Jonathan L. Preston, Elaine Hitchcock et al.
Perceptual Characteristics Based Multi-objective Model for Speech Enhancement
Chiang-Jen Peng, Yun-Ju Chan, Yih-Liang Shen et al.
Perceptual Contrast Stretching on Target Feature for Speech Enhancement
Rong Chao, Cheng Yu, Szu-wei Fu et al.
Perceptual Evaluation of Penetrating Voices through a Semantic Differential Method
Tatsuya Kitamura, Naoki Kunimoto, Hideki Kawahara et al.
Performance Improvement of Speech Emotion Recognition by Neutral Speech Detection Using Autoencoder and Intermediate Representation
Jennifer Santoso, Takeshi Yamada, Kenkichi Ishizuka et al.
Personalized Acoustic Echo Cancellation for Full-duplex Communications
Shimin Zhang, Ziteng Wang, Yukai Ju et al.
Personalized Keyword Spotting through Multi-task Learning
Seunghan Yang, Byeonggeun Kim, Inseop Chung et al.
Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition
Shaojin Ding, Rajeev Rikhye, Qiao Liang et al.
Pharyngealization in Amazigh: Acoustic and articulatory marking over time
Philipp Buech, Rachid Ridouane, Anne Hermes
Phase Vocoder For Time Stretch Based On Center Frequency Estimation
Donghyeon Kim, Bowon Lee
PHO-LID: A Unified Model Incorporating Acoustic-Phonetic and Phonotactic Information for Language Identification
Hexin Liu, Leibny Paola Garcia Perera, Andy Khong et al.
Phonetic Analysis of Self-supervised Representations of English Speech
Dan Wells, Hao Tang, Korin Richmond
Phonetic Embedding for ASR Robustness in Entity Resolution
Xiaozhou Zhou, Ruying Bao, William M. Campbell
Phonetic erosion and information structure in function words: the case of mia
Giuseppe Magistro, Claudia Crocco
PISA: PoIncaré Saliency-Aware Interpolative Augmentation
Ramit Sawhney, Megh Thakkar, Vishwa Shah et al.
PLCNet: Real-time Packet Loss Concealment with Semi-supervised Generative Adversarial Network
Baiyun Liu, Qi Song, Mingxue Yang et al.
Plugging a neural phoneme recognizer into a simple language model: a workflow for low-resource setting
Séverine Guillaume, Guillaume Wisniewski, Benjamin Galliot et al.
pMCT: Patched Multi-Condition Training for Robust Speech Recognition
Pablo Peso Parada, Agnieszka Dobrowolska, Karthikeyan Saravanan et al.
PM-MMUT: Boosted Phone-mask Data Augmentation using Multi-Modeling Unit Training for Phonetic-Reduction-Robust E2E Speech Recognition
Guodong Ma, Pengfei Hu, Nurmemet Yolwas et al.
PodcastMix: A dataset for separating music and speech in podcasts
Nicolás Schmidt, Jordi Pons, Marius Miron
PoeticTTS - Controllable Poetry Reading for Literary Studies
Julia Koch, Florian Lux, Nadja Schauffler et al.