
Pay More Attention to History: A Context Modeling Strategy for Conversational Text-to-SQL

Yuntao Li, Hanchu Zhang, Yutian Li et al.

2022 INTERSPEECH

PEAF: Learnable Power Efficient Analog Acoustic Features for Audio Recognition

Boris Bergsma, Minhao Yang, Milos Cernak

2022 INTERSPEECH

Perceived prominence and downstep in Japanese

Hyun Kyung Hwang, Manami Hirayama, Takaomi Kato

2022 INTERSPEECH

PercepNet+: A Phase and SNR Aware PercepNet for Real-Time Speech Enhancement

Xiaofeng Ge, Jiangyu Han, Yanhua Long et al.

2022 INTERSPEECH

PERCEPT-R: An Open-Access American English Child/Clinical Speech Corpus Specialized for the Audio Classification of /ɹ/

Nina Benway, Jonathan L. Preston, Elaine Hitchcock et al.

2022 INTERSPEECH

Perceptual Characteristics Based Multi-objective Model for Speech Enhancement

Chiang-Jen Peng, Yun-Ju Chan, Yih-Liang Shen et al.

2022 INTERSPEECH

Perceptual Contrast Stretching on Target Feature for Speech Enhancement

Rong Chao, Cheng Yu, Szu-wei Fu et al.

2022 INTERSPEECH

Perceptual Evaluation of Penetrating Voices through a Semantic Differential Method

Tatsuya Kitamura, Naoki Kunimoto, Hideki Kawahara et al.

2022 INTERSPEECH

Performance Improvement of Speech Emotion Recognition by Neutral Speech Detection Using Autoencoder and Intermediate Representation

Jennifer Santoso, Takeshi Yamada, Kenkichi Ishizuka et al.

2022 INTERSPEECH

Personalized Acoustic Echo Cancellation for Full-duplex Communications

Shimin Zhang, Ziteng Wang, Yukai Ju et al.

2022 INTERSPEECH

Personalized Keyword Spotting through Multi-task Learning

Seunghan Yang, Byeonggeun Kim, Inseop Chung et al.

2022 INTERSPEECH

Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition

Shaojin Ding, Rajeev Rikhye, Qiao Liang et al.

2022 INTERSPEECH

Pharyngealization in Amazigh: Acoustic and articulatory marking over time

Philipp Buech, Rachid Ridouane, Anne Hermes

2022 INTERSPEECH

Phase Vocoder For Time Stretch Based On Center Frequency Estimation

Donghyeon Kim, Bowon Lee

2022 INTERSPEECH

PHO-LID: A Unified Model Incorporating Acoustic-Phonetic and Phonotactic Information for Language Identification

Hexin Liu, Leibny Paola Garcia Perera, Andy Khong et al.

2022 INTERSPEECH

Phonetic Analysis of Self-supervised Representations of English Speech

Dan Wells, Hao Tang, Korin Richmond

2022 INTERSPEECH

Phonetic Embedding for ASR Robustness in Entity Resolution

Xiaozhou Zhou, Ruying Bao, William M. Campbell

2022 INTERSPEECH

Phonetic erosion and information structure in function words: the case of mia

Giuseppe Magistro, Claudia Crocco

2022 INTERSPEECH

PISA: PoIncaré Saliency-Aware Interpolative Augmentation

Ramit Sawhney, Megh Thakkar, Vishwa Shah et al.

2022 INTERSPEECH

PLCNet: Real-time Packet Loss Concealment with Semi-supervised Generative Adversarial Network

Baiyun Liu, Qi Song, Mingxue Yang et al.

2022 INTERSPEECH

Plugging a neural phoneme recognizer into a simple language model: a workflow for low-resource setting

Séverine Guillaume, Guillaume Wisniewski, Benjamin Galliot et al.

2022 INTERSPEECH

pMCT: Patched Multi-Condition Training for Robust Speech Recognition

Pablo Peso Parada, Agnieszka Dobrowolska, Karthikeyan Saravanan et al.

2022 INTERSPEECH

PM-MMUT: Boosted Phone-mask Data Augmentation using Multi-Modeling Unit Training for Phonetic-Reduction-Robust E2E Speech Recognition

Guodong Ma, Pengfei Hu, Nurmemet Yolwas et al.

2022 INTERSPEECH

PodcastMix: A dataset for separating music and speech in podcasts

Nicolás Schmidt, Jordi Pons, Marius Miron

2022 INTERSPEECH

PoeticTTS - Controllable Poetry Reading for Literary Studies

Julia Koch, Florian Lux, Nadja Schauffler et al.

2022 INTERSPEECH
