OverFlow: Putting flows on top of neural transducers for better TTS

Shivam Mehta, Ambika Kirkland, Harm Lameris et al.

2023 INTERSPEECH

Overlap Aware Continuous Speech Separation without Permutation Invariant Training

Linfeng Yu, Wangyou Zhang, Chenda Li et al.

2023 INTERSPEECH

Parameter-efficient Dysarthric Speech Recognition Using Adapter Fusion and Householder Transformation

Jinzi Qi, Hugo Van hamme

2023 INTERSPEECH

Parameter-Efficient Learning for Text-to-Speech Accent Adaptation

Li-Jen Yang, Chao-Han Huck Yang, Jen-Tzung Chien

2023 INTERSPEECH

Parameter-Efficient Low-Resource Dialogue State Tracking by Prompt Tuning

Mingyu Derek Ma, Jiun-Yu Kao, Shuyang Gao et al.

2023 INTERSPEECH

Parameter Selection for Analyzing Conversations with Autism Spectrum Disorder

Tahiya Chowdhury, Veronica Romero, Amanda Stent

2023 INTERSPEECH

Pardon my disfluency: The impact of disfluency effects on the perception of speaker competence and confidence

Ambika Kirkland, Joakim Gustafson, Éva Székely

2023 INTERSPEECH

Parsing dialog turns with prosodic features in English

Elizabeth Nielsen, Mark Steedman, Sharon Goldwater

2023 INTERSPEECH

Patch-Mix Contrastive Learning with Audio Spectrogram Transformer on Respiratory Sound Classification

Sangmin Bae, June-Woo Kim, Won-Yang Cho et al.

2023 INTERSPEECH

PATCorrect: Non-autoregressive Phoneme-augmented Transformer for ASR Error Correction

Ziji Zhang, Zhehui Wang, Rajesh Kamma et al.

2023 INTERSPEECH

PCNN: A Lightweight Parallel Conformer Neural Network for Efficient Monaural Speech Enhancement

Xinmeng Xu, Weiping Tu, Yuhong Yang

2023 INTERSPEECH

Perception of Incomplete Voicing Neutralization of Obstruents in Tohoku Japanese

Mafuyu Kitahara, Naoya Watabe, Hiroto Noguchi et al.

2023 INTERSPEECH

Perceptual and Task-Oriented Assessment of a Semantic Metric for ASR Evaluation

Janine Rugayan, Giampiero Salvi, Torbjørn Svendsen

2023 INTERSPEECH

Perceptual Improvement of Deep Neural Network (DNN) Speech Coder Using Parametric and Non-parametric Density Models

Joon Byun, Seungmin Shin, Jongmo Sung et al.

2023 INTERSPEECH

Personality-aware Training based Speaker Adaptation for End-to-end Speech Recognition

Yue Gu, Zhihao Du, Shiliang Zhang et al.

2023 INTERSPEECH

Personalization for BERT-based Discriminative Speech Recognition Rescoring

Jari Kolehmainen, Yile Gu, Aditya Gourav et al.

2023 INTERSPEECH

Personalization for Robust Voice Pathology Detection in Sound Waves

Khanh-Tung Tran, Truong Hoang, Duy Khuong Nguyen et al.

2023 INTERSPEECH

Personalized Acoustic Scene Classification in Ultra-low Power Embedded Devices Using Privacy-preserving Data Augmentation

Timm Koppelmann, Semih Agcaer, Rainer Martin

2023 INTERSPEECH

Personalized Adaptation with Pre-trained Speech Encoders for Continuous Emotion Recognition

Minh Tran, Yufeng Yin, Mohammad Soleymani

2023 INTERSPEECH

Personalized Dereverberation of Speech

Ruilin Xu, Gurunandan Krishnan, Changxi Zheng et al.

2023 INTERSPEECH

Personalized Predictive ASR for Latency Reduction in Voice Assistants

Andreas Schwarz, Di He, Maarten Van Segbroeck et al.

2023 INTERSPEECH

Personal Primer Prototype 1: Invitation to Make Your Own Embooked Speech-Based Educational Artifact

Daniel D. Hromada, Hyungjoong Kim

2023 INTERSPEECH

Phase perturbation improves channel robustness for speech spoofing countermeasures

Yongyi Zang, You Zhang, Zhiyao Duan

2023 INTERSPEECH

Phonemic competition in end-to-end ASR models

Louis ten Bosch, Martijn Bentum, Lou Boves

2023 INTERSPEECH

Phonetic and Prosody-aware Self-supervised Learning Approach for Non-native Fluency Scoring

Kaiqi Fu, Shaojun Gao, Shuju Shi et al.

2023 INTERSPEECH
