
Phonetic Enhanced Language Modeling for Text-to-Speech Synthesis

Kun Zhou, Shengkui Zhao, Yukun Ma et al.

2024 INTERSPEECH

PhoneViz: exploring alignments at a glance

Margot Masson, Erfan A. Shams, Iona Gessinger et al.

2024 INTERSPEECH

Phonological Feature Detection for US English using the Phonet Library

Harsha Veena Tadavarthy, Austin Jones, Margaret E. L. Renwick

2024 INTERSPEECH

Phonological-Level Mispronunciation Detection and Diagnosis

Mostafa Shahin, Beena Ahmed

2024 INTERSPEECH

Phonological Symmetry Does Not Predict Generalization of Perceptual Adaptation to Vowels

Zuheyra Tokac, Jennifer Cole

2024 INTERSPEECH

Pinyin Regularization in Error Correction for Chinese Speech Recognition with Large Language Models

Zhiyuan Tang, Dong Wang, Shen Huang et al.

2024 INTERSPEECH

Pitch-Aware RNN-T for Mandarin Chinese Mispronunciation Detection and Diagnosis

Xintong Wang, Mingqian Shi, Ye Wang

2024 INTERSPEECH

Pitch-driven adjustments in tongue positions: Insights from ultrasound imaging

May Pik Yu Chan, Jianjing Kuang

2024 INTERSPEECH

PitchFlow: adding pitch control to a Flow-matching based TTS model

Tasnima Sadekova, Mikhail Kudinov, Vadim Popov et al.

2024 INTERSPEECH

PLDNet: PLD-Guided Lightweight Deep Network Boosted by Efficient Attention for Handheld Dual-Microphone Speech Enhancement

Nan Zhou, Youhai Jiang, Jialin Tan et al.

2024 INTERSPEECH

PL-TTS: A Generalizable Prompt-based Diffusion TTS Augmented by Large Language Model

Shuhua Li, Qirong Mao, Jiatong Shi

2024 INTERSPEECH

Positional Description for Numerical Normalization

Deepanshu Gupta, Javier Latorre

2024 INTERSPEECH

Post-Net: A linguistically inspired sequence-dependent transformed neural architecture for automatic syllable stress detection

Sai Harshitha Aluru, Jhansi Mallela, Chiranjeevi Yarra

2024 INTERSPEECH

PPPR: Portable Plug-in Prompt Refiner for Text to Audio Generation

Shuchen Shi, Ruibo Fu, Zhengqi Wen et al.

2024 INTERSPEECH

Pragmatically similar utterance finder demonstration

Nigel G. Ward, Andres Segura

2024 INTERSPEECH

Predefined Prototypes for Intra-Class Separation and Disentanglement

Antonio Almudévar, Théo Mariotte, Alfonso Ortega et al.

2024 INTERSPEECH

Predicting Acute Pain Levels Implicitly from Vocal Features

Jennifer Williams, Eike Schneiders, Henry Card et al.

2024 INTERSPEECH

Predicting Heart Activity from Speech using Data-driven and Knowledge-based features

Gasser Elbanna, Zohreh Mostaani, Mathew Magimai.-Doss

2024 INTERSPEECH

Preliminary Investigation of Psychometric Properties of a Novel Multimodal Dialog Based Affect Production Task in Children and Adolescents with Autism

Carly Demopoulos, Linnea Lampinen, Cristian Preciado et al.

2024 INTERSPEECH

Preprocessing for acoustic-to-articulatory inversion using real-time MRI movies of Japanese speech

Anna Oura, Hideaki Kikuchi, Tetsunori Kobayashi

2024 INTERSPEECH

Preservation, conservation and phonetic study of the voices of Italian poets: A study on the seven years of the VIP archive

Federico Lo Iacono, Valentina Colonna, Antonio Romano

2024 INTERSPEECH

Pre-trained Feature Fusion and Matching for Mild Cognitive Impairment Detection

Junwen Duan, Fangyuan Wei, Hong-Dong Li et al.

2024 INTERSPEECH

Pretraining End-to-End Keyword Search with Automatically Discovered Acoustic Units

Bolaji Yusuf, Jan Honza Cernocky, Murat Saraçlar

2024 INTERSPEECH

Pre-training Feature Guided Diffusion Model for Speech Enhancement

Yiyuan Yang, Niki Trigoni, Andrew Markham

2024 INTERSPEECH

Pre-training Neural Transducer-based Streaming Voice Conversion for Faster Convergence and Alignment-free Training

Hiroki Kanagawa, Takafumi Moriya, Yusuke Ijima

2024 INTERSPEECH
