Papers

8,761 papers found
Phonetic Enhanced Language Modeling for Text-to-Speech Synthesis
Kun Zhou, Shengkui Zhao, Yukun Ma et al.
2024 INTERSPEECH
PhoneViz: exploring alignments at a glance
Margot Masson, Erfan A. Shams, Iona Gessinger et al.
2024 INTERSPEECH
Phonological Feature Detection for US English using the Phonet Library
Harsha Veena Tadavarthy, Austin Jones, Margaret E. L. Renwick
2024 INTERSPEECH
2024 INTERSPEECH
2024 INTERSPEECH
PitchFlow: adding pitch control to a Flow-matching based TTS model
Tasnima Sadekova, Mikhail Kudinov, Vadim Popov et al.
2024 INTERSPEECH
Positional Description for Numerical Normalization
Deepanshu Gupta, Javier Latorre
2024 INTERSPEECH
PPPR: Portable Plug-in Prompt Refiner for Text to Audio Generation
Shuchen Shi, Ruibo Fu, Zhengqi Wen et al.
2024 INTERSPEECH
2024 INTERSPEECH
Predefined Prototypes for Intra-Class Separation and Disentanglement
Antonio Almudévar, Théo Mariotte, Alfonso Ortega et al.
2024 INTERSPEECH
Predicting Acute Pain Levels Implicitly from Vocal Features
Jennifer Williams, Eike Schneiders, Henry Card et al.
2024 INTERSPEECH
Predicting Heart Activity from Speech using Data-driven and Knowledge-based features
Gasser Elbanna, Zohreh Mostaani, Mathew Magimai.-Doss
2024 INTERSPEECH
2024 INTERSPEECH
2024 INTERSPEECH
2024 INTERSPEECH
Pre-training Feature Guided Diffusion Model for Speech Enhancement
Yiyuan Yang, Niki Trigoni, Andrew Markham
2024 INTERSPEECH