conftrace_

Papers

Noise-Robust Voice Conversion by Conditional Denoising Training Using Latent Variables of Recording Quality and Environment INTERSPEECH 2024 LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning INTERSPEECH 2024 SRC4VC: Smartphone-Recorded Corpus for Voice Conversion Benchmark INTERSPEECH 2024 Audio-conditioned phonemic and prosodic annotation for building text-to-speech models from unlabeled speech data INTERSPEECH 2024 ChatGPT-EDSS: Empathetic Dialogue Speech Synthesis Trained from ChatGPT-derived Context Word Embeddings INTERSPEECH 2023 CALLS: Japanese Empathetic Dialogue Speech Corpus of Complaint Handling and Attentive Listening in Customer Center INTERSPEECH 2023 STUDIES: Corpus of Japanese Empathetic Dialogue Speech Towards Friendly Voice Agent INTERSPEECH 2022 DRSpeech: Degradation-Robust Text-to-Speech Synthesis with Frame-Level and Utterance-Level Acoustic Representation Learning INTERSPEECH 2022 A Unified Accent Estimation Method Based on Multi-Task Learning for Japanese Text-to-Speech INTERSPEECH 2022 Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation INTERSPEECH 2022 Acoustic Modeling for End-to-End Empathetic Dialogue Speech Synthesis Using Linguistic and Prosodic Contexts of Dialogue History INTERSPEECH 2022 Phrase Break Prediction with Bidirectional Encoder Representations in Japanese Text-to-Speech Synthesis INTERSPEECH 2021 Face2Speech: Towards Multi-Speaker Text-to-Speech Synthesis Using an Embedding Vector Predicted from a Face Image INTERSPEECH 2020 Model Integration for HMM- and DNN-Based Speech Synthesis Using Product-of-Experts Framework INTERSPEECH 2016