Kentaro Tachibana

14 papers · 2016–2024 · 1 conference · across top CS/AI conferences

Achievements

+7 more ↓

🐣 Hot Topic Early Bird 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🏃 Academic Marathon (8) 🐝 Cross-Pollinator (12)

🏃 Academic Marathon (8) 🧭 Keyword Pioneer 🗃️ Keyword Collector (67) 🚀 Conference Pioneer 💎 Century Club (14) 🔥 Unstoppable (5) ⚡ Prolific Year (5)

Conferences

INTERSPEECH (14)

Top co-authors

Ryuichi Yamamoto (8) Yuki Saito (7) Shinnosuke Takamichi (6) Hiroshi Saruwatari (6) Byeongseon Park (3) Yuma Shirahata (3) Yuto Nishimura (2) Eiji Iimori (2) Kentaro Seki (2) Takuto Igarashi (2)

Keywords

text-to-speech synthesis (3) empathetic dialogue (3) speech synthesis (3) voice conversion (3) speech corpus (2) dialogue system (2) speaker embedding (1) pseudo labeling (1) multi-task learning (1) automatic speech recognition (1) face recognition (1) speech enhancement (1) data augmentation (1) acoustic representation (1) noise robustness (1) emotion recognition (1) hierarchical structure (1) hidden markov model (1) latent variable (1) acoustic model (1)

Papers

Noise-Robust Voice Conversion by Conditional Denoising Training Using Latent Variables of Recording Quality and Environment INTERSPEECH 2024 LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning INTERSPEECH 2024 SRC4VC: Smartphone-Recorded Corpus for Voice Conversion Benchmark INTERSPEECH 2024 Audio-conditioned phonemic and prosodic annotation for building text-to-speech models from unlabeled speech data INTERSPEECH 2024 ChatGPT-EDSS: Empathetic Dialogue Speech Synthesis Trained from ChatGPT-derived Context Word Embeddings INTERSPEECH 2023 CALLS: Japanese Empathetic Dialogue Speech Corpus of Complaint Handling and Attentive Listening in Customer Center INTERSPEECH 2023 STUDIES: Corpus of Japanese Empathetic Dialogue Speech Towards Friendly Voice Agent INTERSPEECH 2022 DRSpeech: Degradation-Robust Text-to-Speech Synthesis with Frame-Level and Utterance-Level Acoustic Representation Learning INTERSPEECH 2022 A Unified Accent Estimation Method Based on Multi-Task Learning for Japanese Text-to-Speech INTERSPEECH 2022 Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation INTERSPEECH 2022 Acoustic Modeling for End-to-End Empathetic Dialogue Speech Synthesis Using Linguistic and Prosodic Contexts of Dialogue History INTERSPEECH 2022 Phrase Break Prediction with Bidirectional Encoder Representations in Japanese Text-to-Speech Synthesis INTERSPEECH 2021 Face2Speech: Towards Multi-Speaker Text-to-Speech Synthesis Using an Embedding Vector Predicted from a Face Image INTERSPEECH 2020 Model Integration for HMM- and DNN-Based Speech Synthesis Using Product-of-Experts Framework INTERSPEECH 2016