Ryuichi Yamamoto

14 papers · 2019–2024 · 1 conference · across top CS/AI conferences

Achievements

+3 more ↓

🐝 Cross-Pollinator (12) 🗺️ Taxonomy Completionist (20) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🏃 Academic Marathon (5)

🗃️ Keyword Collector (68) 💎 Century Club (14) ⚡ Prolific Year (5)

Conferences

INTERSPEECH (14)

Top co-authors

Kentaro Tachibana (8) Eunwoo Song (6) Jae-Min Kim (6) Min-Jae Hwang (4) Byeongseon Park (3) Hyun-Wook Yoon (3) Ohsung Kwon (3) Yuma Shirahata (3) Shinnosuke Takamichi (2) Kentaro Seki (2)

Keywords

voice conversion (4) speech synthesis (3) neural vocoder (2) data augmentation (2) text-to-speech synthesis (2) prosody analysis (1) audio classification (1) multi-task learning (1) speech enhancement (1) benchmark dataset (1) deepfake detection (1) acoustic representation (1) acoustic model (1) hierarchical structure (1) support vector machine (1) pseudo labeling (1) generative model (1) latent variable (1) noise robustness (1) automatic speech recognition (1)

Papers

Audio-conditioned phonemic and prosodic annotation for building text-to-speech models from unlabeled speech data INTERSPEECH 2024 Noise-Robust Voice Conversion by Conditional Denoising Training Using Latent Variables of Recording Quality and Environment INTERSPEECH 2024 LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning INTERSPEECH 2024 CtrSVDD: A Benchmark Dataset and Baseline Analysis for Controlled Singing Voice Deepfake Detection INTERSPEECH 2024 SRC4VC: Smartphone-Recorded Corpus for Voice Conversion Benchmark INTERSPEECH 2024 Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation INTERSPEECH 2022 DRSpeech: Degradation-Robust Text-to-Speech Synthesis with Frame-Level and Utterance-Level Acoustic Representation Learning INTERSPEECH 2022 A Unified Accent Estimation Method Based on Multi-Task Learning for Japanese Text-to-Speech INTERSPEECH 2022 TTS-by-TTS 2: Data-Selective Augmentation for Neural Speech Synthesis Using Ranking Support Vector Machine with Variational Autoencoder INTERSPEECH 2022 Language Model-Based Emotion Prediction Methods for Emotional Speech Synthesis Systems INTERSPEECH 2022 Phrase Break Prediction with Bidirectional Encoder Representations in Japanese Text-to-Speech Synthesis INTERSPEECH 2021 High-Fidelity Parallel WaveGAN with Multi-Band Harmonic-Plus-Noise Model INTERSPEECH 2021 Neural Text-to-Speech with a Modeling-by-Generation Excitation Vocoder INTERSPEECH 2020 Probability Density Distillation with Generative Adversarial Networks for High-Quality Parallel Waveform Generation INTERSPEECH 2019