Yuki Saito

21 papers · 2017–2025 · 5 conferences · across top CS/AI conferences

Achievements

+9 more ↓

🌍 Conference Polyglot (5) 🌈 Renaissance Researcher (5) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🏃 Academic Marathon (8)

🧭 Keyword Pioneer 🐝 Cross-Pollinator (9) 🌍 Conference Polyglot (5) 🤝 Dynamic Duo (15) 🏆 Keyword Champion (2) 🔥 Unstoppable (6) 💎 Century Club (21) ⚡ Prolific Year (5) 🗃️ Keyword Collector (84)

Conferences

INTERSPEECH (17) ECCV (1) EMNLP (1) ICLR (1) IJCAI (1)

Top co-authors

Hiroshi Saruwatari (15) Shinnosuke Takamichi (14) Kentaro Tachibana (7) Tomoki Koriyama (5) Kentaro Seki (3) Ryotaro Shimizu (3) Ryuichi Yamamoto (2) Detai Xin (2) Yusuke Ijima (2) Yuto Nishimura (2)

Keywords

voice conversion (5) speech synthesis (5) empathetic dialogue (3) speaker embedding (3) domain adaptation (2) speaker adaptation (2) dialogue system (2) speaker individuality (2) speech corpus (2) cross-lingual synthesis (2) deep neural network (2) text-to-speech synthesis (2) speaker verification (1) speech enhancement (1) feature attribution (1) knowledge distillation (1) blind source separation (1) face recognition (1) posterior probability (1) emotion recognition (1)

Papers

Explaining Black-box Model Predictions via Two-level Nested Feature Attributions with Consistency Property IJCAI 2025 Static Word Embeddings for Sentence Semantic Representation EMNLP 2025 Mastering Task Arithmetic: $\tau$Jp as a Key Indicator for Weight Disentanglement ICLR 2025 Spatial Voice Conversion: Voice Conversion Preserving Spatial Information and Non-target Signals INTERSPEECH 2024 Frame-Wise Breath Detection with Self-Training: An Exploration of Enhancing Breath Naturalness in Text-to-Speech INTERSPEECH 2024 Noise-Robust Voice Conversion by Conditional Denoising Training Using Latent Variables of Recording Quality and Environment INTERSPEECH 2024 SRC4VC: Smartphone-Recorded Corpus for Voice Conversion Benchmark INTERSPEECH 2024 HumanDiffusion: diffusion model using perceptual gradients INTERSPEECH 2023 CALLS: Japanese Empathetic Dialogue Speech Corpus of Complaint Handling and Attentive Listening in Customer Center INTERSPEECH 2023 ChatGPT-EDSS: Empathetic Dialogue Speech Synthesis Trained from ChatGPT-derived Context Word Embeddings INTERSPEECH 2023 Human-in-the-loop Speaker Adaptation for DNN-based Multi-speaker TTS INTERSPEECH 2022 Acoustic Modeling for End-to-End Empathetic Dialogue Speech Synthesis Using Linguistic and Prosodic Contexts of Dialogue History INTERSPEECH 2022 Predicting VQVAE-based Character Acting Style from Quotation-Annotated Text for Audiobook Speech Synthesis INTERSPEECH 2022 STUDIES: Corpus of Japanese Empathetic Dialogue Speech Towards Friendly Voice Agent INTERSPEECH 2022 Cross-Lingual Speaker Adaptation Using Domain Adaptation and Speaker Consistency Loss for Text-To-Speech Synthesis INTERSPEECH 2021 Investigating Effective Additional Contextual Factors in DNN-Based Spontaneous Speech Synthesis INTERSPEECH 2020 Exchangeable Deep Neural Networks for Set-to-Set Matching and Learning ECCV 2020 Face2Speech: Towards Multi-Speaker Text-to-Speech Synthesis Using an Embedding Vector Predicted from a Face Image INTERSPEECH 2020 Real-Time, Full-Band, Online DNN-Based Voice Conversion System Using a Single CPU INTERSPEECH 2020 Cross-Lingual Text-To-Speech Synthesis via Domain Adaptation and Perceptual Similarity Regression in Speaker Space INTERSPEECH 2020 Voice Conversion Using Sequence-to-Sequence Learning of Context Posterior Probabilities INTERSPEECH 2017