Yuki Saito
21 papers · 2017–2025 · 5 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+9 more ↓ Show less ↑
🌍 Conference Polyglot (5) 🌈 Renaissance Researcher (5) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🏃 Academic Marathon (8)
🧭
Keyword Pioneer
🐝
Cross-Pollinator
(9)
🌍
Conference Polyglot
(5)
🤝
Dynamic Duo
(15)
🏆
Keyword Champion
(2)
🔥
Unstoppable
(6)
💎
Century Club
(21)
⚡
Prolific Year
(5)
🗃️
Keyword Collector
(84)
Conferences
INTERSPEECH (17)
ECCV (1)
EMNLP (1)
ICLR (1)
IJCAI (1)
Top co-authors
Keywords
voice conversion
(5)
speech synthesis
(5)
empathetic dialogue
(3)
speaker embedding
(3)
domain adaptation
(2)
speaker adaptation
(2)
dialogue system
(2)
speaker individuality
(2)
speech corpus
(2)
cross-lingual synthesis
(2)
deep neural network
(2)
text-to-speech synthesis
(2)
speaker verification
(1)
speech enhancement
(1)
feature attribution
(1)
knowledge distillation
(1)
blind source separation
(1)
face recognition
(1)
posterior probability
(1)
emotion recognition
(1)
Papers
Explaining Black-box Model Predictions via Two-level Nested Feature Attributions with Consistency Property
IJCAI 2025
Static Word Embeddings for Sentence Semantic Representation
EMNLP 2025
Mastering Task Arithmetic: $\tau$Jp as a Key Indicator for Weight Disentanglement
ICLR 2025
Spatial Voice Conversion: Voice Conversion Preserving Spatial Information and Non-target Signals
INTERSPEECH 2024
Frame-Wise Breath Detection with Self-Training: An Exploration of Enhancing Breath Naturalness in Text-to-Speech
INTERSPEECH 2024
Noise-Robust Voice Conversion by Conditional Denoising Training Using Latent Variables of Recording Quality and Environment
INTERSPEECH 2024
SRC4VC: Smartphone-Recorded Corpus for Voice Conversion Benchmark
INTERSPEECH 2024
HumanDiffusion: diffusion model using perceptual gradients
INTERSPEECH 2023
CALLS: Japanese Empathetic Dialogue Speech Corpus of Complaint Handling and Attentive Listening in Customer Center
INTERSPEECH 2023
ChatGPT-EDSS: Empathetic Dialogue Speech Synthesis Trained from ChatGPT-derived Context Word Embeddings
INTERSPEECH 2023
Human-in-the-loop Speaker Adaptation for DNN-based Multi-speaker TTS
INTERSPEECH 2022
Acoustic Modeling for End-to-End Empathetic Dialogue Speech Synthesis Using Linguistic and Prosodic Contexts of Dialogue History
INTERSPEECH 2022
Predicting VQVAE-based Character Acting Style from Quotation-Annotated Text for Audiobook Speech Synthesis
INTERSPEECH 2022
STUDIES: Corpus of Japanese Empathetic Dialogue Speech Towards Friendly Voice Agent
INTERSPEECH 2022
Cross-Lingual Speaker Adaptation Using Domain Adaptation and Speaker Consistency Loss for Text-To-Speech Synthesis
INTERSPEECH 2021
Investigating Effective Additional Contextual Factors in DNN-Based Spontaneous Speech Synthesis
INTERSPEECH 2020
Exchangeable Deep Neural Networks for Set-to-Set Matching and Learning
ECCV 2020
Face2Speech: Towards Multi-Speaker Text-to-Speech Synthesis Using an Embedding Vector Predicted from a Face Image
INTERSPEECH 2020
Real-Time, Full-Band, Online DNN-Based Voice Conversion System Using a Single CPU
INTERSPEECH 2020
Cross-Lingual Text-To-Speech Synthesis via Domain Adaptation and Perceptual Similarity Regression in Speaker Space
INTERSPEECH 2020
Voice Conversion Using Sequence-to-Sequence Learning of Context Posterior Probabilities
INTERSPEECH 2017