Tomoki Koriyama
14 papers · 2016–2024 · 1 conference
Achievements
Hot Topic Early Bird
Interdisciplinary Bridge
Taxonomy Completionist (10)
Keyword Pioneer
Academic Marathon (8)
Cross-Pollinator (12)
Renaissance Researcher (5)
Century Club (14)
Conference Pioneer
Keyword Collector (62)
Conferences
INTERSPEECH (14)
Keywords
speech synthesis (7)
latent variable model (3)
speech quality (3)
deep gaussian process (3)
domain adaptation (2)
cross-lingual synthesis (2)
self-supervised learning (2)
bayesian model (1)
ensemble learning (1)
self-attention mechanism (1)
speaker individuality (1)
disfluency detection (1)
speaker verification (1)
gaussian process (1)
speaker embedding (1)
moment matching (1)
harmonic structure (1)
model merging (1)
acoustic model (1)
gaussian process regression (1)
Papers
Frame-Wise Breath Detection with Self-Training: An Exploration of Enhancing Breath Naturalness in Text-to-Speech
INTERSPEECH 2024
VAE-based Phoneme Alignment Using Gradient Annealing and SSL Acoustic Features
INTERSPEECH 2024
An Attribute Interpolation Method in Speech Synthesis by Model Merging
INTERSPEECH 2024
UTMOS: UTokyo-SaruLab System for VoiceMOS Challenge 2022
INTERSPEECH 2022
Predicting VQVAE-based Character Acting Style from Quotation-Annotated Text for Audiobook Speech Synthesis
INTERSPEECH 2022
Cross-Lingual Speaker Adaptation Using Domain Adaptation and Speaker Consistency Loss for Text-To-Speech Synthesis
INTERSPEECH 2021
Sequence-to-Sequence Learning for Deep Gaussian Process Based Speech Synthesis Using Self-Attention GP Layer
INTERSPEECH 2021
Harmonic WaveGAN: GAN-Based Speech Waveform Generation Model with Harmonic Structure Discriminator
INTERSPEECH 2021
Investigating Effective Additional Contextual Factors in DNN-Based Spontaneous Speech Synthesis
INTERSPEECH 2020
Cross-Lingual Text-To-Speech Synthesis via Domain Adaptation and Perceptual Similarity Regression in Speaker Space
INTERSPEECH 2020
Multi-Speaker Text-to-Speech Synthesis Using Deep Gaussian Processes
INTERSPEECH 2020
Semi-Supervised Prosody Modeling Using Deep Gaussian Process Latent Variable Model
INTERSPEECH 2019
Sampling-Based Speech Parameter Generation Using Moment-Matching Networks
INTERSPEECH 2017
Unsupervised Stress Information Labeling Using Gaussian Process Latent Variable Model for Statistical Speech Synthesis
INTERSPEECH 2016