Kentaro Mitsui
7 papers · 2020–2024 · 4 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+1 more ↓ Show less ↑
π Interdisciplinary Bridge π§ Keyword Pioneer π Conference Polyglot (4) π Cross-Pollinator (12) π Renaissance Researcher (6)
πΊοΈ
Taxonomy Completionist
(22)
Conferences
INTERSPEECH (4)
ACL (1)
COLING (1)
EMNLP (1)
Top co-authors
Keywords
speech synthesis
(4)
large language model
(2)
multimodal learning
(1)
latent variable model
(1)
latent representation
(1)
variational autoencoder
(1)
pre-trained model
(1)
stable diffusion
(1)
parameter-efficient adaptation
(1)
deep gaussian process
(1)
generative pre-trained transformer
(1)
spoken dialogue system
(1)
neural vocoder
(1)
japanese language
(1)
speech representation
(1)
speaking style
(1)
speech generation
(1)
end-to-end speech recognition
(1)
speaker code
(1)
parallel wavegan
(1)
Papers
Release of Pre-Trained Models for the Japanese Language
COLING 2024
PSLM: Parallel Generation of Text and Speech with LLMs for Low-Latency Spoken Dialogue Systems
EMNLP 2024
Integrating Pre-Trained Speech and Language Models for End-to-End Speech Recognition
ACL 2024
UniFLG: Unified Facial Landmark Generator from Text or Speech
INTERSPEECH 2023
MSR-NV: Neural Vocoder Using Multiple Sampling Rates
INTERSPEECH 2022
End-to-End Text-to-Speech Based on Latent Representation of Speaking Styles Using Spontaneous Dialogue
INTERSPEECH 2022
Multi-Speaker Text-to-Speech Synthesis Using Deep Gaussian Processes
INTERSPEECH 2020