Hirokazu Kameoka
20 papers · 2016–2024 · 1 conference · across top CS/AI conferences
Achievements
Jump to papers ↓+10 more ↓ Show less ↑
🏃 Academic Marathon (8) 🐣 Hot Topic Early Bird 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐝 Cross-Pollinator (13)
🐝
Cross-Pollinator
(13)
🐣
Hot Topic Early Bird
🏠
Conference Loyalist
(20)
🤝
Dynamic Duo
(10)
🔥
Unstoppable
(6)
💎
Century Club
(20)
🚀
Conference Pioneer
📈
Trend Setter
⚡
Prolific Year
(6)
🗃️
Keyword Collector
(73)
Conferences
INTERSPEECH (20)
Top co-authors
Research topics
Keywords
voice conversion
(9)
speech synthesis
(7)
generative adversarial network
(6)
gaussian mixture model
(3)
neural vocoder
(3)
speech enhancement
(3)
hidden markov model
(2)
deep neural network
(2)
knowledge distillation
(2)
non-negative matrix factorization
(2)
convolutional neural network
(2)
majorization minimization
(1)
automatic speech recognition
(1)
audio source separation
(1)
statistical model
(1)
maximum likelihood
(1)
parameter estimation
(1)
disentangled representation
(1)
fundamental frequency
(1)
diffusion model
(1)
Papers
FastVoiceGrad: One-step Diffusion-Based Voice Conversion with Adversarial Conditional Diffusion Distillation
INTERSPEECH 2024
PRVAE-VC2: Non-Parallel Voice Conversion by Distillation of Speech Representations
INTERSPEECH 2024
CFVC: Conditional Filtering for Controllable Voice Conversion
INTERSPEECH 2023
iSTFTNet2: Faster and More Lightweight iSTFT-Based Neural Vocoder Using 1D-2D CNN
INTERSPEECH 2023
MISRNet: Lightweight Neural Vocoder Using Multi-Input Single Shared Residual Blocks
INTERSPEECH 2022
CAUSE: Crossmodal Action Unit Sequence Estimation from Speech
INTERSPEECH 2022
StarGAN-VC+ASR: StarGAN-Based Non-Parallel Voice Conversion Regularized by Automatic Speech Recognition
INTERSPEECH 2021
CycleGAN-VC3: Examining and Improving CycleGAN-VCs for Mel-Spectrogram Conversion
INTERSPEECH 2020
Voice Transformer Network: Sequence-to-Sequence Voice Conversion Using Transformer with Text-to-Speech Pretraining
INTERSPEECH 2020
A Modified Algorithm for Multiple Input Spectrogram Inversion
INTERSPEECH 2019
StarGAN-VC2: Rethinking Conditional Methods for StarGAN-Based Voice Conversion
INTERSPEECH 2019
Physically Constrained Statistical F0Prediction for Electrolaryngeal Speech Enhancement
INTERSPEECH 2017
Generative Adversarial Network-Based Postfilter for STFT Spectrograms
INTERSPEECH 2017
Speech Enhancement Using Non-Negative Spectrogram Models with Mel-Generalized Cepstral Regularization
INTERSPEECH 2017
Sequence-to-Sequence Voice Conversion with Similarity Metric Learned Using Generative Adversarial Networks
INTERSPEECH 2017
Direct Modeling of Frequency Spectra and Waveform Generation Based on Phase Recovery for DNN-Based Speech Synthesis
INTERSPEECH 2017
DNN-SPACE: DNN-HMM-Based Generative Model of Voice F0Contours for Statistical Phrase/Accent Command Estimation
INTERSPEECH 2017
Majorisation-Minimisation Based Optimisation of the Composite Autoregressive System with Application to Glottal Inverse Filtering
INTERSPEECH 2016
Semi-Supervised Joint Enhancement of Spectral and Cepstral Sequences of Noisy Speech
INTERSPEECH 2016
Acoustic-to-Articulatory Inversion Mapping Based on Latent Trajectory Gaussian Mixture Model
INTERSPEECH 2016