Hisashi Kawai
32 papers · 2011–2024 · 4 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+16 more ↓ Show less ↑
πΊοΈ Taxonomy Completionist (23) π§ Keyword Pioneer π Interdisciplinary Bridge π Renaissance Researcher (6) π£ Hot Topic Early Bird
π
Interdisciplinary Bridge
π£
Hot Topic Early Bird
π§
Keyword Pioneer
π
Keyword Trendsetter Combo
(4)
π
Conference Loyalist
(28)
π§¬
Topic Evolution
π€
Dynamic Duo
(15)
π¬
Deep Specialist
(10)
π
Keyword Champion
(2)
β
The Questioner
π
Conference Pioneer
β‘
Prolific Year
(6)
π₯
Unstoppable
(9)
ποΈ
Keyword Collector
(58)
π
Trend Setter
π
Century Club
(32)
Conferences
INTERSPEECH (28)
IJCNLP (2)
COLING (1)
CORL (1)
Top co-authors
Keywords
acoustic model
(5)
neural vocoder
(4)
automatic speech recognition
(4)
deep neural network
(4)
spoken language identification
(4)
speech recognition
(4)
neural network
(4)
speech enhancement
(3)
bidirectional recurrent neural network
(3)
fundamental frequency
(3)
acoustic modeling
(3)
feature extraction
(2)
speech synthesis
(2)
language model
(2)
text-to-speech synthesis
(2)
grapheme-to-phoneme conversion
(2)
noise robustness
(2)
attention mechanism
(2)
recurrent neural network
(2)
feature representation
(2)
Papers
Investigating ASR Error Correction with Large Language Model and Multilingual 1-best Hypotheses
INTERSPEECH 2024
Challenge of Singing Voice Synthesis Using Only Text-To-Speech Corpus With FIRNet Source-Filter Neural Vocoder
INTERSPEECH 2024
Mobile PresenTra: NICT fast neural text-to-speech system on smartphones with incremental inference of MS-FC-HiFi-GAN for law-latency synthesis
INTERSPEECH 2024
E2E-S2S-VC: End-To-End Sequence-To-Sequence Voice Conversion
INTERSPEECH 2023
Transducer-based language embedding for spoken language identification
INTERSPEECH 2022
Can We Train a Language Model Inside an End-to-End ASR Model? - Investigating Effective Implicit Language Modeling
COLING 2022
Noise Robust Acoustic Modeling for Single-Channel Speech Recognition Based on a Stream-Wise Transformer Architecture
INTERSPEECH 2021
Quasi-Periodic Parallel WaveGAN Vocoder: A Non-Autoregressive Pitch-Dependent Dilated Convolution Model for Parametric Speech Generation
INTERSPEECH 2020
Investigation of NICT Submission for Short-Duration Speaker Verification Challenge 2020
INTERSPEECH 2020
One-Pass Single-Channel Noisy Speech Recognition Using a Combination of Noisy and Enhanced Features
INTERSPEECH 2019
Multimodal Attention Branch Network for Perspective-Free Sentence Generation
CORL 2019
Real-Time Neural Text-to-Speech with Sequence-to-Sequence Acoustic Model and WaveGlow or Single Gaussian WaveRNN Vocoders
INTERSPEECH 2019
End-to-End Articulatory Attribute Modeling for Low-Resource Multilingual Speech Recognition
INTERSPEECH 2019
Investigating Radical-Based End-to-End Speech Recognition Systems for Chinese Dialects and Japanese
INTERSPEECH 2019
Incorporating Symbolic Sequential Modeling for Speech Enhancement
INTERSPEECH 2019
Class-Wise Centroid Distance Metric Learning for Acoustic Event Detection
INTERSPEECH 2019
Improving Transformer-Based Speech Recognition Systems with Compressed Structure and Speech Attributes Augmentation
INTERSPEECH 2019
Duration Modeling with Global Phoneme-Duration Vectors
INTERSPEECH 2019
Temporal Attentive Pooling for Acoustic Event Detection
INTERSPEECH 2018
Feature Representation of Short Utterances Based on Knowledge Distillation for Spoken Language Identification
INTERSPEECH 2018
Improving CTC-based Acoustic Model with Very Deep Residual Time-delay Neural Networks
INTERSPEECH 2018
Multilingual Grapheme-to-Phoneme Conversion with Global Character Vectors
INTERSPEECH 2018
Global Syllable Vectors for Building TTS Front-End with Deep Learning
INTERSPEECH 2017
Conditional Generative Adversarial Nets Classifier for Spoken Language Identification
INTERSPEECH 2017
Model Integration for HMM- and DNN-Based Speech Synthesis Using Product-of-Experts Framework
INTERSPEECH 2016
Using Zero-Frequency Resonator to Extract Multilingual Intonation Structure
INTERSPEECH 2016
Investigation of Semi-Supervised Acoustic Model Training Based on the Committee of Heterogeneous Neural Networks
INTERSPEECH 2016
F0Contour Analysis Based on Empirical Mode Decomposition for DNN Acoustic Modeling in Mandarin Speech Recognition
INTERSPEECH 2016
Pair-Wise Distance Metric Learning of Neural Network Model for Spoken Language Identification
INTERSPEECH 2016
Maximum a posteriori Based Decoding for CTC Acoustic Models
INTERSPEECH 2016
Improving Related Entity Finding via Incorporating Homepages and Recognizing Fine-grained Entities
IJCNLP 2011
Answering Complex Questions via Exploiting Social Q&A Collection
IJCNLP 2011