Hisashi Kawai

32 papers · 2011–2024 · 4 conferences · across top CS/AI conferences

Achievements

+16 more ↓

🗺️ Taxonomy Completionist (23) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (6) 🐣 Hot Topic Early Bird

🌉 Interdisciplinary Bridge 🐣 Hot Topic Early Bird 🧭 Keyword Pioneer 🌟 Keyword Trendsetter Combo (4) 🏠 Conference Loyalist (28) 🧬 Topic Evolution 🤝 Dynamic Duo (15) 🔬 Deep Specialist (10) 🏆 Keyword Champion (2) ❓ The Questioner 🚀 Conference Pioneer ⚡ Prolific Year (6) 🔥 Unstoppable (9) 🗃️ Keyword Collector (58) 📈 Trend Setter 💎 Century Club (32)

Conferences

INTERSPEECH (28) IJCNLP (2) COLING (1) CORL (1)

Top co-authors

Xugang Lu (15) Peng Shen (11) Sheng Li (10) Yoshinori Shiga (6) Takuma Okamoto (5) Tomoki Toda (5) Tatsuya Kawahara (4) Yu Tsao (4) Jinfu Ni (4) Chenchen Ding (2)

Keywords

acoustic model (5) neural vocoder (4) automatic speech recognition (4) deep neural network (4) spoken language identification (4) speech recognition (4) neural network (4) speech enhancement (3) bidirectional recurrent neural network (3) fundamental frequency (3) acoustic modeling (3) feature extraction (2) speech synthesis (2) language model (2) text-to-speech synthesis (2) grapheme-to-phoneme conversion (2) noise robustness (2) attention mechanism (2) recurrent neural network (2) feature representation (2)

Papers

Investigating ASR Error Correction with Large Language Model and Multilingual 1-best Hypotheses INTERSPEECH 2024 Challenge of Singing Voice Synthesis Using Only Text-To-Speech Corpus With FIRNet Source-Filter Neural Vocoder INTERSPEECH 2024 Mobile PresenTra: NICT fast neural text-to-speech system on smartphones with incremental inference of MS-FC-HiFi-GAN for law-latency synthesis INTERSPEECH 2024 E2E-S2S-VC: End-To-End Sequence-To-Sequence Voice Conversion INTERSPEECH 2023 Transducer-based language embedding for spoken language identification INTERSPEECH 2022 Can We Train a Language Model Inside an End-to-End ASR Model? - Investigating Effective Implicit Language Modeling COLING 2022 Noise Robust Acoustic Modeling for Single-Channel Speech Recognition Based on a Stream-Wise Transformer Architecture INTERSPEECH 2021 Quasi-Periodic Parallel WaveGAN Vocoder: A Non-Autoregressive Pitch-Dependent Dilated Convolution Model for Parametric Speech Generation INTERSPEECH 2020 Investigation of NICT Submission for Short-Duration Speaker Verification Challenge 2020 INTERSPEECH 2020 One-Pass Single-Channel Noisy Speech Recognition Using a Combination of Noisy and Enhanced Features INTERSPEECH 2019 Multimodal Attention Branch Network for Perspective-Free Sentence Generation CORL 2019 Real-Time Neural Text-to-Speech with Sequence-to-Sequence Acoustic Model and WaveGlow or Single Gaussian WaveRNN Vocoders INTERSPEECH 2019 End-to-End Articulatory Attribute Modeling for Low-Resource Multilingual Speech Recognition INTERSPEECH 2019 Investigating Radical-Based End-to-End Speech Recognition Systems for Chinese Dialects and Japanese INTERSPEECH 2019 Incorporating Symbolic Sequential Modeling for Speech Enhancement INTERSPEECH 2019 Class-Wise Centroid Distance Metric Learning for Acoustic Event Detection INTERSPEECH 2019 Improving Transformer-Based Speech Recognition Systems with Compressed Structure and Speech Attributes Augmentation INTERSPEECH 2019 Duration Modeling with Global Phoneme-Duration Vectors INTERSPEECH 2019 Temporal Attentive Pooling for Acoustic Event Detection INTERSPEECH 2018 Feature Representation of Short Utterances Based on Knowledge Distillation for Spoken Language Identification INTERSPEECH 2018 Improving CTC-based Acoustic Model with Very Deep Residual Time-delay Neural Networks INTERSPEECH 2018 Multilingual Grapheme-to-Phoneme Conversion with Global Character Vectors INTERSPEECH 2018 Global Syllable Vectors for Building TTS Front-End with Deep Learning INTERSPEECH 2017 Conditional Generative Adversarial Nets Classifier for Spoken Language Identification INTERSPEECH 2017 Model Integration for HMM- and DNN-Based Speech Synthesis Using Product-of-Experts Framework INTERSPEECH 2016 Using Zero-Frequency Resonator to Extract Multilingual Intonation Structure INTERSPEECH 2016 Investigation of Semi-Supervised Acoustic Model Training Based on the Committee of Heterogeneous Neural Networks INTERSPEECH 2016 F0Contour Analysis Based on Empirical Mode Decomposition for DNN Acoustic Modeling in Mandarin Speech Recognition INTERSPEECH 2016 Pair-Wise Distance Metric Learning of Neural Network Model for Spoken Language Identification INTERSPEECH 2016 Maximum a posteriori Based Decoding for CTC Acoustic Models INTERSPEECH 2016 Improving Related Entity Finding via Incorporating Homepages and Recognizing Fine-grained Entities IJCNLP 2011 Answering Complex Questions via Exploiting Social Q&A Collection IJCNLP 2011