Gakuto Kurata

27 papers · 2006–2024 · 5 conferences · across top CS/AI conferences

Achievements

+13 more ↓

🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (12) 🌍 Conference Polyglot (5)

🐣 Hot Topic Early Bird 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (5) 🏠 Conference Loyalist (21) 🔬 Deep Specialist (12) 🤝 Dynamic Duo (10) 🏆 Keyword Champion (3) 🗃️ Keyword Collector (104) ⚡ Prolific Year (6) 🚀 Conference Pioneer 📈 Trend Setter 💎 Century Club (27) 🔥 Unstoppable (9)

Conferences

INTERSPEECH (21) EMNLP (3) ACL (1) COLING (1) NAACL (1)

Top co-authors

Masayuki Suzuki (10) Takashi Fukuda (8) George Saon (8) Samuel Thomas (6) Brian Kingsbury (5) Bhuvana Ramabhadran (5) Kartik Audhkhasi (4) Bing Xiang (3) Bowen Zhou (3) Abhinav Sethy (2)

Keywords

speech recognition (9) automatic speech recognition (6) acoustic model (6) knowledge distillation (5) long short-term memory (4) convolutional neural network (4) end-to-end speech recognition (3) connectionist temporal classification (3) data augmentation (3) neural network (2) large language model (2) recurrent neural network (2) domain adaptation (2) deep neural network (2) multi-task learning (2) acoustic modeling (2) language model (2) system combination (1) spoken language understanding (1) model merging (1)

Papers

Robust ASR Error Correction with Conservative Data Filtering EMNLP 2024 Speech-enriched Memory for Inference-time Adaptation of ASR Models to Word Dictionaries EMNLP 2023 Global RNN Transducer Models For Multi-dialect Speech Recognition INTERSPEECH 2022 Effect and Analysis of Large-scale Language Model Rescoring on Competitive ASR Systems INTERSPEECH 2022 Improving ASR Robustness in Noisy Condition Through VAD Integration INTERSPEECH 2022 Improving Generalization of Deep Neural Network Acoustic Models with Length Perturbation and N-best Based Label Smoothing INTERSPEECH 2022 Improving Customization of Neural Transducers by Mitigating Acoustic Mismatch of Synthesized Audio INTERSPEECH 2021 Knowledge Distillation from Offline to Streaming RNN Transducer for End-to-End Speech Recognition INTERSPEECH 2020 New Advances in Speaker Diarization INTERSPEECH 2020 End-to-End Spoken Language Understanding Without Full Transcripts INTERSPEECH 2020 Direct Neuron-Wise Fusion of Cognate Neural Networks INTERSPEECH 2019 Multi-Task CTC Training with Auxiliary Feature Reconstruction for End-to-End Speech Recognition INTERSPEECH 2019 Guiding CTC Posterior Spike Timings for Improved Posterior Fusion and Knowledge Distillation INTERSPEECH 2019 Data Augmentation Improves Recognition of Foreign Accented Speech INTERSPEECH 2018 Inference-Invariant Transformation of Batch Normalization for Domain Adaptation of Acoustic Models INTERSPEECH 2018 Ensembles of Multi-Scale VGG Acoustic Models INTERSPEECH 2017 Factorial Modeling for Effective Suppression of Directional Noise INTERSPEECH 2017 Symbol Sequence Search from Telephone Conversation INTERSPEECH 2017 Efficient Knowledge Distillation from an Ensemble of Teachers INTERSPEECH 2017 English Conversational Telephone Speech Recognition by Humans and Machines INTERSPEECH 2017 Empirical Exploration of Novel Architectures and Objectives for Language Models INTERSPEECH 2017 Improved Neural Network-based Multi-label Classification with Better Initialization Leveraging Label Co-occurrence NAACL 2016 Leveraging Sentence-level Information with Encoder LSTM for Semantic Slot Filling EMNLP 2016 Improved Neural Network Initialization by Grouping Context-Dependent Targets for Acoustic Modeling INTERSPEECH 2016 Labeled Data Generation with Encoder-Decoder LSTM for Semantic Slot Filling INTERSPEECH 2016 Phoneme-to-Text Transcription System with an Infinite Vocabulary COLING 2006 Phoneme-to-Text Transcription System with an Infinite Vocabulary ACL 2006