Gakuto Kurata
27 papers · 2006–2024 · 5 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+13 more ↓ Show less ↑
π§ Keyword Pioneer π£ Hot Topic Early Bird π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (12) π Conference Polyglot (5)
π£
Hot Topic Early Bird
π
Interdisciplinary Bridge
π
Conference Polyglot
(5)
π
Conference Loyalist
(21)
π¬
Deep Specialist
(12)
π€
Dynamic Duo
(10)
π
Keyword Champion
(3)
ποΈ
Keyword Collector
(104)
β‘
Prolific Year
(6)
π
Conference Pioneer
π
Trend Setter
π
Century Club
(27)
π₯
Unstoppable
(9)
Conferences
INTERSPEECH (21)
EMNLP (3)
ACL (1)
COLING (1)
NAACL (1)
Top co-authors
Keywords
speech recognition
(9)
automatic speech recognition
(6)
acoustic model
(6)
knowledge distillation
(5)
long short-term memory
(4)
convolutional neural network
(4)
end-to-end speech recognition
(3)
connectionist temporal classification
(3)
data augmentation
(3)
neural network
(2)
large language model
(2)
recurrent neural network
(2)
domain adaptation
(2)
deep neural network
(2)
multi-task learning
(2)
acoustic modeling
(2)
language model
(2)
system combination
(1)
spoken language understanding
(1)
model merging
(1)
Papers
Robust ASR Error Correction with Conservative Data Filtering
EMNLP 2024
Speech-enriched Memory for Inference-time Adaptation of ASR Models to Word Dictionaries
EMNLP 2023
Global RNN Transducer Models For Multi-dialect Speech Recognition
INTERSPEECH 2022
Effect and Analysis of Large-scale Language Model Rescoring on Competitive ASR Systems
INTERSPEECH 2022
Improving ASR Robustness in Noisy Condition Through VAD Integration
INTERSPEECH 2022
Improving Generalization of Deep Neural Network Acoustic Models with Length Perturbation and N-best Based Label Smoothing
INTERSPEECH 2022
Improving Customization of Neural Transducers by Mitigating Acoustic Mismatch of Synthesized Audio
INTERSPEECH 2021
Knowledge Distillation from Offline to Streaming RNN Transducer for End-to-End Speech Recognition
INTERSPEECH 2020
New Advances in Speaker Diarization
INTERSPEECH 2020
End-to-End Spoken Language Understanding Without Full Transcripts
INTERSPEECH 2020
Direct Neuron-Wise Fusion of Cognate Neural Networks
INTERSPEECH 2019
Multi-Task CTC Training with Auxiliary Feature Reconstruction for End-to-End Speech Recognition
INTERSPEECH 2019
Guiding CTC Posterior Spike Timings for Improved Posterior Fusion and Knowledge Distillation
INTERSPEECH 2019
Data Augmentation Improves Recognition of Foreign Accented Speech
INTERSPEECH 2018
Inference-Invariant Transformation of Batch Normalization for Domain Adaptation of Acoustic Models
INTERSPEECH 2018
Ensembles of Multi-Scale VGG Acoustic Models
INTERSPEECH 2017
Factorial Modeling for Effective Suppression of Directional Noise
INTERSPEECH 2017
Symbol Sequence Search from Telephone Conversation
INTERSPEECH 2017
Efficient Knowledge Distillation from an Ensemble of Teachers
INTERSPEECH 2017
English Conversational Telephone Speech Recognition by Humans and Machines
INTERSPEECH 2017
Empirical Exploration of Novel Architectures and Objectives for Language Models
INTERSPEECH 2017
Improved Neural Network-based Multi-label Classification with Better Initialization Leveraging Label Co-occurrence
NAACL 2016
Leveraging Sentence-level Information with Encoder LSTM for Semantic Slot Filling
EMNLP 2016
Improved Neural Network Initialization by Grouping Context-Dependent Targets for Acoustic Modeling
INTERSPEECH 2016
Labeled Data Generation with Encoder-Decoder LSTM for Semantic Slot Filling
INTERSPEECH 2016
Phoneme-to-Text Transcription System with an Infinite Vocabulary
COLING 2006
Phoneme-to-Text Transcription System with an Infinite Vocabulary
ACL 2006