Tatsuya Kawahara
63 papers · 2000–2026 · 8 conferences · across top CS/AI conferences
Achievements
Taxonomy Completionist (23)
Keyword Pioneer
Renaissance Researcher (5)
Interdisciplinary Bridge
Hot Topic Early Bird
Conference Polyglot (8)
Cross-Pollinator (14)
Conference Loyalist (36)
Dynamic Duo (12)
Topic Evolution
Deep Specialist (16)
Keyword Champion (3)
Trend Setter
Conference Pioneer
Unstoppable (10)
Prolific Year (9)
Century Club (62)
Keyword Collector (67)
Conferences
INTERSPEECH (36)
COLING (12)
ACL (7)
IJCNLP (3)
NAACL (2)
AACL (1)
EMNLP (1)
IJCAI (1)
Keywords
automatic speech recognition (14)
dialogue system (13)
speech recognition (8)
connectionist temporal classification (5)
speech emotion recognition (4)
human-robot interaction (4)
multitask learning (4)
spoken dialogue (3)
backchannel prediction (3)
neural network (3)
prosodic feature (3)
monotonic attention (3)
acoustic feature (3)
end-to-end learning (3)
end-to-end model (3)
turn-taking prediction (3)
transfer learning (2)
user satisfaction (2)
low-resource language (2)
semi-supervised learning (2)
Papers
MMAC: A Multilingual, Multimodal Alignment Framework for Cultural Grounding Evaluation
ACL 2026
Minority-Aware Satisfaction Estimation in Dialogue Systems via Preference-Adaptive Reinforcement Learning
IJCNLP 2025
Minority-Aware Satisfaction Estimation in Dialogue Systems via Preference-Adaptive Reinforcement Learning
AACL 2025
Yeah, Un, Oh: Continuous and Real-time Backchannel Prediction with Fine-tuning of Voice Activity Projection
NAACL 2025
Human-Like Embodied AI Interviewer: Employing Android ERICA in Real International Conference
COLING 2025
Video Retrieval System Using Automatic Speech Recognition for the Japanese Diet
COLING 2024
Multilingual Turn-taking Prediction Using Voice Activity Projection
COLING 2024
Quantitative Analysis of Editing in Transcription Process in Japanese and European Parliaments and its Diachronic Changes
COLING 2024
Efficient and Robust Long-Form Speech Recognition with Hybrid H3-Conformer
INTERSPEECH 2024
Dual-path Adaptation of Pretrained Feature Extraction Module for Robust Automatic Speech Recognition
INTERSPEECH 2024
Speech Emotion Recognition with Multi-level Acoustic and Semantic Information Extraction and Interaction
INTERSPEECH 2024
Entrainment Analysis and Prosody Prediction of Subsequent Interlocutor's Backchannels in Dialogue
INTERSPEECH 2024
Two-stage Finetuning of Wav2vec 2.0 for Speech Emotion Recognition with ASR and Gender Pretraining
INTERSPEECH 2023
Embedding Articulatory Constraints for Low-resource Speech Recognition Based on Large Pre-trained Model
INTERSPEECH 2023
End-to-end Speech-to-Punctuated-Text Recognition
INTERSPEECH 2022
Multimodal Persuasive Dialogue Corpus using Teleoperated Android
INTERSPEECH 2022
Non-autoregressive Error Correction for CTC-based ASR with Phone-conditioned Masked LM
INTERSPEECH 2022
Leveraging Simultaneous Translation for Enhancing Transcription of Low-resource Language via Cross Attention Mechanism
INTERSPEECH 2022
Monaural Speech Enhancement Based on Spectrogram Decomposition for Convolutional Neural Network-sensitive Feature Extraction
INTERSPEECH 2022
VAD-Free Streaming Hybrid CTC/Attention ASR for Unsegmented Recording
INTERSPEECH 2021
Source and Target Bidirectional Knowledge Distillation for End-to-end Speech Translation
NAACL 2021
StableEmit: Selection Probability Discount for Reducing Emission Latency of Streaming Monotonic Attention ASR
INTERSPEECH 2021
Topic-relevant Response Generation using Optimal Transport for an Open-domain Dialog System
COLING 2020
Designing Precise and Robust Dialogue Response Evaluators
ACL 2020
End-to-End Speech Emotion Recognition Combined with Acoustic-to-Word ASR Model
INTERSPEECH 2020
CTC-Synchronous Training for Monotonic Attention Model
INTERSPEECH 2020
Enhancing Monotonic Multihead Attention for Streaming ASR
INTERSPEECH 2020
Generative Adversarial Training Data Adaptation for Very Low-Resource Automatic Speech Recognition
INTERSPEECH 2020
Distilling the Knowledge of BERT for Sequence-to-Sequence ASR
INTERSPEECH 2020
End-to-End Speech-to-Dialog-Act Recognition
INTERSPEECH 2020
Semi-Supervised Learning for Character Expression of Spoken Dialogue Systems
INTERSPEECH 2020
Analysis of Effect and Timing of Fillers in Natural Turn-Taking
INTERSPEECH 2019
Improving Transformer-Based Speech Recognition Systems with Compressed Structure and Speech Attributes Augmentation
INTERSPEECH 2019
ERICA and WikiTalk
IJCAI 2019
End-to-End Articulatory Attribute Modeling for Low-Resource Multilingual Speech Recognition
INTERSPEECH 2019
Investigating Radical-Based End-to-End Speech Recognition Systems for Chinese Dialects and Japanese
INTERSPEECH 2019
Improved End-to-End Speech Emotion Recognition Using Self Attention Mechanism and Multitask Learning
INTERSPEECH 2019
Turn-Taking Prediction Based on Detection of Transition Relevance Place
INTERSPEECH 2019
Forward-Backward Attention Decoder
INTERSPEECH 2018
Encoder Transfer for Attention-based Acoustic-to-word Speech Recognition
INTERSPEECH 2018
Improving CTC-based Acoustic Model with Very Deep Residual Time-delay Neural Networks
INTERSPEECH 2018
Engagement Recognition in Spoken Dialogue via Neural Network by Aggregating Different Annotators' Models
INTERSPEECH 2018
Prediction of Turn-taking Using Multitask Learning with Prediction of Backchannels and Fillers
INTERSPEECH 2018
Joint Learning of Dialog Act Segmentation and Recognition in Spoken Dialog Using Neural Networks
IJCNLP 2017
Social Signal Detection in Spontaneous Dialogue Using Bidirectional LSTM-CTC
INTERSPEECH 2017
Combined Multi-Channel NMF-Based Robust Beamforming for Noisy Speech Recognition
INTERSPEECH 2017
Analysis of the Relationship Between Prosodic Features of Fillers and its Forms or Occurrence Positions
INTERSPEECH 2017
Prediction and Generation of Backchannel Form for Attentive Listening Systems
INTERSPEECH 2016
Joint Optimization of Denoising Autoencoder and DNN Acoustic Model Based on Multi-Target Learning for Noisy Speech Recognition
INTERSPEECH 2016
Predicate Argument Structure Analysis using Partially Annotated Corpora
IJCNLP 2013
Machine Translation without Words through Substring Alignment
ACL 2012
Language Modeling for Spoken Dialogue System based on Filtering using Predicate-Argument Structures
COLING 2012
An Unsupervised Model for Joint Phrase Alignment and Extraction
ACL 2011
Bayes Risk-based Dialogue Management for Document Retrieval System with Speech Interface
COLING 2008
Detection of Quotations and Inserted Clauses and Its Application to Dependency Structure Analysis in Spontaneous Japanese
COLING 2006
Detection of Quotations and Inserted Clauses and Its Application to Dependency Structure Analysis in Spontaneous Japanese
ACL 2006
Speech-based Information Retrieval System with Clarification Dialogue Strategy
EMNLP 2005
Dependency Structure Analysis and Sentence Boundary Detection in Spontaneous Japanese
COLING 2004
Efficient Confirmation Strategy for Large-scale Text Retrieval Systems with Spoken Dialogue Interface
COLING 2004
Flexible Guidance Generation Using User Model in Spoken Dialogue Systems
ACL 2003
Dialog Navigator: A Spoken Dialog Q-A System based on Large Text Knowledge Base
ACL 2003
Efficient Dialogue Strategy to Find Users' Intended Items from Information Query Results
COLING 2002
Flexible Mixed-Initiative Dialogue Management using Concept-Level Confidence Measures of Speech Recognizer Output
COLING 2000