conftrace_

Tatsuya Kawahara

63 papers · 2000–2026 · 8 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+14 more ↓

🗺️ Taxonomy Completionist (23) 🧭 Keyword Pioneer 🌈 Renaissance Researcher (5) 🌉 Interdisciplinary Bridge 🐣 Hot Topic Early Bird

🌍 Conference Polyglot (8) 🐝 Cross-Pollinator (14) 🗺️ Taxonomy Completionist (23) 🏠 Conference Loyalist (36) 🤝 Dynamic Duo (12) 🧬 Topic Evolution 🔬 Deep Specialist (16) 🏆 Keyword Champion (3) 📈 Trend Setter 🚀 Conference Pioneer 🔥 Unstoppable (10) ⚡ Prolific Year (9) 💎 Century Club (62) 🗃️ Keyword Collector (67)

Conferences

INTERSPEECH (36) COLING (12) ACL (7) IJCNLP (3) NAACL (2) AACL (1) EMNLP (1) IJCAI (1)

Top co-authors

Masato Mimura (12) Koji Inoue (12) Hirofumi Inaguma (9) Shinsuke Sakai (8) Divesh Lala (7) Sheng Li (6) Katsuya Takanashi (5) Tianyu Zhao (5) Sei Ueno (5) Kazunori Komatani (5)

Keywords

automatic speech recognition (14) dialogue system (13) speech recognition (8) connectionist temporal classification (5) speech emotion recognition (4) human-robot interaction (4) multitask learning (4) spoken dialogue (3) backchannel prediction (3) neural network (3) prosodic feature (3) monotonic attention (3) acoustic feature (3) end-to-end learning (3) end-to-end model (3) turn-taking prediction (3) transfer learning (2) user satisfaction (2) low-resource language (2) semi-supervised learning (2)

Papers

MMAC: A Multilingual, Multimodal Alignment Framework for Cultural Grounding Evaluation ACL 2026 Minority-Aware Satisfaction Estimation in Dialogue Systems via Preference-Adaptive Reinforcement Learning IJCNLP 2025 Minority-Aware Satisfaction Estimation in Dialogue Systems via Preference-Adaptive Reinforcement Learning AACL 2025 Yeah, Un, Oh: Continuous and Real-time Backchannel Prediction with Fine-tuning of Voice Activity Projection NAACL 2025 Human-Like Embodied AI Interviewer: Employing Android ERICA in Real International Conference COLING 2025 Video Retrieval System Using Automatic Speech Recognition for the Japanese Diet COLING 2024 Multilingual Turn-taking Prediction Using Voice Activity Projection COLING 2024 Quantitative Analysis of Editing in Transcription Process in Japanese and European Parliaments and its Diachronic Changes COLING 2024 Efficient and Robust Long-Form Speech Recognition with Hybrid H3-Conformer INTERSPEECH 2024 Dual-path Adaptation of Pretrained Feature Extraction Module for Robust Automatic Speech Recognition INTERSPEECH 2024 Speech Emotion Recognition with Multi-level Acoustic and Semantic Information Extraction and Interaction INTERSPEECH 2024 Entrainment Analysis and Prosody Prediction of Subsequent Interlocutor’s Backchannels in Dialogue INTERSPEECH 2024 Two-stage Finetuning of Wav2vec 2.0 for Speech Emotion Recognition with ASR and Gender Pretraining INTERSPEECH 2023 Embedding Articulatory Constraints for Low-resource Speech Recognition Based on Large Pre-trained Model INTERSPEECH 2023 End-to-end Speech-to-Punctuated-Text Recognition INTERSPEECH 2022 Multimodal Persuasive Dialogue Corpus using Teleoperated Android INTERSPEECH 2022 Non-autoregressive Error Correction for CTC-based ASR with Phone-conditioned Masked LM INTERSPEECH 2022 Leveraging Simultaneous Translation for Enhancing Transcription of Low-resource Language via Cross Attention Mechanism INTERSPEECH 2022 Monaural Speech Enhancement Based on Spectrogram Decomposition for Convolutional Neural Network-sensitive Feature Extraction INTERSPEECH 2022 VAD-Free Streaming Hybrid CTC/Attention ASR for Unsegmented Recording INTERSPEECH 2021 Source and Target Bidirectional Knowledge Distillation for End-to-end Speech Translation NAACL 2021 StableEmit: Selection Probability Discount for Reducing Emission Latency of Streaming Monotonic Attention ASR INTERSPEECH 2021 Topic-relevant Response Generation using Optimal Transport for an Open-domain Dialog System COLING 2020 Designing Precise and Robust Dialogue Response Evaluators ACL 2020 End-to-End Speech Emotion Recognition Combined with Acoustic-to-Word ASR Model INTERSPEECH 2020 CTC-Synchronous Training for Monotonic Attention Model INTERSPEECH 2020 Enhancing Monotonic Multihead Attention for Streaming ASR INTERSPEECH 2020 Generative Adversarial Training Data Adaptation for Very Low-Resource Automatic Speech Recognition INTERSPEECH 2020 Distilling the Knowledge of BERT for Sequence-to-Sequence ASR INTERSPEECH 2020 End-to-End Speech-to-Dialog-Act Recognition INTERSPEECH 2020 Semi-Supervised Learning for Character Expression of Spoken Dialogue Systems INTERSPEECH 2020 Analysis of Effect and Timing of Fillers in Natural Turn-Taking INTERSPEECH 2019 Improving Transformer-Based Speech Recognition Systems with Compressed Structure and Speech Attributes Augmentation INTERSPEECH 2019 ERICA and WikiTalk IJCAI 2019 End-to-End Articulatory Attribute Modeling for Low-Resource Multilingual Speech Recognition INTERSPEECH 2019 Investigating Radical-Based End-to-End Speech Recognition Systems for Chinese Dialects and Japanese INTERSPEECH 2019 Improved End-to-End Speech Emotion Recognition Using Self Attention Mechanism and Multitask Learning INTERSPEECH 2019 Turn-Taking Prediction Based on Detection of Transition Relevance Place INTERSPEECH 2019 Forward-Backward Attention Decoder INTERSPEECH 2018 Encoder Transfer for Attention-based Acoustic-to-word Speech Recognition INTERSPEECH 2018 Improving CTC-based Acoustic Model with Very Deep Residual Time-delay Neural Networks INTERSPEECH 2018 Engagement Recognition in Spoken Dialogue via Neural Network by Aggregating Different Annotators' Models INTERSPEECH 2018 Prediction of Turn-taking Using Multitask Learning with Prediction of Backchannels and Fillers INTERSPEECH 2018 Joint Learning of Dialog Act Segmentation and Recognition in Spoken Dialog Using Neural Networks IJCNLP 2017 Social Signal Detection in Spontaneous Dialogue Using Bidirectional LSTM-CTC INTERSPEECH 2017 Combined Multi-Channel NMF-Based Robust Beamforming for Noisy Speech Recognition INTERSPEECH 2017 Analysis of the Relationship Between Prosodic Features of Fillers and its Forms or Occurrence Positions INTERSPEECH 2017 Prediction and Generation of Backchannel Form for Attentive Listening Systems INTERSPEECH 2016 Joint Optimization of Denoising Autoencoder and DNN Acoustic Model Based on Multi-Target Learning for Noisy Speech Recognition INTERSPEECH 2016 Predicate Argument Structure Analysis using Partially Annotated Corpora IJCNLP 2013 Machine Translation without Words through Substring Alignment ACL 2012 Language Modeling for Spoken Dialogue System based on Filtering using Predicate-Argument Structures COLING 2012 An Unsupervised Model for Joint Phrase Alignment and Extraction ACL 2011 Bayes Risk-based Dialogue Management for Document Retrieval System with Speech Interface COLING 2008 Detection of Quotations and Inserted Clauses and Its Application to Dependency Structure Analysis in Spontaneous Japanese COLING 2006 Detection of Quotations and Inserted Clauses and Its Application to Dependency Structure Analysis in Spontaneous Japanese ACL 2006 Speech-based Information Retrieval System with Clarification Dialogue Strategy EMNLP 2005 Dependency Structure Analysis and Sentence Boundary Detection in Spontaneous Japanese COLING 2004 Efficient Confirmation Strategy for Large-scale Text Retrieval Systems with Spoken Dialogue Interface COLING 2004 Flexible Guidance Generation Using User Model in Spoken Dialogue Systems ACL 2003 Dialog Navigator : A Spoken Dialog Q-A System based on Large Text Knowledge Base ACL 2003 Efficient Dialogue Strategy to Find Users’ Intended Items from Information Query Results COLING 2002 Flexible Mixed-Initiative Dialogue Management using Concept-Level Confidence Measures of Speech Recognizer Output COLING 2000