conftrace_

Hiroshi Sato

20 papers · 2013–2024 · 3 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓
+10 more ↓ 🧭 Keyword Pioneer πŸ—ΊοΈ Taxonomy Completionist (12) πŸŒ‰ Interdisciplinary Bridge 🌈 Renaissance Researcher (5) 🌍 Conference Polyglot (3)
πŸŒ‰ Interdisciplinary Bridge πŸ—ΊοΈ Taxonomy Completionist (12) 🀝 Dynamic Duo (14) πŸ”¬ Deep Specialist (11) πŸš€ Conference Pioneer ⚑ Prolific Year (5) πŸ”₯ Unstoppable (6) ❓ The Questioner (3) πŸ—ƒοΈ Keyword Collector (84) πŸ’Ž Century Club (20)

Conferences

INTERSPEECH (18) COLING (1) IJCAI (1)

Research topics

Papers

SpeakerBeam-SS: Real-time Target Speaker Extraction with Lightweight Conv-TasNet and State Space Modeling INTERSPEECH 2024 Boosting Hybrid Autoregressive Transducer-based ASR with Internal Acoustic Model Training and Dual Blank Thresholding INTERSPEECH 2024 End-to-End Joint Target and Non-Target Speakers ASR INTERSPEECH 2023 Audio-Visual Praise Estimation for Conversational Video based on Synchronization-Guided Multimodal Transformer INTERSPEECH 2023 Knowledge Distillation for Neural Transducer-based Target-Speaker ASR: Exploiting Parallel Mixture/Single-Talker Speech Data INTERSPEECH 2023 Downstream Task Agnostic Speech Enhancement with Self-Supervised Representation Loss INTERSPEECH 2023 Transcribing Speech as Spoken and Written Dual Text Using an Autoregressive Model INTERSPEECH 2023 Streaming Target-Speaker ASR with Neural Transducer INTERSPEECH 2022 Listen only to me! How well can target speech extraction handle false alarms? INTERSPEECH 2022 Strategies to Improve Robustness of Target Speech Extraction to Enrollment Variations INTERSPEECH 2022 Domain Adversarial Self-Supervised Speech Representation Learning for Improving Unknown Domain Downstream Tasks INTERSPEECH 2022 Multi-Perspective Document Revision COLING 2022 End-to-End Joint Modeling of Conversation History-Dependent and Independent ASR Systems with Multi-History Training INTERSPEECH 2022 How bad are artifacts?: Analyzing the impact of speech enhancement errors on ASR INTERSPEECH 2022 Streaming End-to-End Speech Recognition for Hybrid RNN-T/Attention Architecture INTERSPEECH 2021 Should We Always Separate?: Switching Between Enhanced and Observed Signals for Overlapping Speech Recognition INTERSPEECH 2021 Self-Distillation for Improving CTC-Transformer-Based ASR Systems INTERSPEECH 2020 Neural Whispered Speech Detection with Imbalanced Learning INTERSPEECH 2019 End-to-End Automatic Speech Recognition with a Reconstruction Criterion Using Speech-to-Text and Text-to-Speech Encoder-Decoders INTERSPEECH 2019 Prior-Free Exploration Bonus for and beyond Near Bayes-Optimal Behavior IJCAI 2013