Shota Horiguchi

16 papers · 2019–2024 · 4 conferences · across top CS/AI conferences

Achievements

+6 more ↓

🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🗺️ Taxonomy Completionist (13) 🌍 Conference Polyglot (4)

🐝 Cross-Pollinator (12) 🌈 Renaissance Researcher (5) 🌍 Conference Polyglot (4) 💎 Century Club (16) 🔥 Unstoppable (6) 🗃️ Keyword Collector (92)

Conferences

INTERSPEECH (13) COLING (1) ICML (1) SEMEVAL (1)

Top co-authors

Kenji Nagamatsu (8) Yusuke Fujita (7) Shinji Watanabe (6) Naoyuki Kanda (4) Leibny Paola García Perera (3) Hiroaki Ozaki (3) Gaku Morio (3) Terufumi Morishita (3) Yuki Takashima (3) Yohei Kawaguchi (2)

Keywords

speaker diarization (5) multimodal learning (3) speaker embedding (2) guided source separation (2) transfer learning (2) speech enhancement (2) text classification (2) end-to-end learning (2) overlapping speech (2) speech recognition (2) end-to-end model (2) word error rate (2) image classification (2) automatic speech recognition (2) ensemble learning (2) multi-label classification (1) emotion recognition (1) domain adaptation (1) model fusion (1) source separation (1)

Papers

Factor-Conditioned Speaking-Style Captioning INTERSPEECH 2024 SpeakerBeam-SS: Real-time Target Speaker Extraction with Lightweight Conv-TasNet and State Space Modeling INTERSPEECH 2024 Spoofing Attacker Also Benefits from Self-Supervised Pretrained Model INTERSPEECH 2023 CAPTDURE: Captioned Sound Dataset of Single Sources INTERSPEECH 2023 Rethinking Fano’s Inequality in Ensemble Learning ICML 2022 Updating Only Encoders Prevents Catastrophic Forgetting of End-to-End ASR Models INTERSPEECH 2022 Semi-Supervised Training with Pseudo-Labeling for End-To-End Neural Diarization INTERSPEECH 2021 Online Streaming End-to-End Neural Diarization Handling Overlapping Speech and Flexible Numbers of Speakers INTERSPEECH 2021 Hitachi at SemEval-2020 Task 8: Simple but Effective Modality Ensemble for Meme Emotion Recognition SEMEVAL 2020 End-to-End Speaker Diarization for an Unknown Number of Speakers with Encoder-Decoder Based Attractors INTERSPEECH 2020 Utterance-Wise Meeting Transcription System Using Asynchronous Distributed Microphones INTERSPEECH 2020 Hitachi at SemEval-2020 Task 8: Simple but Effective Modality Ensemble for Meme Emotion Recognition COLING 2020 Multimodal Response Obligation Detection with Unsupervised Online Domain Adaptation INTERSPEECH 2019 Guided Source Separation Meets a Strong ASR Backend: Hitachi/Paderborn University Joint Investigation for Dinner Party ASR INTERSPEECH 2019 Auxiliary Interference Speaker Loss for Target-Speaker Speech Recognition INTERSPEECH 2019 End-to-End Neural Speaker Diarization with Permutation-Free Objectives INTERSPEECH 2019