Shota Horiguchi
16 papers · 2019–2024 · 4 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+6 more ↓ Show less ↑
π Interdisciplinary Bridge π§ Keyword Pioneer π£ Hot Topic Early Bird πΊοΈ Taxonomy Completionist (13) π Conference Polyglot (4)
π
Cross-Pollinator
(12)
π
Renaissance Researcher
(5)
π
Conference Polyglot
(4)
π
Century Club
(16)
π₯
Unstoppable
(6)
ποΈ
Keyword Collector
(92)
Conferences
INTERSPEECH (13)
COLING (1)
ICML (1)
SEMEVAL (1)
Top co-authors
Keywords
speaker diarization
(5)
multimodal learning
(3)
speaker embedding
(2)
guided source separation
(2)
transfer learning
(2)
speech enhancement
(2)
text classification
(2)
end-to-end learning
(2)
overlapping speech
(2)
speech recognition
(2)
end-to-end model
(2)
word error rate
(2)
image classification
(2)
automatic speech recognition
(2)
ensemble learning
(2)
multi-label classification
(1)
emotion recognition
(1)
domain adaptation
(1)
model fusion
(1)
source separation
(1)
Papers
Factor-Conditioned Speaking-Style Captioning
INTERSPEECH 2024
SpeakerBeam-SS: Real-time Target Speaker Extraction with Lightweight Conv-TasNet and State Space Modeling
INTERSPEECH 2024
Spoofing Attacker Also Benefits from Self-Supervised Pretrained Model
INTERSPEECH 2023
CAPTDURE: Captioned Sound Dataset of Single Sources
INTERSPEECH 2023
Rethinking Fanoβs Inequality in Ensemble Learning
ICML 2022
Updating Only Encoders Prevents Catastrophic Forgetting of End-to-End ASR Models
INTERSPEECH 2022
Semi-Supervised Training with Pseudo-Labeling for End-To-End Neural Diarization
INTERSPEECH 2021
Online Streaming End-to-End Neural Diarization Handling Overlapping Speech and Flexible Numbers of Speakers
INTERSPEECH 2021
Hitachi at SemEval-2020 Task 8: Simple but Effective Modality Ensemble for Meme Emotion Recognition
SEMEVAL 2020
End-to-End Speaker Diarization for an Unknown Number of Speakers with Encoder-Decoder Based Attractors
INTERSPEECH 2020
Utterance-Wise Meeting Transcription System Using Asynchronous Distributed Microphones
INTERSPEECH 2020
Hitachi at SemEval-2020 Task 8: Simple but Effective Modality Ensemble for Meme Emotion Recognition
COLING 2020
Multimodal Response Obligation Detection with Unsupervised Online Domain Adaptation
INTERSPEECH 2019
Guided Source Separation Meets a Strong ASR Backend: Hitachi/Paderborn University Joint Investigation for Dinner Party ASR
INTERSPEECH 2019
Auxiliary Interference Speaker Loss for Target-Speaker Speech Recognition
INTERSPEECH 2019
End-to-End Neural Speaker Diarization with Permutation-Free Objectives
INTERSPEECH 2019