Nobukatsu Hojo

17 papers · 2016–2026 · 3 top CS/AI conferences

Achievements

🧭 Keyword Pioneer 🌈 Renaissance Researcher (5) 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (10) 🌍 Conference Polyglot (2) 🧬 Topic Evolution 🤝 Dynamic Duo (11) 🗃️ Keyword Collector (81) 🚀 Conference Pioneer 💎 Century Club (16) 🔥 Unstoppable (5)

Conferences

INTERSPEECH (14) AAAI (2) EACL (1)

Top co-authors

Ryo Masumura (11) Saki Mizuno (10) Tomohiro Tanaka (7) Mana Ihori (7) Keita Suzuki (6) Hiroshi Sato (5) Kazutoshi Shinoda (5) Takafumi Moriya (4) Naoki Makishima (4) Satoshi Suzuki (4)

Research topics

Speech & Audio (1)

Keywords

speech synthesis (4) multimodal learning (3) automatic speech recognition (2) autoregressive model (2) joint modeling (2) deep neural network (2) multimodal transformer (2) theory of mind (2) voice conversion (2) large language model (2) generative adversarial network (2) class imbalance (1) speech recognition (1) video analysis (1) feature representation (1) acoustic modeling (1) domain adaptation (1) speech enhancement (1) fine-grained classification (1) hidden markov model (1)

Papers

Let’s Put Ourselves in Sally’s Shoes: Shoes-of-Others Prefilling Improves Theory of Mind in Large Language Models · EACL 2026
ToMATO: Verbalizing the Mental States of Role-Playing LLMs for Benchmarking Theory of Mind · AAAI 2025
Multimodal Fine-Grained Apparent Personality Trait Recognition: Joint Modeling of Big Five and Questionnaire Item-level Scores · AAAI 2025
Participant-Pair-Wise Bottleneck Transformer for Engagement Estimation from Video Conversation · INTERSPEECH 2024
Learning from Multiple Annotator Biased Labels in Multimodal Conversation · INTERSPEECH 2024
Unified Multi-Talker ASR with and without Target-speaker Enrollment · INTERSPEECH 2024
End-to-End Joint Target and Non-Target Speakers ASR · INTERSPEECH 2023
Transcribing Speech as Spoken and Written Dual Text Using an Autoregressive Model · INTERSPEECH 2023
Downstream Task Agnostic Speech Enhancement with Self-Supervised Representation Loss · INTERSPEECH 2023
Audio-Visual Praise Estimation for Conversational Video based on Synchronization-Guided Multimodal Transformer · INTERSPEECH 2023
End-to-End Joint Modeling of Conversation History-Dependent and Independent ASR Systems with Multi-History Training · INTERSPEECH 2022
CycleGAN-VC3: Examining and Improving CycleGAN-VCs for Mel-Spectrogram Conversion · INTERSPEECH 2020
StarGAN-VC2: Rethinking Conditional Methods for StarGAN-Based Voice Conversion · INTERSPEECH 2019
Evaluating Intention Communication by TTS Using Explicit Definitions of Illocutionary Act Performance · INTERSPEECH 2019
Prosody Aware Word-Level Encoder Based on BLSTM-RNNs for DNN-Based Speech Synthesis · INTERSPEECH 2017
DNN-SPACE: DNN-HMM-Based Generative Model of Voice F0 Contours for Statistical Phrase/Accent Command Estimation · INTERSPEECH 2017
An Investigation of DNN-Based Speech Synthesis Using Speaker Codes · INTERSPEECH 2016