Nobukatsu Hojo
17 papers · 2016–2026 · 3 conferences · across top CS/AI conferences
Achievements
Keyword Pioneer
Renaissance Researcher (5)
Interdisciplinary Bridge
Taxonomy Completionist (10)
Conference Polyglot (2)
Topic Evolution
Dynamic Duo (11)
Keyword Collector (81)
Conference Pioneer
Century Club (16)
Unstoppable (5)
Conferences
INTERSPEECH (14)
AAAI (2)
EACL (1)
Research topics
Keywords
speech synthesis (4)
multimodal learning (3)
automatic speech recognition (2)
autoregressive model (2)
joint modeling (2)
deep neural network (2)
multimodal transformer (2)
theory of mind (2)
voice conversion (2)
large language model (2)
generative adversarial network (2)
class imbalance (1)
speech recognition (1)
video analysis (1)
feature representation (1)
acoustic modeling (1)
domain adaptation (1)
speech enhancement (1)
fine-grained classification (1)
hidden markov model (1)
Papers
Let’s Put Ourselves in Sally’s Shoes: Shoes-of-Others Prefilling Improves Theory of Mind in Large Language Models
EACL 2026
ToMATO: Verbalizing the Mental States of Role-Playing LLMs for Benchmarking Theory of Mind
AAAI 2025
Multimodal Fine-Grained Apparent Personality Trait Recognition: Joint Modeling of Big Five and Questionnaire Item-level Scores
AAAI 2025
Participant-Pair-Wise Bottleneck Transformer for Engagement Estimation from Video Conversation
INTERSPEECH 2024
Learning from Multiple Annotator Biased Labels in Multimodal Conversation
INTERSPEECH 2024
Unified Multi-Talker ASR with and without Target-speaker Enrollment
INTERSPEECH 2024
End-to-End Joint Target and Non-Target Speakers ASR
INTERSPEECH 2023
Transcribing Speech as Spoken and Written Dual Text Using an Autoregressive Model
INTERSPEECH 2023
Downstream Task Agnostic Speech Enhancement with Self-Supervised Representation Loss
INTERSPEECH 2023
Audio-Visual Praise Estimation for Conversational Video based on Synchronization-Guided Multimodal Transformer
INTERSPEECH 2023
End-to-End Joint Modeling of Conversation History-Dependent and Independent ASR Systems with Multi-History Training
INTERSPEECH 2022
CycleGAN-VC3: Examining and Improving CycleGAN-VCs for Mel-Spectrogram Conversion
INTERSPEECH 2020
StarGAN-VC2: Rethinking Conditional Methods for StarGAN-Based Voice Conversion
INTERSPEECH 2019
Evaluating Intention Communication by TTS Using Explicit Definitions of Illocutionary Act Performance
INTERSPEECH 2019
Prosody Aware Word-Level Encoder Based on BLSTM-RNNs for DNN-Based Speech Synthesis
INTERSPEECH 2017
DNN-SPACE: DNN-HMM-Based Generative Model of Voice F0 Contours for Statistical Phrase/Accent Command Estimation
INTERSPEECH 2017
An Investigation of DNN-Based Speech Synthesis Using Speaker Codes
INTERSPEECH 2016