Mana Ihori
19 papers · 2020–2026 · 3 conferences
Achievements
Conference Polyglot
(3)
Interdisciplinary Bridge
Taxonomy Completionist
(10)
Keyword Pioneer
Academic Marathon
(5)
Cross-Pollinator
(5)
Dynamic Duo
(18)
Unstoppable
(6)
Century Club
(18)
Prolific Year
(5)
Keyword Collector
(97)
Conferences
INTERSPEECH (16)
AAAI (2)
COLING (1)
Top co-authors
Research topics
Keywords
automatic speech recognition
(5)
joint modeling
(2)
autoregressive modeling
(2)
self-supervised learning
(2)
multi-task learning
(2)
autoregressive model
(2)
multimodal transformer
(2)
overlapped speech
(2)
end-to-end automatic speech recognition
(2)
speaker verification
(2)
unsupervised domain adaptation
(1)
multimodal learning
(1)
speech recognition
(1)
semi-supervised learning
(1)
contrastive learning
(1)
grammatical error correction
(1)
feature representation
(1)
text editing
(1)
adversarial learning
(1)
speaker embedding
(1)
Papers
Difference Vector Equalization for Robust Fine-tuning of Vision-Language Models
AAAI 2026
Multimodal Fine-Grained Apparent Personality Trait Recognition: Joint Modeling of Big Five and Questionnaire Item-level Scores
AAAI 2025
SOMSRED: Sequential Output Modeling for Joint Multi-talker Overlapped Speech Recognition and Speaker Diarization
INTERSPEECH 2024
Unified Multi-Talker ASR with and without Target-speaker Enrollment
INTERSPEECH 2024
Transcribing Speech as Spoken and Written Dual Text Using an Autoregressive Model
INTERSPEECH 2023
End-to-End Joint Target and Non-Target Speakers ASR
INTERSPEECH 2023
Audio-Visual Praise Estimation for Conversational Video based on Synchronization-Guided Multimodal Transformer
INTERSPEECH 2023
Downstream Task Agnostic Speech Enhancement with Self-Supervised Representation Loss
INTERSPEECH 2023
Multi-Perspective Document Revision
COLING 2022
Strategies to Improve Robustness of Target Speech Extraction to Enrollment Variations
INTERSPEECH 2022
Domain Adversarial Self-Supervised Speech Representation Learning for Improving Unknown Domain Downstream Tasks
INTERSPEECH 2022
End-to-End Joint Modeling of Conversation History-Dependent and Independent ASR Systems with Multi-History Training
INTERSPEECH 2022
End-to-End Rich Transcription-Style Automatic Speech Recognition with Semi-Supervised Learning
INTERSPEECH 2021
Cross-Modal Transformer-Based Neural Correction Models for Automatic Speech Recognition
INTERSPEECH 2021
Unified Autoregressive Modeling for Joint End-to-End Multi-Talker Overlapped Speech Recognition and Speaker Attribute Estimation
INTERSPEECH 2021
Zero-Shot Joint Modeling of Multiple Spoken-Text-Style Conversion Tasks Using Switching Tokens
INTERSPEECH 2021
Enrollment-Less Training for Personalized Voice Activity Detection
INTERSPEECH 2021
Phoneme-to-Grapheme Conversion Based Large-Scale Pre-Training for End-to-End Automatic Speech Recognition
INTERSPEECH 2020
Unsupervised Domain Adaptation for Dialogue Sequence Labeling Based on Hierarchical Adversarial Training
INTERSPEECH 2020