Ashishkumar Gudmalwar
4 papers · 2022–2024 · 2 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓
🌉
Interdisciplinary Bridge
🧭
Keyword Pioneer
🌍
Conference Polyglot
(2)
🐝
Cross-Pollinator
(8)
🌈
Renaissance Researcher
(5)
Conferences
INTERSPEECH (3)
NAACL (1)
Top co-authors
Keywords
representation learning
(1)
reinforcement learning
(1)
speaker embedding
(1)
support vector machine
(1)
deep canonical correlation analysis
(1)
multimodal large language model
(1)
lip synchronization
(1)
cross-modal attention
(1)
cross-lingual synthesis
(1)
speech emotion recognition
(1)
magnitude spectrum
(1)
phase information
(1)
student-teacher architecture
(1)
speech duration control
(1)
audio-visual alignment
(1)
emotional style transfer
(1)
multilingual speaker
(1)
isometric neural machine translation
(1)
automatic video dubbing
(1)
phoneme count compliance
(1)
Papers
DubWise: Video-Guided Speech Duration Control in Multimodal LLM-based Text-to-Speech for Dubbing
INTERSPEECH 2024
VECL-TTS: Voice identity and Emotional style controllable Cross-Lingual Text-to-Speech
INTERSPEECH 2024
Isometric Neural Machine Translation using Phoneme Count Ratio Reward-based Reinforcement Learning
NAACL 2024
The Magnitude and Phase based Speech Representation Learning using Autoencoder for Classifying Speech Emotions using Deep Canonical Correlation Analysis
INTERSPEECH 2022