Jia Jia

30 papers · 2016–2026 · 8 conferences · across top CS/AI conferences

Achievements

+14 more ↓

🌍 Conference Polyglot (8) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (5) 🏃 Academic Marathon (9)

🌉 Interdisciplinary Bridge 🏃 Academic Marathon (9) 🧭 Keyword Pioneer 🏆 Grand Slam 🤝 Dynamic Duo (14) 🧬 Topic Evolution 🏆 Keyword Champion 🔥 Unstoppable (10) 🚀 Conference Pioneer 🗃️ Keyword Collector (163) ⚡ Prolific Year (5) ❓ The Questioner (2) 💎 Century Club (29) 📈 Trend Setter

Conferences

INTERSPEECH (12) IJCAI (6) AAAI (5) ICLR (2) NIPS (2) CVPR (1) EMNLP (1) ICML (1)

Top co-authors

Zhiyong Wu (14) Helen Meng (11) Shikun Sun (5) Tat-Seng Chua (5) Lianhong Cai (4) Zhihan Yang (4) Junliang Xing (3) Wei Chen (3) Guangyao Shen (3) Zixuan Wang (3)

Keywords

social media analysis (4) speech synthesis (4) emotion recognition (3) multimodal learning (3) speech emotion recognition (3) recurrent neural network (3) automatic speech recognition (2) style transfer (2) reference encoder (2) diffusion model (2) depression detection (2) prosody modeling (2) mental health detection (2) attention mechanism (2) emotion detection (2) video captioning (1) video generation (1) self-attention mechanism (1) curriculum learning (1) semi-supervised learning (1)

Papers

Emotion-Conditioned Motion Sub-spaces with Flow Matching for Real-Time Audio-Driven Talking Heads AAAI 2026 Minimal Impact ControlNet: Advancing Multi-ControlNet Integration ICLR 2025 VERIFIED: A Video Corpus Moment Retrieval Benchmark for Fine-Grained Video Understanding NIPS 2024 Skinned Motion Retargeting with Dense Geometric Interaction Perception NIPS 2024 DanceCamera3D: 3D Camera Movement Synthesis with Music and Dance CVPR 2024 Inner Classifier-Free Guidance and Its Taylor Expansion for Diffusion Models ICLR 2024 LoRA-MER: Low-Rank Adaptation of Pre-Trained Speech Models for Multimodal Emotion Recognition Using Mutual Information INTERSPEECH 2024 SDDM: Score-Decomposed Diffusion Models on Manifolds for Unpaired Image-to-Image Translation ICML 2023 Prosody Modeling with 3D Visual Information for Expressive Video Dubbing INTERSPEECH 2023 What Does Your Face Sound Like? 3D Face Shape towards Voice AAAI 2023 Towards Cross-speaker Reading Style Transfer on Audiobook Dataset INTERSPEECH 2022 Towards Multi-Scale Style Control for Expressive Speech Synthesis INTERSPEECH 2021 Inferring Emotion from Large-scale Internet Voice Data: A Semi-supervised Curriculum Augmentation based Deep Learning Approach AAAI 2021 Mining Unfollow Behavior in Large-Scale Online Social Networks via Spatial-Temporal Interaction AAAI 2020 Re-Weighted Interval Loss for Handling Data Imbalance Problem of End-to-End Keyword Spotting INTERSPEECH 2020 PEIA: Personality and Emotion Integrated Attentive Model for Music Recommendation on Social Media Platforms AAAI 2020 Design and Implementation of a Disambiguity Framework for Smart Voice Controlled Devices IJCAI 2019 Towards Discriminative Representation Learning for Speech Emotion Recognition IJCAI 2019 One-Shot Voice Conversion with Global Speaker Embeddings INTERSPEECH 2019 An Online Attention-Based Model for Speech Recognition INTERSPEECH 2019 Disambiguation of Chinese Polyphones in an End-to-End Framework with Semantic Features Extracted by Pre-Trained BERT INTERSPEECH 2019 The Sogou-TIIC Speech Translation System for IWSLT 2018 EMNLP 2018 Cross-Domain Depression Detection via Harvesting Social Media IJCAI 2018 Mental Health Computing via Harvesting Social Media Data IJCAI 2018 Emotion Recognition from Variable-Length Speech Segments Using Deep Learning on Spectrograms INTERSPEECH 2018 Speech Emotion Recognition with Emotion-Pair Based Framework Considering Emotion Distribution Information in Dimensional Emotion Space INTERSPEECH 2017 Depression Detection via Harvesting Social Media: A Multimodal Dictionary Learning Solution IJCAI 2017 Expressive Speech Driven Talking Avatar Synthesis with DBLSTM Using Limited Amount of Emotional Bimodal Data INTERSPEECH 2016 Phoneme Embedding and its Application to Speech Driven Talking Avatar Synthesis INTERSPEECH 2016 What Does Social Media Say about Your Stress? IJCAI 2016