Jia Jia
30 papers · 2016–2026 · 8 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+14 more ↓ Show less ↑
🌍 Conference Polyglot (8) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (5) 🏃 Academic Marathon (9)
🌉
Interdisciplinary Bridge
🏃
Academic Marathon
(9)
🧭
Keyword Pioneer
🏆
Grand Slam
🤝
Dynamic Duo
(14)
🧬
Topic Evolution
🏆
Keyword Champion
🔥
Unstoppable
(10)
🚀
Conference Pioneer
🗃️
Keyword Collector
(163)
⚡
Prolific Year
(5)
❓
The Questioner
(2)
💎
Century Club
(29)
📈
Trend Setter
Conferences
INTERSPEECH (12)
IJCAI (6)
AAAI (5)
ICLR (2)
NIPS (2)
CVPR (1)
EMNLP (1)
ICML (1)
Top co-authors
Keywords
social media analysis
(4)
speech synthesis
(4)
emotion recognition
(3)
multimodal learning
(3)
speech emotion recognition
(3)
recurrent neural network
(3)
automatic speech recognition
(2)
style transfer
(2)
reference encoder
(2)
diffusion model
(2)
depression detection
(2)
prosody modeling
(2)
mental health detection
(2)
attention mechanism
(2)
emotion detection
(2)
video captioning
(1)
video generation
(1)
self-attention mechanism
(1)
curriculum learning
(1)
semi-supervised learning
(1)
Papers
Emotion-Conditioned Motion Sub-spaces with Flow Matching for Real-Time Audio-Driven Talking Heads
AAAI 2026
Minimal Impact ControlNet: Advancing Multi-ControlNet Integration
ICLR 2025
VERIFIED: A Video Corpus Moment Retrieval Benchmark for Fine-Grained Video Understanding
NIPS 2024
Skinned Motion Retargeting with Dense Geometric Interaction Perception
NIPS 2024
DanceCamera3D: 3D Camera Movement Synthesis with Music and Dance
CVPR 2024
Inner Classifier-Free Guidance and Its Taylor Expansion for Diffusion Models
ICLR 2024
LoRA-MER: Low-Rank Adaptation of Pre-Trained Speech Models for Multimodal Emotion Recognition Using Mutual Information
INTERSPEECH 2024
SDDM: Score-Decomposed Diffusion Models on Manifolds for Unpaired Image-to-Image Translation
ICML 2023
Prosody Modeling with 3D Visual Information for Expressive Video Dubbing
INTERSPEECH 2023
What Does Your Face Sound Like? 3D Face Shape towards Voice
AAAI 2023
Towards Cross-speaker Reading Style Transfer on Audiobook Dataset
INTERSPEECH 2022
Towards Multi-Scale Style Control for Expressive Speech Synthesis
INTERSPEECH 2021
Inferring Emotion from Large-scale Internet Voice Data: A Semi-supervised Curriculum Augmentation based Deep Learning Approach
AAAI 2021
Mining Unfollow Behavior in Large-Scale Online Social Networks via Spatial-Temporal Interaction
AAAI 2020
Re-Weighted Interval Loss for Handling Data Imbalance Problem of End-to-End Keyword Spotting
INTERSPEECH 2020
PEIA: Personality and Emotion Integrated Attentive Model for Music Recommendation on Social Media Platforms
AAAI 2020
Design and Implementation of a Disambiguity Framework for Smart Voice Controlled Devices
IJCAI 2019
Towards Discriminative Representation Learning for Speech Emotion Recognition
IJCAI 2019
One-Shot Voice Conversion with Global Speaker Embeddings
INTERSPEECH 2019
An Online Attention-Based Model for Speech Recognition
INTERSPEECH 2019
Disambiguation of Chinese Polyphones in an End-to-End Framework with Semantic Features Extracted by Pre-Trained BERT
INTERSPEECH 2019
The Sogou-TIIC Speech Translation System for IWSLT 2018
EMNLP 2018
Cross-Domain Depression Detection via Harvesting Social Media
IJCAI 2018
Mental Health Computing via Harvesting Social Media Data
IJCAI 2018
Emotion Recognition from Variable-Length Speech Segments Using Deep Learning on Spectrograms
INTERSPEECH 2018
Speech Emotion Recognition with Emotion-Pair Based Framework Considering Emotion Distribution Information in Dimensional Emotion Space
INTERSPEECH 2017
Depression Detection via Harvesting Social Media: A Multimodal Dictionary Learning Solution
IJCAI 2017
Expressive Speech Driven Talking Avatar Synthesis with DBLSTM Using Limited Amount of Emotional Bimodal Data
INTERSPEECH 2016
Phoneme Embedding and its Application to Speech Driven Talking Avatar Synthesis
INTERSPEECH 2016
What Does Social Media Say about Your Stress?
IJCAI 2016