Jian Zhu
23 papers · 2019–2026 · 9 conferences · across top CS/AI conferences
Achievements
🌍 Conference Polyglot (9)
🌉 Interdisciplinary Bridge
🐣 Hot Topic Early Bird
🧭 Keyword Pioneer
🏃 Academic Marathon (6)
🏆 Keyword Champion
👥 Mega-Team (54)
🗃️ Keyword Collector (135)
⚡ Prolific Year (6)
📈 Trend Setter
💎 Century Club (19)
🔥 Unstoppable (5)
❓ The Questioner
Conferences
ACL (5)
EMNLP (4)
AAAI (3)
INTERSPEECH (3)
NAACL (3)
NIPS (2)
CVPR (1)
MLHC (1)
SEMEVAL (1)
Keywords
phone recognition (3)
representation learning (3)
multimodal learning (2)
multimodal domain adaptation (2)
multilingual model (2)
low-resource language (2)
multilingual speech (2)
transformer architecture (2)
domain adaptation (2)
contrastive learning (2)
grapheme-to-phoneme conversion (2)
speech synthesis (2)
large language model (2)
video generation (1)
embedding learning (1)
benchmark evaluation (1)
human perception (1)
information bottleneck (1)
zero-shot learning (1)
text classification (1)
Papers
Other Vehicle Trajectories Are Also Needed: A Driving World Model Unifies Ego-Other Vehicle Trajectories in Video Latent Space
AAAI 2026
PRiSM: Benchmarking Phone Realization in Speech Models
ACL 2026
POWSM: A Phonetic Open Whisper-Style Speech Foundation Model
ACL 2026
Boomda: Balanced Multi-objective Optimization for Multimodal Domain Adaptation
AAAI 2026
Self-Learning Hyperspectral and Multispectral Image Fusion via Adaptive Residual Guided Subspace Diffusion Model
CVPR 2025
Developing multilingual speech synthesis system for Ojibwe, Mi’kmaq, and Maliseet
NAACL 2025
CaseReportCollective: A Large-Scale LLM-Extracted Dataset for Structured Medical Case Reports
ACL 2025
ZIPA: A family of efficient models for multilingual phone recognition
ACL 2025
Adversarial Alignment with Anchor Dragging Drift (A3D2): Multimodal Domain Adaptation with Partially Shifted Modalities
ACL 2025
LingGym: How Far Are LLMs from Thinking Like Field Linguists?
EMNLP 2025
Incorporating Test-Time Optimization into Training with Dual Networks for Human Mesh Recovery
NIPS 2024
The taste of IPA: Towards open-vocabulary keyword spotting and forced alignment in any language
NAACL 2024
Generalize for Future: Slow and Fast Trajectory Learning for CTR Prediction
AAAI 2024
A comparison of voice similarity through acoustics, human perception and deep neural network (DNN) speaker verification systems
INTERSPEECH 2024
Dialogue-Contextualized Re-ranking for Medical History-Taking
MLHC 2023
RWKV: Reinventing RNNs for the Transformer Era
EMNLP 2023
The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset
NIPS 2022
Bootstrapping meaning through listening: Unsupervised learning of spoken sentence embeddings
EMNLP 2022
ByT5 model for massively multilingual grapheme-to-phoneme conversion
INTERSPEECH 2022
The structure of online social networks modulates the rate of lexical change
NAACL 2021
Synchronising Speech Segments with Musical Beats in Mandarin and English Singing
INTERSPEECH 2021
Idiosyncratic but not Arbitrary: Learning Idiolects in Online Registers Reveals Distinctive yet Consistent Individual Styles
EMNLP 2021
UM-IU@LING at SemEval-2019 Task 6: Identifying Offensive Tweets Using BERT and SVMs
SEMEVAL 2019