Jian Zhu

23 papers · 2019–2026 · 9 conferences · across top CS/AI conferences

Achievements

+11 more ↓

🌍 Conference Polyglot (9) 🌉 Interdisciplinary Bridge 🐣 Hot Topic Early Bird 🧭 Keyword Pioneer 🏃 Academic Marathon (6)

🏃 Academic Marathon (6) 🐣 Hot Topic Early Bird 🌉 Interdisciplinary Bridge 🏆 Keyword Champion 👥 Mega-Team (54) 🗃️ Keyword Collector (135) ⚡ Prolific Year (6) 📈 Trend Setter 💎 Century Club (19) 🔥 Unstoppable (5) ❓ The Questioner

Conferences

ACL (5) EMNLP (4) AAAI (3) INTERSPEECH (3) NAACL (3) NIPS (2) CVPR (1) MLHC (1) SEMEVAL (1)

Top co-authors

Cong Zhang (3) David Jurgens (3) Changbing Yang (3) David R. Mortensen (3) Zuoyu Tian (2) Kwanghee Choi (2) Kalvin Chang (2) Xinxin Zhang (2) Jun Sun (2) Stella Biderman (2)

Keywords

phone recognition (3) representation learning (3) multimodal learning (2) multimodal domain adaptation (2) multilingual model (2) low-resource language (2) multilingual speech (2) transformer architecture (2) domain adaptation (2) contrastive learning (2) grapheme-to-phoneme conversion (2) speech synthesis (2) large language model (2) video generation (1) embedding learning (1) benchmark evaluation (1) human perception (1) information bottleneck (1) zero-shot learning (1) text classification (1)

Papers

Other Vehicle Trajectories Are Also Needed: A Driving World Model Unifies Ego-Other Vehicle Trajectories in Video Latent Space AAAI 2026 PRiSM: Benchmarking Phone Realization in Speech Models ACL 2026 POWSM: A Phonetic Open Whisper-Style Speech Foundation Model ACL 2026 Boomda: Balanced Multi-objective Optimization for Multimodal Domain Adaptation AAAI 2026 Self-Learning Hyperspectral and Multispectral Image Fusion via Adaptive Residual Guided Subspace Diffusion Model CVPR 2025 Developing multilingual speech synthesis system for Ojibwe, Mi’kmaq, and Maliseet NAACL 2025 CaseReportCollective: A Large-Scale LLM-Extracted Dataset for Structured Medical Case Reports ACL 2025 ZIPA: A family of efficient models for multilingual phone recognition ACL 2025 Adversarial Alignment with Anchor Dragging Drift (A3D2): Multimodal Domain Adaptation with Partially Shifted Modalities ACL 2025 LingGym: How Far Are LLMs from Thinking Like Field Linguists? EMNLP 2025 Incorporating Test-Time Optimization into Training with Dual Networks for Human Mesh Recovery NIPS 2024 The taste of IPA: Towards open-vocabulary keyword spotting and forced alignment in any language NAACL 2024 Generalize for Future: Slow and Fast Trajectory Learning for CTR Prediction AAAI 2024 A comparison of voice similarity through acoustics, human perception and deep neural network (DNN) speaker verification systems INTERSPEECH 2024 Dialogue-Contextualized Re-ranking for Medical History-Taking MLHC 2023 RWKV: Reinventing RNNs for the Transformer Era EMNLP 2023 The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset NIPS 2022 Bootstrapping meaning through listening: Unsupervised learning of spoken sentence embeddings EMNLP 2022 ByT5 model for massively multilingual grapheme-to-phoneme conversion INTERSPEECH 2022 The structure of online social networks modulates the rate of lexical change NAACL 2021 Synchronising Speech Segments with Musical Beats in Mandarin and English Singing INTERSPEECH 2021 Idiosyncratic but not Arbitrary: Learning Idiolects in Online Registers Reveals Distinctive yet Consistent Individual Styles EMNLP 2021 UM-IU@LING at SemEval-2019 Task 6: Identifying Offensive Tweets Using BERT and SVMs SEMEVAL 2019