Jinchuan Tian

20 papers · 2022–2026 · 7 conferences · across top CS/AI conferences

Achievements

+6 more ↓

🧭 Keyword Pioneer 🌈 Renaissance Researcher (5) 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (13) 🌍 Conference Polyglot (6)

🧭 Keyword Pioneer 🤝 Dynamic Duo (13) 💎 Century Club (18) ⚡ Prolific Year (8) 🔥 Unstoppable (5) 🗃️ Keyword Collector (85)

Conferences

INTERSPEECH (7) ACL (5) NAACL (3) ICML (2) EACL (1) EMNLP (1) ICLR (1)

Top co-authors

Shinji Watanabe (15) Jiatong Shi (10) William Chen (8) Yifan Peng (6) Xuankai Chang (6) Brian Yan (5) Karen Livescu (4) Jianwei Yu (4) Chao-Han Huck Yang (4) Dong Yu (4)

Keywords

automatic speech recognition (6) speech recognition (4) large language model (3) self-supervised learning (2) sequence-to-sequence model (2) zero-shot learning (2) foundation model (2) machine translation (2) speech synthesis (2) speech enhancement (2) weakly supervised learning (1) speech processing (1) multimodal learning (1) model architecture (1) question answering (1) domain adaptation (1) source separation (1) multitask learning (1) model adaptation (1) multilingual speech processing (1)

Papers

BSCodec: A Band-Split Neural Codec for High-Quality Universal Audio Reconstruction EACL 2026 Speech-Hands: A Self-Reflection Voice Agentic Approach to Speech Recognition and Audio Reasoning with Omni Perception ACL 2026 ESPnet-SDS: Unified Toolkit and Demo for Spoken Dialogue Systems NAACL 2025 SpeechIQ: Speech-Agentic Intelligence Quotient Across Cognitive Levels in Voice Understanding by Large Language Models ACL 2025 ESPnet-SpeechLM: An Open Speech Language Model Toolkit NAACL 2025 VERSA: A Versatile Evaluation Toolkit for Speech, Audio, and Music NAACL 2025 OWLS: Scaling Laws for Multilingual Speech Recognition and Translation Models ICML 2025 On the Effects of Heterogeneous Data Sources on Speech-to-Text Foundation Models INTERSPEECH 2024 Make-A-Voice: Revisiting Voice Large Language Models as Scalable Multilingual and Multitask Learners ACL 2024 Towards Robust Speech Representation Learning for Thousands of Languages EMNLP 2024 CMU’s IWSLT 2024 Offline Speech Translation System: A Cascaded Approach For Long-Form Robustness ACL 2024 OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer INTERSPEECH 2024 ML-SUPERB 2.0: Benchmarking Multilingual Speech Models Across Modeling Constraints, Languages, and Datasets INTERSPEECH 2024 The Interspeech 2024 Challenge on Speech Processing Using Discrete Units INTERSPEECH 2024 UniAudio: Towards Universal Audio Generation with Large Language Models ICML 2024 Bayes Risk Transducer: Transducer with Controllable Alignment Prediction INTERSPEECH 2023 BAYES RISK CTC: CONTROLLABLE CTC ALIGNMENT IN SEQUENCE-TO-SEQUENCE TASKS ICLR 2023 The MineTrans Systems for IWSLT 2023 Offline Speech Translation and Speech-to-Speech Translation Tasks ACL 2023 Speaker-Aware Mixture of Mixtures Training for Weakly Supervised Speaker Extraction INTERSPEECH 2022 LAE: Language-Aware Encoder for Monolingual and Multilingual ASR INTERSPEECH 2022