Jinchuan Tian
20 papers · 2022–2026 · 7 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+6 more ↓ Show less ↑
π§ Keyword Pioneer π Renaissance Researcher (5) π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (13) π Conference Polyglot (6)
π§
Keyword Pioneer
π€
Dynamic Duo
(13)
π
Century Club
(18)
β‘
Prolific Year
(8)
π₯
Unstoppable
(5)
ποΈ
Keyword Collector
(85)
Conferences
INTERSPEECH (7)
ACL (5)
NAACL (3)
ICML (2)
EACL (1)
EMNLP (1)
ICLR (1)
Top co-authors
Keywords
automatic speech recognition
(6)
speech recognition
(4)
large language model
(3)
self-supervised learning
(2)
sequence-to-sequence model
(2)
zero-shot learning
(2)
foundation model
(2)
machine translation
(2)
speech synthesis
(2)
speech enhancement
(2)
weakly supervised learning
(1)
speech processing
(1)
multimodal learning
(1)
model architecture
(1)
question answering
(1)
domain adaptation
(1)
source separation
(1)
multitask learning
(1)
model adaptation
(1)
multilingual speech processing
(1)
Papers
BSCodec: A Band-Split Neural Codec for High-Quality Universal Audio Reconstruction
EACL 2026
Speech-Hands: A Self-Reflection Voice Agentic Approach to Speech Recognition and Audio Reasoning with Omni Perception
ACL 2026
ESPnet-SDS: Unified Toolkit and Demo for Spoken Dialogue Systems
NAACL 2025
SpeechIQ: Speech-Agentic Intelligence Quotient Across Cognitive Levels in Voice Understanding by Large Language Models
ACL 2025
ESPnet-SpeechLM: An Open Speech Language Model Toolkit
NAACL 2025
VERSA: A Versatile Evaluation Toolkit for Speech, Audio, and Music
NAACL 2025
OWLS: Scaling Laws for Multilingual Speech Recognition and Translation Models
ICML 2025
On the Effects of Heterogeneous Data Sources on Speech-to-Text Foundation Models
INTERSPEECH 2024
Make-A-Voice: Revisiting Voice Large Language Models as Scalable Multilingual and Multitask Learners
ACL 2024
Towards Robust Speech Representation Learning for Thousands of Languages
EMNLP 2024
CMUβs IWSLT 2024 Offline Speech Translation System: A Cascaded Approach For Long-Form Robustness
ACL 2024
OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer
INTERSPEECH 2024
ML-SUPERB 2.0: Benchmarking Multilingual Speech Models Across Modeling Constraints, Languages, and Datasets
INTERSPEECH 2024
The Interspeech 2024 Challenge on Speech Processing Using Discrete Units
INTERSPEECH 2024
UniAudio: Towards Universal Audio Generation with Large Language Models
ICML 2024
Bayes Risk Transducer: Transducer with Controllable Alignment Prediction
INTERSPEECH 2023
BAYES RISK CTC: CONTROLLABLE CTC ALIGNMENT IN SEQUENCE-TO-SEQUENCE TASKS
ICLR 2023
The MineTrans Systems for IWSLT 2023 Offline Speech Translation and Speech-to-Speech Translation Tasks
ACL 2023
Speaker-Aware Mixture of Mixtures Training for Weakly Supervised Speaker Extraction
INTERSPEECH 2022
LAE: Language-Aware Encoder for Monolingual and Multilingual ASR
INTERSPEECH 2022