Brian Yan
29 papers · 2021–2026 · 8 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+10 more ↓ Show less ↑
π Conference Polyglot (8) π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (13) π§ Keyword Pioneer π Cross-Pollinator (15)
π
Renaissance Researcher
(5)
π
Conference Polyglot
(8)
π€
Dynamic Duo
(27)
π¬
Deep Specialist
(14)
π
Keyword Champion
(2)
ποΈ
Keyword Collector
(96)
π
Trend Setter
β‘
Prolific Year
(5)
π₯
Unstoppable
(5)
π
Century Club
(28)
Conferences
INTERSPEECH (13)
ACL (7)
EMNLP (3)
NAACL (2)
EACL (1)
ICLR (1)
ICML (1)
IJCNLP (1)
Top co-authors
Keywords
automatic speech recognition
(12)
speech translation
(12)
spoken language understanding
(7)
end-to-end model
(6)
speech recognition
(5)
simultaneous translation
(4)
beam search
(4)
machine translation
(4)
speech-to-text translation
(3)
knowledge distillation
(3)
connectionist temporal classification
(3)
self-supervised learning
(2)
speech encoder
(2)
simultaneous speech translation
(2)
zero-shot learning
(2)
speech-to-speech translation
(2)
end-to-end learning
(2)
attention mechanism
(2)
model ensembling
(2)
transducer model
(2)
Papers
Hierarchical Policy Optimization for Simultaneous Translation of Unbounded Speech
ACL 2026
Nvidia-Nemoβs WMT 2025 Metrics Shared Task Submission
EMNLP 2025
OWLS: Scaling Laws for Multilingual Speech Recognition and Translation Models
ICML 2025
OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer
INTERSPEECH 2024
CMUβs IWSLT 2024 Simultaneous Speech Translation System
ACL 2024
CMUβs IWSLT 2024 Offline Speech Translation System: A Cascaded Approach For Long-Form Robustness
ACL 2024
CTC Alignments Improve Autoregressive Translation
EACL 2023
ESPnet-ST-v2: Multipurpose Spoken Language Translation Toolkit
ACL 2023
CMUβs IWSLT 2023 Simultaneous Speech Translation System
ACL 2023
BAYES RISK CTC: CONTROLLABLE CTC ALIGNMENT IN SEQUENCE-TO-SEQUENCE TASKS
ICLR 2023
Bayes Risk Transducer: Transducer with Controllable Alignment Prediction
INTERSPEECH 2023
Tensor decomposition for minimization of E2E SLU model toward on-device processing
INTERSPEECH 2023
Prompting the Hidden Talent of Web-Scale Speech Models for Zero-Shot Task Generalization
INTERSPEECH 2023
Integrating Pretrained ASR and LM to Perform Sequence Generation for Spoken Language Understanding
INTERSPEECH 2023
Exploration of Efficient End-to-End ASR using Discretized Input from Self-Supervised Learning
INTERSPEECH 2023
A Comparative Study on E-Branchformer vs Conformer in Speech Recognition, Translation, and Understanding Tasks
INTERSPEECH 2023
4D ASR: Joint modeling of CTC, Attention, Transducer, and Mask-Predict decoders
INTERSPEECH 2023
Incremental Blockwise Beam Search for Simultaneous Speech Translation with Controllable Quality-Latency Tradeoff
INTERSPEECH 2023
Two-Pass Low Latency End-to-End Spoken Language Understanding
INTERSPEECH 2022
CMUβs IWSLT 2022 Dialect Speech Translation System
ACL 2022
Token-level Sequence Labeling for Spoken Language Understanding using Compositional End-to-End Models
EMNLP 2022
BERT Meets CTC: New Formulation of End-to-End Speech Recognition with Pre-trained Masked Language Model
EMNLP 2022
Combining Spectral and Self-Supervised Features for Low Resource Speech Recognition and Translation
INTERSPEECH 2022
ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding
INTERSPEECH 2022
ESPnet-ST IWSLT 2021 Offline Speech Translation System
ACL 2021
Differentiable Allophone Graphs for Language-Universal Speech Recognition
INTERSPEECH 2021
Searchable Hidden Intermediates for End-to-End Models of Decomposable Sequence Tasks
NAACL 2021
Highland Puebla Nahuatl Speech Translation Corpus for Endangered Language Documentation
NAACL 2021
ESPnet-ST IWSLT 2021 Offline Speech Translation System
IJCNLP 2021