Brian Yan

29 papers · 2021–2026 · 8 conferences · across top CS/AI conferences

Achievements

+10 more ↓

🌍 Conference Polyglot (8) 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (13) 🧭 Keyword Pioneer 🐝 Cross-Pollinator (15)

🌈 Renaissance Researcher (5) 🌍 Conference Polyglot (8) 🤝 Dynamic Duo (27) 🔬 Deep Specialist (14) 🏆 Keyword Champion (2) 🗃️ Keyword Collector (96) 📈 Trend Setter ⚡ Prolific Year (5) 🔥 Unstoppable (5) 💎 Century Club (28)

Conferences

INTERSPEECH (13) ACL (7) EMNLP (3) NAACL (2) EACL (1) ICLR (1) ICML (1) IJCNLP (1)

Top co-authors

Shinji Watanabe (27) Siddharth Dalmia (10) Jiatong Shi (9) Siddhant Arora (8) Yifan Peng (7) William Chen (6) Xuankai Chang (5) Jinchuan Tian (5) Patrick Fernandes (4) Graham Neubig (4)

Keywords

automatic speech recognition (12) speech translation (12) spoken language understanding (7) end-to-end model (6) speech recognition (5) simultaneous translation (4) beam search (4) machine translation (4) speech-to-text translation (3) knowledge distillation (3) connectionist temporal classification (3) self-supervised learning (2) speech encoder (2) simultaneous speech translation (2) zero-shot learning (2) speech-to-speech translation (2) end-to-end learning (2) attention mechanism (2) model ensembling (2) transducer model (2)

Papers

Hierarchical Policy Optimization for Simultaneous Translation of Unbounded Speech ACL 2026 Nvidia-Nemo’s WMT 2025 Metrics Shared Task Submission EMNLP 2025 OWLS: Scaling Laws for Multilingual Speech Recognition and Translation Models ICML 2025 OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer INTERSPEECH 2024 CMU’s IWSLT 2024 Simultaneous Speech Translation System ACL 2024 CMU’s IWSLT 2024 Offline Speech Translation System: A Cascaded Approach For Long-Form Robustness ACL 2024 CTC Alignments Improve Autoregressive Translation EACL 2023 ESPnet-ST-v2: Multipurpose Spoken Language Translation Toolkit ACL 2023 CMU’s IWSLT 2023 Simultaneous Speech Translation System ACL 2023 BAYES RISK CTC: CONTROLLABLE CTC ALIGNMENT IN SEQUENCE-TO-SEQUENCE TASKS ICLR 2023 Bayes Risk Transducer: Transducer with Controllable Alignment Prediction INTERSPEECH 2023 Tensor decomposition for minimization of E2E SLU model toward on-device processing INTERSPEECH 2023 Prompting the Hidden Talent of Web-Scale Speech Models for Zero-Shot Task Generalization INTERSPEECH 2023 Integrating Pretrained ASR and LM to Perform Sequence Generation for Spoken Language Understanding INTERSPEECH 2023 Exploration of Efficient End-to-End ASR using Discretized Input from Self-Supervised Learning INTERSPEECH 2023 A Comparative Study on E-Branchformer vs Conformer in Speech Recognition, Translation, and Understanding Tasks INTERSPEECH 2023 4D ASR: Joint modeling of CTC, Attention, Transducer, and Mask-Predict decoders INTERSPEECH 2023 Incremental Blockwise Beam Search for Simultaneous Speech Translation with Controllable Quality-Latency Tradeoff INTERSPEECH 2023 Two-Pass Low Latency End-to-End Spoken Language Understanding INTERSPEECH 2022 CMU’s IWSLT 2022 Dialect Speech Translation System ACL 2022 Token-level Sequence Labeling for Spoken Language Understanding using Compositional End-to-End Models EMNLP 2022 BERT Meets CTC: New Formulation of End-to-End Speech Recognition with Pre-trained Masked Language Model EMNLP 2022 Combining Spectral and Self-Supervised Features for Low Resource Speech Recognition and Translation INTERSPEECH 2022 ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding INTERSPEECH 2022 ESPnet-ST IWSLT 2021 Offline Speech Translation System ACL 2021 Differentiable Allophone Graphs for Language-Universal Speech Recognition INTERSPEECH 2021 Searchable Hidden Intermediates for End-to-End Models of Decomposable Sequence Tasks NAACL 2021 Highland Puebla Nahuatl Speech Translation Corpus for Endangered Language Documentation NAACL 2021 ESPnet-ST IWSLT 2021 Offline Speech Translation System IJCNLP 2021