Yangyang Shi

27 papers · 2016–2026 · 7 top CS/AI conferences

Achievements

🏃 Academic Marathon (9) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (6) 🐝 Cross-Pollinator (13)

🌈 Renaissance Researcher (7) 🤝 Dynamic Duo (12) 🔬 Deep Specialist (10) 🧬 Topic Evolution ⚡ Prolific Year (6) 💎 Century Club (26) 🔥 Unstoppable (6) 🗃️ Keyword Collector (124)

Conferences

INTERSPEECH (12) ACL (5) NAACL (4) EMNLP (2) ICML (2) AAAI (1) CVPR (1)

Top co-authors

Vikas Chandra (12) Ernie Chang (10) Changsheng Zhao (8) Chunyang Wu (8) Ozlem Kalinli (8) Duc Le (6) Jay Mahadeokar (6) Zechun Liu (6) Christian Fuegen (6) Michael L. Seltzer (6)

Keywords

automatic speech recognition (6) word error rate (4) speech recognition (3) on-device speech recognition (3) language model (3) knowledge distillation (2) attention mechanism (2) latency optimization (2) acoustic modeling (2) acoustic model (2) streaming speech recognition (2) speech synthesis (1) graph learning (1) machine translation (1) domain adaptation (1) model quantization (1) knowledge transfer (1) data annotation (1) zero-shot learning (1) text generation (1)

Papers

OmniEvent: Unified Event Representation Learning · AAAI 2026
Breaking Down Power Barriers in On-Device Streaming ASR: Insights and Solutions · NAACL 2025
Self-Vocabularizing Training for Neural Machine Translation · NAACL 2025
AutoMixer: Checkpoint Artifacts as Automatic Data Mixers · ACL 2025
Agent-as-a-Judge: Evaluate Agents with Agents · ICML 2025
Tumor Micro-environment Interactions Guided Graph Learning for Survival Analysis of Human Cancers from Whole-slide Pathological Images · CVPR 2024
Target-Aware Language Modeling via Granular Data Sampling · EMNLP 2024
Scaling Parameter-Constrained Language Models with Quality Data · EMNLP 2024
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases · ICML 2024
LLM-QAT: Data-Free Quantization Aware Training for Large Language Models · ACL 2024
Speech ReaLLM – Real-time Speech Recognition with Multimodal Language Models by Teaching the Flow of Time · INTERSPEECH 2024
Binary and Ternary Natural Language Generation · ACL 2023
Towards Zero-Shot Multilingual Transfer for Code-Switched Responses · ACL 2023
Revisiting Sample Size Determination in Natural Language Understanding · ACL 2023
Multi-Head State Space Model for Speech Recognition · INTERSPEECH 2023
Biased Self-supervised Learning for ASR · INTERSPEECH 2023
Streaming parallel transducer beam search with fast slow cascaded encoders · INTERSPEECH 2022
Collaborative Training of Acoustic Encoders for Speech Recognition · INTERSPEECH 2021
Flexi-Transducer: Optimizing Latency, Accuracy and Compute for Multi-Domain On-Device Scenarios · INTERSPEECH 2021
Contextualized Streaming End-to-End Speech Recognition with Trie-Based Deep Biasing and Shallow Fusion · INTERSPEECH 2021
Transformer-Based Acoustic Modeling for Streaming Speech Synthesis · INTERSPEECH 2021
Dynamic Encoder Transducer: A Flexible Solution for Trading Off Accuracy for Latency · INTERSPEECH 2021
Dissecting User-Perceived Latency of On-Device E2E Speech Recognition · INTERSPEECH 2021
Weak-Attention Suppression for Transformer Based Speech Recognition · INTERSPEECH 2020
Streaming Transformer-Based Acoustic Models Using Self-Attention with Augmented Memory · INTERSPEECH 2020
Deep LSTM based Feature Mapping for Query Classification · NAACL 2016
Recurrent Support Vector Machines For Slot Tagging In Spoken Language Understanding · NAACL 2016