Yangyang Shi
27 papers · 2016–2026 · 7 top CS/AI conferences
Achievements
🏃 Academic Marathon (9)
🌉 Interdisciplinary Bridge
🧭 Keyword Pioneer
🌍 Conference Polyglot (6)
🐝 Cross-Pollinator (13)
🌈 Renaissance Researcher (7)
🤝 Dynamic Duo (12)
🔬 Deep Specialist (10)
🧬 Topic Evolution
⚡ Prolific Year (6)
💎 Century Club (26)
🔥 Unstoppable (6)
🗃️ Keyword Collector (124)
Conferences
INTERSPEECH (12)
ACL (5)
NAACL (4)
EMNLP (2)
ICML (2)
AAAI (1)
CVPR (1)
Keywords
automatic speech recognition (6)
word error rate (4)
speech recognition (3)
on-device speech recognition (3)
language model (3)
knowledge distillation (2)
attention mechanism (2)
latency optimization (2)
acoustic modeling (2)
acoustic model (2)
streaming speech recognition (2)
speech synthesis (1)
graph learning (1)
machine translation (1)
domain adaptation (1)
model quantization (1)
knowledge transfer (1)
data annotation (1)
zero-shot learning (1)
text generation (1)
Papers
OmniEvent: Unified Event Representation Learning
AAAI 2026
Breaking Down Power Barriers in On-Device Streaming ASR: Insights and Solutions
NAACL 2025
Self-Vocabularizing Training for Neural Machine Translation
NAACL 2025
AutoMixer: Checkpoint Artifacts as Automatic Data Mixers
ACL 2025
Agent-as-a-Judge: Evaluate Agents with Agents
ICML 2025
Tumor Micro-environment Interactions Guided Graph Learning for Survival Analysis of Human Cancers from Whole-slide Pathological Images
CVPR 2024
Target-Aware Language Modeling via Granular Data Sampling
EMNLP 2024
Scaling Parameter-Constrained Language Models with Quality Data
EMNLP 2024
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases
ICML 2024
LLM-QAT: Data-Free Quantization Aware Training for Large Language Models
ACL 2024
Speech ReaLLM – Real-time Speech Recognition with Multimodal Language Models by Teaching the Flow of Time
INTERSPEECH 2024
Binary and Ternary Natural Language Generation
ACL 2023
Towards Zero-Shot Multilingual Transfer for Code-Switched Responses
ACL 2023
Revisiting Sample Size Determination in Natural Language Understanding
ACL 2023
Multi-Head State Space Model for Speech Recognition
INTERSPEECH 2023
Biased Self-supervised Learning for ASR
INTERSPEECH 2023
Streaming parallel transducer beam search with fast-slow cascaded encoders
INTERSPEECH 2022
Collaborative Training of Acoustic Encoders for Speech Recognition
INTERSPEECH 2021
Flexi-Transducer: Optimizing Latency, Accuracy and Compute for Multi-Domain On-Device Scenarios
INTERSPEECH 2021
Contextualized Streaming End-to-End Speech Recognition with Trie-Based Deep Biasing and Shallow Fusion
INTERSPEECH 2021
Transformer-Based Acoustic Modeling for Streaming Speech Synthesis
INTERSPEECH 2021
Dynamic Encoder Transducer: A Flexible Solution for Trading Off Accuracy for Latency
INTERSPEECH 2021
Dissecting User-Perceived Latency of On-Device E2E Speech Recognition
INTERSPEECH 2021
Weak-Attention Suppression for Transformer Based Speech Recognition
INTERSPEECH 2020
Streaming Transformer-Based Acoustic Models Using Self-Attention with Augmented Memory
INTERSPEECH 2020
Deep LSTM based Feature Mapping for Query Classification
NAACL 2016
Recurrent Support Vector Machines For Slot Tagging In Spoken Language Understanding
NAACL 2016