Yuxiang Huang
4 papers · 2024–2026 · 2 conferences · across top CS/AI conferences
Achievements
Interdisciplinary Bridge
Keyword Pioneer
Conference Polyglot (2)
Cross-Pollinator (12)
Renaissance Researcher (5)
Taxonomy Completionist (11)
Conferences
ACL (3)
EMNLP (1)
Keywords
inference acceleration (3)
approximate attention (2)
large language model (2)
sequence parallelism (2)
draft verification (2)
large multimodal model (1)
flash attention (1)
speculative decoding (1)
draft model (1)
speculative sampling (1)
key-value cache (1)
load balancing (1)
parallel decoding (1)
long-context inference (1)
context block (1)
text generation (1)
long-video understanding (1)
distributed inference (1)
parallel processing (1)
vocabulary compression (1)
Papers
APB-V: Accelerating Long-Video Understanding via Sequence-Parallelism-aware Approximate Attention
ACL 2026
FR-Spec: Accelerating Large-Vocabulary Language Models via Frequency-Ranked Speculative Sampling
ACL 2025
APB: Accelerating Distributed Long-Context Inference by Passing Compressed Context Blocks across GPUs
ACL 2025
Ouroboros: Generating Longer Drafts Phrase by Phrase for Faster Speculative Decoding
EMNLP 2024