Mao Yang
28 papers · 2020–2026 · 11 top CS/AI conferences
Achievements
Keyword Pioneer
Renaissance Researcher (5)
Interdisciplinary Bridge
Taxonomy Completionist (10)
Conference Polyglot (10)
Academic Marathon (5)
Cross-Pollinator (13)
Grand Slam
Dynamic Duo (17)
Deep Specialist (12)
Keyword Champion (3)
Trend Setter
Century Club (26)
Keyword Collector (113)
Prolific Year (7)
Unstoppable (6)
Conferences
OSDI (9)
ICML (3)
NIPS (3)
AAAI (2)
ACL (2)
EMNLP (2)
ICCV (2)
NSDI (2)
ECCV (1)
ICLR (1)
INTERSPEECH (1)
Keywords
neural architecture search (5)
model compression (4)
hardware acceleration (3)
deep learning training (2)
document retrieval (2)
search space (2)
deep neural network (2)
semantic search (2)
latency optimization (2)
dnn compiler (2)
inference optimization (2)
hardware-aware optimization (2)
text representation (1)
information retrieval (1)
transfer learning (1)
prompt engineering (1)
natural language inference (1)
few-shot learning (1)
post-training quantization (1)
knowledge distillation (1)
Papers
Data Mixing Agent: Learning to Re-weight Domains for Continual Pre-training
ACL 2026
Gold-Medal-Level Olympiad Geometry Solving with Efficient Heuristic Auxiliary Constructions
ACL 2026
LongRoPE2: Near-Lossless LLM Context Window Scaling
ICML 2025
PipeThreader: Software-Defined Pipelining for Efficient DNN Execution
OSDI 2025
Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solver
ICLR 2025
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking
ICML 2025
VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models
EMNLP 2024
Fewer is More: Boosting Math Reasoning with Reinforced Context Pruning
EMNLP 2024
LitePred: Transferable and Scalable Latency Prediction for Hardware-Aware Neural Architecture Search
NSDI 2024
nnScaler: Constraint-Guided Parallelization Plan Generation for Deep Learning Training
OSDI 2024
LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens
ICML 2024
Ladder: Enabling Efficient Low-Precision Deep Learning Computing through Hardware-aware Tensor Transformation
OSDI 2024
IRGen: Generative Modeling for Image Retrieval
ECCV 2024
Accurate and Structured Pruning for Efficient Automatic Speech Recognition
INTERSPEECH 2023
Model-enhanced Vector Index
NIPS 2023
SpaceEvo: Hardware-Friendly Search Space Design for Efficient INT8 Inference
ICCV 2023
ElasticViT: Conflict-aware Supernet Training for Deploying Fast Vision Transformer on Diverse Mobile Devices
ICCV 2023
On Modular Learning of Distributed Systems for Predicting End-to-End Latency
NSDI 2023
VBASE: Unifying Online Vector Similarity Search and Relational Queries via Relaxed Monotonicity
OSDI 2023
Cocktailer: Analyzing and Optimizing Dynamic Control Flow in Deep Learning
OSDI 2023
ROLLER: Fast and Efficient Tensor Compilation for Deep Learning
OSDI 2022
A Neural Corpus Indexer for Document Retrieval
NIPS 2022
SparTA: Deep-Learning Model Sparsity via Tensor-with-Sparsity-Attribute
OSDI 2022
SPANN: Highly-efficient Billion-scale Approximate Nearest Neighborhood Search
NIPS 2021
OpEvo: An Evolutionary Method for Tensor Operator Optimization
AAAI 2021
HiveD: Sharing a GPU Cluster for Deep Learning with Guarantees
OSDI 2020
TextNAS: A Neural Architecture Search Space Tailored for Text Representation
AAAI 2020
Retiarii: A Deep Learning Exploratory-Training Framework
OSDI 2020