Mao Yang

28 papers · 2020–2026 · 11 top CS/AI conferences

Achievements

🧭 Keyword Pioneer 🌈 Renaissance Researcher (5) 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (10)

🌍 Conference Polyglot (10) 🏃 Academic Marathon (5) 🐝 Cross-Pollinator (13) 🏆 Grand Slam 🤝 Dynamic Duo (17) 🔬 Deep Specialist (12) 🏆 Keyword Champion (3) 📈 Trend Setter 💎 Century Club (26) 🗃️ Keyword Collector (113) ⚡ Prolific Year (7) 🔥 Unstoppable (6)

Conferences

OSDI (9) ICML (3) NIPS (3) AAAI (2) ACL (2) EMNLP (2) ICCV (2) NSDI (2) ECCV (1) ICLR (1) INTERSPEECH (1)

Top co-authors

Fan Yang (17) Li Lyna Zhang (11) Lidong Zhou (8) Quanlu Zhang (7) Ting Cao (6) Yuqing Yang (6) Qi Chen (5) Jiahang Xu (5) Lingxiao Ma (5) Yujing Wang (5)

Keywords

neural architecture search (5) model compression (4) hardware acceleration (3) deep learning training (2) document retrieval (2) search space (2) deep neural network (2) semantic search (2) latency optimization (2) dnn compiler (2) inference optimization (2) hardware-aware optimization (2) text representation (1) information retrieval (1) transfer learning (1) prompt engineering (1) natural language inference (1) few-shot learning (1) post-training quantization (1) knowledge distillation (1)

Papers

Data Mixing Agent: Learning to Re-weight Domains for Continual Pre-training ACL 2026
Gold-Medal-Level Olympiad Geometry Solving with Efficient Heuristic Auxiliary Constructions ACL 2026
LongRoPE2: Near-Lossless LLM Context Window Scaling ICML 2025
PipeThreader: Software-Defined Pipelining for Efficient DNN Execution OSDI 2025
Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solver ICLR 2025
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking ICML 2025
VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models EMNLP 2024
Fewer is More: Boosting Math Reasoning with Reinforced Context Pruning EMNLP 2024
LitePred: Transferable and Scalable Latency Prediction for Hardware-Aware Neural Architecture Search NSDI 2024
nnScaler: Constraint-Guided Parallelization Plan Generation for Deep Learning Training OSDI 2024
LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens ICML 2024
Ladder: Enabling Efficient Low-Precision Deep Learning Computing through Hardware-aware Tensor Transformation OSDI 2024
IRGen: Generative Modeling for Image Retrieval ECCV 2024
Accurate and Structured Pruning for Efficient Automatic Speech Recognition INTERSPEECH 2023
Model-enhanced Vector Index NIPS 2023
SpaceEvo: Hardware-Friendly Search Space Design for Efficient INT8 Inference ICCV 2023
ElasticViT: Conflict-aware Supernet Training for Deploying Fast Vision Transformer on Diverse Mobile Devices ICCV 2023
On Modular Learning of Distributed Systems for Predicting End-to-End Latency NSDI 2023
VBASE: Unifying Online Vector Similarity Search and Relational Queries via Relaxed Monotonicity OSDI 2023
Cocktailer: Analyzing and Optimizing Dynamic Control Flow in Deep Learning OSDI 2023
ROLLER: Fast and Efficient Tensor Compilation for Deep Learning OSDI 2022
A Neural Corpus Indexer for Document Retrieval NIPS 2022
SparTA: Deep-Learning Model Sparsity via Tensor-with-Sparsity-Attribute OSDI 2022
SPANN: Highly-efficient Billion-scale Approximate Nearest Neighborhood Search NIPS 2021
OpEvo: An Evolutionary Method for Tensor Operator Optimization AAAI 2021
HiveD: Sharing a GPU Cluster for Deep Learning with Guarantees OSDI 2020
TextNAS: A Neural Architecture Search Space Tailored for Text Representation AAAI 2020
Retiarii: A Deep Learning Exploratory-Training Framework OSDI 2020