Yanzhi Wang

81 papers · 2017–2025 · 10 top CS/AI conferences

Achievements

🐣 Hot Topic Early Bird 🌍 Conference Polyglot (10) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🏃 Academic Marathon (8) 🤝 Dynamic Duo (28) 👑 Triple Crown 🏆 Grand Slam 🔬 Deep Specialist (38) 🏆 Keyword Champion (4) 🚀 Conference Pioneer ⚡ Prolific Year (14) 🗃️ Keyword Collector (239) 📈 Trend Setter ❓ The Questioner (4) 💎 Century Club (81) 🔥 Unstoppable (9)

Conferences

AAAI (13) NIPS (13) CVPR (12) IJCAI (11) ECCV (9) ICLR (7) ICML (7) ICCV (5) EMNLP (2) WACV (2)

Top co-authors

Geng Yuan (28) Xue Lin (26) Pu Zhao (26) Wei Niu (23) Zhenglun Kong (21) Yanyu Li (20) Xuan Shen (20) Zheng Zhan (17) Yifan Gong (17) Bin Ren (17)

Keywords

model compression (31) neural network optimization (12) neural network pruning (10) vision transformer (9) mobile inference (9) neural architecture search (8) edge computing (8) inference acceleration (6) efficient computing (6) structured pruning (6) real-time inference (6) diffusion model (5) deep neural network (5) sparse training (5) weight pruning (5) lottery ticket hypothesis (5) model pruning (4) image classification (4) object detection (4) adversarial robustness (4)

Papers

Numerical Pruning for Efficient Autoregressive Models · AAAI 2025
QuartDepth: Post-Training Quantization for Real-Time Depth Estimation on the Edge · CVPR 2025
SnapGen-V: Generating a Five-Second Video within Five Seconds on a Mobile Device · CVPR 2025
Taming Diffusion for Dataset Distillation with High Representativeness · ICML 2025
Q-TempFusion: Quantization-Aware Temporal Multi-Sensor Fusion on Bird's-Eye View Representation · WACV 2025
Can Adversarial Examples Be Parsed to Reveal Victim Model Information? · WACV 2025
FairSMOE: Mitigating Multi-Attribute Fairness Problem with Sparse Mixture-of-Experts · IJCAI 2025
Sparse Learning for State Space Models on Mobile · ICLR 2025
Toward Adaptive Large Language Models Structured Pruning via Hybrid-grained Weight Importance Assessment · AAAI 2025
LazyDiT: Lazy Learning for the Acceleration of Diffusion Transformers · AAAI 2025
Fast and Memory-Efficient Video Diffusion Using Streamlined Inference · NIPS 2024
Exploring Token Pruning in Vision State Space Models · NIPS 2024
Search for Efficient Large Language Models · NIPS 2024
E$^2$GAN: Efficient Training of Efficient GANs for Image-to-Image Translation · ICML 2024
Agile-Quant: Activation-Guided Quantization for Faster Inference of LLMs on the Edge · AAAI 2024
Waxing-and-Waning: a Generic Similarity-based Framework for Efficient Self-Supervised Learning · ICLR 2024
Pruning Foundation Models for High Accuracy without Retraining · EMNLP 2024
Rethinking Token Reduction for State Space Models · EMNLP 2024
SNED: Superposition Network Architecture Search for Efficient Video Diffusion Model · CVPR 2024
TextCraftor: Your Text Encoder Can be Image Quality Controller · CVPR 2024
Digital Avatars: Framework Development and Their Evaluation · IJCAI 2024
InstructGIE: Towards Generalizable Image Editing · ECCV 2024
DiffClass: Diffusion-Based Class Incremental Learning · ECCV 2024
Efficient Training with Denoised Neural Weights · ECCV 2024
FasterVD: On Acceleration of Video Diffusion Models · IJCAI 2024
Towards Real-Time Segmentation on the Edge · AAAI 2023
HotBEV: Hardware-oriented Transformer-based Multi-View 3D Detector for BEV Perception · NIPS 2023
PackQViT: Faster Sub-8-bit Vision Transformers via Full and Packed Quantization on the Mobile · NIPS 2023
SnapFusion: Text-to-Image Diffusion Model on Mobile Devices within Two Seconds · NIPS 2023
Peeling the Onion: Hierarchical Reduction of Data Redundancy for Efficient Vision Transformer Training · AAAI 2023
You Need Multiple Exiting: Dynamic Early Exiting for Accelerating Unified Vision Language Model · CVPR 2023
DeepMAD: Mathematical Architecture Design for Deep Convolutional Neural Network · CVPR 2023
Pruning Parameterization With Bi-Level Optimization for Efficient Semantic Segmentation on the Edge · CVPR 2023
Rethinking Vision Transformers for MobileNet Size and Speed · ICCV 2023
Self-Ensemble Protection: Training Checkpoints Are Good Data Protectors · ICLR 2023
SmartFRZ: An Efficient Training Framework using Attention-Based Layer Freezing · ICLR 2023
SpeedDETR: Speed-aware Transformers for End-to-end Object Detection · ICML 2023
DualHSIC: HSIC-Bottleneck and Alignment for Continual Learning · ICML 2023
Data Level Lottery Ticket Hypothesis for Vision Transformers · IJCAI 2023
SparCL: Sparse Continual Learning on the Edge · NIPS 2022
Real-Time Portrait Stylization on the Edge · IJCAI 2022
Effective Model Sparsification by Scheduled Grow-and-Prune Methods · ICLR 2022
F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization · ICLR 2022
Layer Freezing & Data Sieving: Missing Pieces of a Generic Framework for Sparse Training · NIPS 2022
Advancing Model Pruning via Bi-level Optimization · NIPS 2022
You Already Have It: A Generator-Free Low-Precision DNN Training Framework Using Stochastic Rounding · ECCV 2022
Compiler-Aware Neural Architecture Search for On-Mobile Real-Time Super-Resolution · ECCV 2022
Pruning-as-Search: Efficient Neural Architecture Search via Channel Pruning and Structural Reparameterization · IJCAI 2022
SPViT: Enabling Faster Vision Transformers via Latency-Aware Soft Token Pruning · ECCV 2022
Coarsening the Granularity: Towards Structurally Sparse Lottery Tickets · ICML 2022
EfficientFormer: Vision Transformers at MobileNet Speed · NIPS 2022
MEST: Accurate and Fast Memory-Economic Sparse Training Framework on the Edge · NIPS 2021
Lottery Ticket Preserves Weight Correlation: Is It Desirable or Not? · ICML 2021
Improving Neural Network Efficiency via Post-Training Quantization With Adaptive Floating-Point · ICCV 2021
Towards Fast and Accurate Multi-Person Pose Estimation on Mobile Devices · IJCAI 2021
Teachers Do More Than Teach: Compressing Image-to-Image Models · CVPR 2021
A Compression-Compilation Framework for On-mobile Real-time BERT Applications · IJCAI 2021
Sanity Checks for Lottery Tickets: Does Your Winning Ticket Really Win the Jackpot? · NIPS 2021
A Compression-Compilation Co-Design Framework Towards Real-Time Object Detection on Mobile Devices · AAAI 2021
RT3D: Achieving Real-Time Execution of 3D Convolutional Neural Networks on Mobile Devices · AAAI 2021
YOLObile: Real-Time Object Detection on Mobile Devices via Compression-Compilation Co-Design · AAAI 2021
NPAS: A Compiler-Aware Framework of Unified Network Pruning and Architecture Search for Beyond Real-Time Mobile Acceleration · CVPR 2021
Achieving On-Mobile Real-Time Super-Resolution With Neural Architecture and Pruning Search · ICCV 2021
RMSMP: A Novel Deep Neural Network Quantization Framework With Row-Wise Mixed Schemes and Multiple Precisions · ICCV 2021
ScaleCert: Scalable Certified Defense against Adversarial Patches with Sparse Superficial Layers · NIPS 2021
Towards Real-Time DNN Inference on Mobile Platforms with Model Pruning and Compiler Optimization · IJCAI 2020
An Image Enhancing Pattern-based Sparsity for Real-time Inference on Mobile Devices · ECCV 2020
Embedding Compression with Isotropic Iterative Quantization · AAAI 2020
PCONV: The Missing but Desirable Sparsity in DNN Weight Pruning for Real-Time Execution on Mobile Devices · AAAI 2020
AutoCompress: An Automatic DNN Structured Pruning Framework for Ultra-High Compression Rates · AAAI 2020
Adversarial T-shirt! Evading Person Detectors in A Physical World · ECCV 2020
Protecting Neural Networks with Hierarchical Random Switching: Towards Better Robustness-Accuracy Trade-off for Stochastic Defenses · IJCAI 2019
Interpreting and Evaluating Neural Network Robustness · IJCAI 2019
Universal Approximation Property and Equivalence of Stochastic Computing-Based Neural Networks and Binary Neural Networks · AAAI 2019
Multi-Channel Attention Selection GAN With Cascaded Semantic Guidance for Cross-View Image Translation · CVPR 2019
Feature Distillation: DNN-Oriented JPEG Compression Against Adversarial Examples · CVPR 2019
Structured Adversarial Attack: Towards General Implementation and Better Interpretability · ICLR 2019
Adversarial Robustness vs. Model Compression, or Both? · ICCV 2019
Machine Vision Guided 3D Medical Image Compression for Efficient Transmission and Accurate Segmentation in the Clouds · CVPR 2019
A Systematic DNN Weight Pruning Framework using Alternating Direction Method of Multipliers · ECCV 2018
Theoretical Properties for Neural Networks with Weight Matrices of Low Displacement Rank · ICML 2017