Yanzhi Wang
81 papers · 2017–2025 · 10 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+15 more ↓ Show less ↑
🐣 Hot Topic Early Bird 🌍 Conference Polyglot (10) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🏃 Academic Marathon (8)
🌉
Interdisciplinary Bridge
🧭
Keyword Pioneer
🐣
Hot Topic Early Bird
🤝
Dynamic Duo
(28)
👑
Triple Crown
🏆
Grand Slam
🔬
Deep Specialist
(38)
🏆
Keyword Champion
(4)
🚀
Conference Pioneer
⚡
Prolific Year
(14)
🗃️
Keyword Collector
(239)
📈
Trend Setter
❓
The Questioner
(4)
💎
Century Club
(81)
🔥
Unstoppable
(9)
Conferences
AAAI (13)
NIPS (13)
CVPR (12)
IJCAI (11)
ECCV (9)
ICLR (7)
ICML (7)
ICCV (5)
EMNLP (2)
WACV (2)
Top co-authors
Keywords
model compression
(31)
neural network optimization
(12)
neural network pruning
(10)
vision transformer
(9)
mobile inference
(9)
neural architecture search
(8)
edge computing
(8)
inference acceleration
(6)
efficient computing
(6)
structured pruning
(6)
real-time inference
(6)
diffusion model
(5)
deep neural network
(5)
sparse training
(5)
weight pruning
(5)
lottery ticket hypothesis
(5)
model pruning
(4)
image classification
(4)
object detection
(4)
adversarial robustness
(4)
Papers
Numerical Pruning for Efficient Autoregressive Models
AAAI 2025
QuartDepth: Post-Training Quantization for Real-Time Depth Estimation on the Edge
CVPR 2025
SnapGen-V: Generating a Five-Second Video within Five Seconds on a Mobile Device
CVPR 2025
Taming Diffusion for Dataset Distillation with High Representativeness
ICML 2025
Q-TempFusion: Quantization-Aware Temporal Multi-Sensor Fusion on Bird's-Eye View Representation
WACV 2025
Can Adversarial Examples Be Parsed to Reveal Victim Model Information?
WACV 2025
FairSMOE: Mitigating Multi-Attribute Fairness Problem with Sparse Mixture-of-Experts
IJCAI 2025
Sparse Learning for State Space Models on Mobile
ICLR 2025
Toward Adaptive Large Language Models Structured Pruning via Hybrid-grained Weight Importance Assessment
AAAI 2025
LazyDiT: Lazy Learning for the Acceleration of Diffusion Transformers
AAAI 2025
Fast and Memory-Efficient Video Diffusion Using Streamlined Inference
NIPS 2024
Exploring Token Pruning in Vision State Space Models
NIPS 2024
Search for Efficient Large Language Models
NIPS 2024
E$^2$GAN: Efficient Training of Efficient GANs for Image-to-Image Translation
ICML 2024
Agile-Quant: Activation-Guided Quantization for Faster Inference of LLMs on the Edge
AAAI 2024
Waxing-and-Waning: a Generic Similarity-based Framework for Efficient Self-Supervised Learning
ICLR 2024
Pruning Foundation Models for High Accuracy without Retraining
EMNLP 2024
Rethinking Token Reduction for State Space Models
EMNLP 2024
SNED: Superposition Network Architecture Search for Efficient Video Diffusion Model
CVPR 2024
TextCraftor: Your Text Encoder Can be Image Quality Controller
CVPR 2024
Digital Avatars: Framework Development and Their Evaluation
IJCAI 2024
InstructGIE: Towards Generalizable Image Editing
ECCV 2024
DiffClass: Diffusion-Based Class Incremental Learning
ECCV 2024
Efficient Training with Denoised Neural Weights
ECCV 2024
FasterVD: On Acceleration of Video Diffusion Models
IJCAI 2024
Towards Real-Time Segmentation on the Edge
AAAI 2023
HotBEV: Hardware-oriented Transformer-based Multi-View 3D Detector for BEV Perception
NIPS 2023
PackQViT: Faster Sub-8-bit Vision Transformers via Full and Packed Quantization on the Mobile
NIPS 2023
SnapFusion: Text-to-Image Diffusion Model on Mobile Devices within Two Seconds
NIPS 2023
Peeling the Onion: Hierarchical Reduction of Data Redundancy for Efficient Vision Transformer Training
AAAI 2023
You Need Multiple Exiting: Dynamic Early Exiting for Accelerating Unified Vision Language Model
CVPR 2023
DeepMAD: Mathematical Architecture Design for Deep Convolutional Neural Network
CVPR 2023
Pruning Parameterization With Bi-Level Optimization for Efficient Semantic Segmentation on the Edge
CVPR 2023
Rethinking Vision Transformers for MobileNet Size and Speed
ICCV 2023
Self-Ensemble Protection: Training Checkpoints Are Good Data Protectors
ICLR 2023
SmartFRZ: An Efficient Training Framework using Attention-Based Layer Freezing
ICLR 2023
SpeedDETR: Speed-aware Transformers for End-to-end Object Detection
ICML 2023
DualHSIC: HSIC-Bottleneck and Alignment for Continual Learning
ICML 2023
Data Level Lottery Ticket Hypothesis for Vision Transformers
IJCAI 2023
SparCL: Sparse Continual Learning on the Edge
NIPS 2022
Real-Time Portrait Stylization on the Edge
IJCAI 2022
Effective Model Sparsification by Scheduled Grow-and-Prune Methods
ICLR 2022
F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization
ICLR 2022
Layer Freezing & Data Sieving: Missing Pieces of a Generic Framework for Sparse Training
NIPS 2022
Advancing Model Pruning via Bi-level Optimization
NIPS 2022
You Already Have It: A Generator-Free Low-Precision DNN Training Framework Using Stochastic Rounding
ECCV 2022
Compiler-Aware Neural Architecture Search for On-Mobile Real-Time Super-Resolution
ECCV 2022
Pruning-as-Search: Efficient Neural Architecture Search via Channel Pruning and Structural Reparameterization
IJCAI 2022
SPViT: Enabling Faster Vision Transformers via Latency-Aware Soft Token Pruning
ECCV 2022
Coarsening the Granularity: Towards Structurally Sparse Lottery Tickets
ICML 2022
EfficientFormer: Vision Transformers at MobileNet Speed
NIPS 2022
MEST: Accurate and Fast Memory-Economic Sparse Training Framework on the Edge
NIPS 2021
Lottery Ticket Preserves Weight Correlation: Is It Desirable or Not?
ICML 2021
Improving Neural Network Efficiency via Post-Training Quantization With Adaptive Floating-Point
ICCV 2021
Towards Fast and Accurate Multi-Person Pose Estimation on Mobile Devices
IJCAI 2021
Teachers Do More Than Teach: Compressing Image-to-Image Models
CVPR 2021
A Compression-Compilation Framework for On-mobile Real-time BERT Applications
IJCAI 2021
Sanity Checks for Lottery Tickets: Does Your Winning Ticket Really Win the Jackpot?
NIPS 2021
A Compression-Compilation Co-Design Framework Towards Real-Time Object Detection on Mobile Devices
AAAI 2021
RT3D: Achieving Real-Time Execution of 3D Convolutional Neural Networks on Mobile Devices
AAAI 2021
YOLObile: Real-Time Object Detection on Mobile Devices via Compression-Compilation Co-Design
AAAI 2021
NPAS: A Compiler-Aware Framework of Unified Network Pruning and Architecture Search for Beyond Real-Time Mobile Acceleration
CVPR 2021
Achieving On-Mobile Real-Time Super-Resolution With Neural Architecture and Pruning Search
ICCV 2021
RMSMP: A Novel Deep Neural Network Quantization Framework With Row-Wise Mixed Schemes and Multiple Precisions
ICCV 2021
ScaleCert: Scalable Certified Defense against Adversarial Patches with Sparse Superficial Layers
NIPS 2021
Towards Real-Time DNN Inference on Mobile Platforms with Model Pruning and Compiler Optimization
IJCAI 2020
An Image Enhancing Pattern-based Sparsity for Real-time Inference on Mobile Devices
ECCV 2020
Embedding Compression with Isotropic Iterative Quantization
AAAI 2020
PCONV: The Missing but Desirable Sparsity in DNN Weight Pruning for Real-Time Execution on Mobile Devices
AAAI 2020
AutoCompress: An Automatic DNN Structured Pruning Framework for Ultra-High Compression Rates
AAAI 2020
Adversarial T-shirt! Evading Person Detectors in A Physical World
ECCV 2020
Protecting Neural Networks with Hierarchical Random Switching: Towards Better Robustness-Accuracy Trade-off for Stochastic Defenses
IJCAI 2019
Interpreting and Evaluating Neural Network Robustness
IJCAI 2019
Universal Approximation Property and Equivalence of Stochastic Computing-Based Neural Networks and Binary Neural Networks
AAAI 2019
Multi-Channel Attention Selection GAN With Cascaded Semantic Guidance for Cross-View Image Translation
CVPR 2019
Feature Distillation: DNN-Oriented JPEG Compression Against Adversarial Examples
CVPR 2019
Structured Adversarial Attack: Towards General Implementation and Better Interpretability
ICLR 2019
Adversarial Robustness vs. Model Compression, or Both?
ICCV 2019
Machine Vision Guided 3D Medical Image Compression for Efficient Transmission and Accurate Segmentation in the Clouds
CVPR 2019
A Systematic DNN Weight Pruning Framework using Alternating Direction Method of Multipliers
ECCV 2018
Theoretical Properties for Neural Networks with Weight Matrices of Low Displacement Rank
ICML 2017