Hongxu Yin
36 papers · 2019–2025 · 8 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+12 more ↓ Show less ↑
π Conference Polyglot (8) π Academic Marathon (6) π§ Keyword Pioneer π Interdisciplinary Bridge π Cross-Pollinator (6)
π
Cross-Pollinator
(6)
π
Renaissance Researcher
(8)
πΊοΈ
Taxonomy Completionist
(50)
π₯
Mega-Team
(25)
π€
Dynamic Duo
(28)
π
Triple Crown
π§¬
Topic Evolution
π
Century Club
(36)
ποΈ
Keyword Collector
(127)
π₯
Unstoppable
(7)
β
The Questioner
β‘
Prolific Year
(10)
Conferences
CVPR (16)
ICLR (6)
ICML (5)
NIPS (3)
ECCV (2)
WACV (2)
ICCV (1)
RSS (1)
Top co-authors
Research topics
Keywords
model compression
(9)
knowledge distillation
(5)
vision-language model
(4)
vision transformer
(4)
vision language model
(3)
network pruning
(3)
large language model
(3)
structural pruning
(3)
image classification
(2)
instruction tuning
(2)
data-free learning
(2)
continual learning
(2)
neural network optimization
(2)
transfer learning
(2)
contrastive learning
(2)
image reconstruction
(2)
object detection
(2)
image synthesis
(2)
privacy attack
(2)
neural network architecture
(2)
Papers
Advancing Weight and Channel Sparsification with Enhanced Saliency
WACV 2025
RADIOv2.5: Improved Baselines for Agglomerative Vision Foundation Models
CVPR 2025
NVILA: Efficient Frontier Visual Language Models
CVPR 2025
Scaling Vision Pre-Training to 4K Resolution
CVPR 2025
VILA-M3: Enhancing Vision-Language Models with Medical Expert Knowledge
CVPR 2025
Token-Efficient VLM: High-Resolution Image Understanding via Dynamic Region Proposal
ICCV 2025
VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation
ICLR 2025
LongVILA: Scaling Long-Context Visual Language Models for Long Videos
ICLR 2025
Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders
ICLR 2025
LLaMaFlex: Many-in-one LLMs via Generalized Pruning and Weight Sharing
ICLR 2025
NaVILA: Legged Robot Vision-Language-Action Model for Navigation
RSS 2025
Flextron: Many-in-One Flexible Large Language Model
ICML 2024
Adaptive Sharpness-Aware Pruning for Robust Sparse Networks
ICLR 2024
FasterViT: Fast Vision Transformers with Hierarchical Attention
ICLR 2024
DoRA: Weight-Decomposed Low-Rank Adaptation
ICML 2024
SpatialRGPT: Grounded Spatial Reasoning in Vision-Language Models
NIPS 2024
MaskLLM: Learnable Semi-Structured Sparsity for Large Language Models
NIPS 2024
RegionGPT: Towards Region Understanding Vision Language Model
CVPR 2024
VILA: On Pre-training for Visual Language Models
CVPR 2024
FedBPT: Efficient Federated Black-box Prompt Tuning for Large Language Models
ICML 2024
LITA: Language Instructed Temporal-Localization Assistant
ECCV 2024
Global Vision Transformer Pruning With Hessian-Aware Saliency
CVPR 2023
Heterogeneous Continual Learning
CVPR 2023
Global Context Vision Transformers
ICML 2023
Loss-Guided Diffusion Models for Plug-and-Play Controllable Generation
ICML 2023
Recurrence Without Recurrence: Stable Video Landmark Detection With Deep Equilibrium Models
CVPR 2023
LANA: Latency Aware Network Acceleration
ECCV 2022
GradViT: Gradient Inversion of Vision Transformers
CVPR 2022
When To Prune? A Policy Towards Early Structural Pruning
CVPR 2022
A-ViT: Adaptive Tokens for Efficient Vision Transformer
CVPR 2022
Structural Pruning via Latency-Saliency Knapsack
NIPS 2022
See Through Gradients: Image Batch Recovery via GradInversion
CVPR 2021
Optimal Quantization Using Scaled Codebook
CVPR 2021
Data-Free Knowledge Distillation for Object Detection
WACV 2021
Dreaming to Distill: Data-Free Knowledge Transfer via DeepInversion
CVPR 2020
ChamNet: Towards Efficient Network Design Through Platform-Aware Model Adaptation
CVPR 2019