Hongxu Yin

36 papers · 2019–2025 · 8 conferences · across top CS/AI conferences

Achievements

+12 more ↓

🌍 Conference Polyglot (8) 🏃 Academic Marathon (6) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🐝 Cross-Pollinator (6)

🐝 Cross-Pollinator (6) 🌈 Renaissance Researcher (8) 🗺️ Taxonomy Completionist (50) 👥 Mega-Team (25) 🤝 Dynamic Duo (28) 👑 Triple Crown 🧬 Topic Evolution 💎 Century Club (36) 🗃️ Keyword Collector (127) 🔥 Unstoppable (7) ❓ The Questioner ⚡ Prolific Year (10)

Conferences

CVPR (16) ICLR (6) ICML (5) NIPS (3) ECCV (2) WACV (2) ICCV (1) RSS (1)

Top co-authors

Pavlo Molchanov (28) Jan Kautz (22) Jose M. Alvarez (9) Yao Lu (6) Maying Shen (6) Greg Heinrich (6) Sifei Liu (6) Arash Vahdat (5) Song Han (5) De-An Huang (3)

Research topics

Models (1) Privacy (1)

Keywords

model compression (9) knowledge distillation (5) vision-language model (4) vision transformer (4) vision language model (3) network pruning (3) large language model (3) structural pruning (3) image classification (2) instruction tuning (2) data-free learning (2) continual learning (2) neural network optimization (2) transfer learning (2) contrastive learning (2) image reconstruction (2) object detection (2) image synthesis (2) privacy attack (2) neural network architecture (2)

Papers

Advancing Weight and Channel Sparsification with Enhanced Saliency WACV 2025 RADIOv2.5: Improved Baselines for Agglomerative Vision Foundation Models CVPR 2025 NVILA: Efficient Frontier Visual Language Models CVPR 2025 Scaling Vision Pre-Training to 4K Resolution CVPR 2025 VILA-M3: Enhancing Vision-Language Models with Medical Expert Knowledge CVPR 2025 Token-Efficient VLM: High-Resolution Image Understanding via Dynamic Region Proposal ICCV 2025 VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation ICLR 2025 LongVILA: Scaling Long-Context Visual Language Models for Long Videos ICLR 2025 Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders ICLR 2025 LLaMaFlex: Many-in-one LLMs via Generalized Pruning and Weight Sharing ICLR 2025 NaVILA: Legged Robot Vision-Language-Action Model for Navigation RSS 2025 Flextron: Many-in-One Flexible Large Language Model ICML 2024 Adaptive Sharpness-Aware Pruning for Robust Sparse Networks ICLR 2024 FasterViT: Fast Vision Transformers with Hierarchical Attention ICLR 2024 DoRA: Weight-Decomposed Low-Rank Adaptation ICML 2024 SpatialRGPT: Grounded Spatial Reasoning in Vision-Language Models NIPS 2024 MaskLLM: Learnable Semi-Structured Sparsity for Large Language Models NIPS 2024 RegionGPT: Towards Region Understanding Vision Language Model CVPR 2024 VILA: On Pre-training for Visual Language Models CVPR 2024 FedBPT: Efficient Federated Black-box Prompt Tuning for Large Language Models ICML 2024 LITA: Language Instructed Temporal-Localization Assistant ECCV 2024 Global Vision Transformer Pruning With Hessian-Aware Saliency CVPR 2023 Heterogeneous Continual Learning CVPR 2023 Global Context Vision Transformers ICML 2023 Loss-Guided Diffusion Models for Plug-and-Play Controllable Generation ICML 2023 Recurrence Without Recurrence: Stable Video Landmark Detection With Deep Equilibrium Models CVPR 2023 LANA: Latency Aware Network Acceleration ECCV 2022 GradViT: Gradient Inversion of Vision Transformers CVPR 2022 When To Prune? A Policy Towards Early Structural Pruning CVPR 2022 A-ViT: Adaptive Tokens for Efficient Vision Transformer CVPR 2022 Structural Pruning via Latency-Saliency Knapsack NIPS 2022 See Through Gradients: Image Batch Recovery via GradInversion CVPR 2021 Optimal Quantization Using Scaled Codebook CVPR 2021 Data-Free Knowledge Distillation for Object Detection WACV 2021 Dreaming to Distill: Data-Free Knowledge Transfer via DeepInversion CVPR 2020 ChamNet: Towards Efficient Network Design Through Platform-Aware Model Adaptation CVPR 2019