Houwen Peng
29 papers · 2013–2026 · 8 top CS/AI conferences
Achievements
Academic Marathon (12)
Conference Polyglot (8)
Interdisciplinary Bridge
Keyword Pioneer
Cross-Pollinator (13)
Taxonomy Completionist (53)
Dynamic Duo (11)
Prolific Year (7)
Conference Pioneer
Unstoppable (7)
Century Club (27)
Keyword Collector (114)
Conferences
CVPR (8)
ICCV (7)
NIPS (5)
ECCV (3)
AAAI (2)
ACL (2)
EMNLP (1)
ICML (1)
Keywords
vision transformer (6)
object tracking (4)
neural architecture search (4)
knowledge distillation (3)
visual tracking (3)
one-shot learning (3)
multimodal learning (3)
model compression (3)
zero-shot learning (3)
neural network (2)
contrastive learning (2)
parameter efficiency (2)
weight sharing (2)
siamese network (2)
image classification (2)
attention mechanism (2)
transformer architecture (2)
multimodal large language model (2)
transductive learning (1)
video segmentation (1)
Papers
HiTVideo: Hierarchical Tokenizers for Enhancing Text-to-Video Generation with Autoregressive Large Language Models
AAAI 2026
Beyond Ranking: Fine-Grained Diagnostics and Self-Improvement for MLLMs
ACL 2026
Mitigating Visual Forgetting via Take-along Visual Conditioning for Multi-modal Long CoT Reasoning
ACL 2025
RBench: Graduate-level Multi-disciplinary Benchmarks for LLM & MLLM Complex Reasoning Evaluation
ICML 2025
ScalingFilter: Assessing Data Quality through Inverse Utilization of Scaling Laws
EMNLP 2024
iCLIP: Bridging Image Classification and Contrastive Language-Image Pre-Training for Visual Recognition
CVPR 2023
ImageBrush: Learning Visual In-Context Instructions for Exemplar-Based Image Manipulation
NIPS 2023
SeqTrack: Sequence to Sequence Learning for Visual Object Tracking
CVPR 2023
EfficientViT: Memory Efficient Vision Transformer With Cascaded Group Attention
CVPR 2023
TinyCLIP: CLIP Distillation via Affinity Mimicking and Weight Inheritance
ICCV 2023
Exploring Lightweight Hierarchical Vision Transformers for Efficient Visual Tracking
ICCV 2023
Attentive Mask CLIP
ICCV 2023
Expanding Language-Image Pretrained Models for General Video Recognition
ECCV 2022
TinyViT: Fast Pretraining Distillation for Small Vision Transformers
ECCV 2022
MiniViT: Compressing Vision Transformers With Weight Multiplexing
CVPR 2022
PointNeXt: Revisiting PointNet++ with Improved Training and Scaling Strategies
NIPS 2022
Rethinking and Improving Relative Position Encoding for Vision Transformer
ICCV 2021
Probing Inter-modality: Visual Parsing with Self-Attention for Vision-and-Language Pre-training
NIPS 2021
Learning To Track Objects From Unlabeled Videos
ICCV 2021
Searching the Search Space of Vision Transformer
NIPS 2021
Learning Spatio-Temporal Transformer for Visual Tracking
ICCV 2021
AutoFormer: Searching Transformers for Visual Recognition
ICCV 2021
LightTrack: Finding Lightweight Neural Networks for Object Tracking via One-Shot Architecture Search
CVPR 2021
Ocean: Object-aware Anchor-free Tracking
ECCV 2020
Learning 2D Temporal Adjacent Networks for Moment Localization with Natural Language
AAAI 2020
A Transductive Approach for Video Object Segmentation
CVPR 2020
Cream of the Crop: Distilling Prioritized Paths For One-Shot Neural Architecture Search
NIPS 2020
Deeper and Wider Siamese Networks for Real-Time Visual Tracking
CVPR 2019
Illumination Estimation Based on Bilayer Sparse Coding
CVPR 2013