Weiming Hu
74 papers · 2013–2026 · 11 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+15 more ↓ Show less ↑
π Conference Polyglot (11) π£ Hot Topic Early Bird π§ Keyword Pioneer π Interdisciplinary Bridge π Academic Marathon (12)
π
Academic Marathon
(12)
π§
Keyword Pioneer
π£
Hot Topic Early Bird
π
Conference Loyalist
(27)
π€
Dynamic Duo
(39)
π
Grand Slam
π¬
Deep Specialist
(14)
π§¬
Topic Evolution
π₯
Unstoppable
(13)
π
Conference Pioneer
β‘
Prolific Year
(13)
β
The Questioner
ποΈ
Keyword Collector
(308)
π
Century Club
(71)
π
Trend Setter
Conferences
CVPR (27)
ICCV (11)
AAAI (9)
ECCV (9)
IJCAI (5)
NIPS (4)
EMNLP (3)
COLING (2)
ICLR (2)
ACL (1)
ICML (1)
Top co-authors
Keywords
object detection
(9)
visual tracking
(6)
object tracking
(6)
action recognition
(5)
knowledge distillation
(5)
video captioning
(4)
image restoration
(4)
siamese network
(4)
self-supervised learning
(3)
representation learning
(3)
graph neural network
(3)
retrieval-augmented generation
(3)
multimodal large language model
(3)
multi-task learning
(3)
power iteration
(3)
neural architecture search
(3)
contrastive learning
(3)
feature learning
(3)
semi-supervised learning
(2)
visual object tracking
(2)
Papers
Integrating Diverse Assignment Strategies into DETRs
AAAI 2026
HDGS: Hierarchical Dynamic Gaussian Splatting for Urban Driving Scenes
AAAI 2026
MMhops-R1: Multimodal Multi-hop Reasoning
AAAI 2026
Mitigating Hallucinations in Large Vision-Language Models by Self-Injecting Hallucinations
EMNLP 2025
D-RAG: Differentiable Retrieval-Augmented Generation for Knowledge Graph Question Answering
EMNLP 2025
Visual-Instructed Degradation Diffusion for All-in-One Image Restoration
CVPR 2025
Reversing Flow for Image Restoration
CVPR 2025
DeepTAGE: Deep Temporal-Aligned Gradient Enhancement for Optimizing Spiking Neural Networks
ICLR 2025
Towards More Discriminative Feature Learning in SNNs with Temporal-Self-Erasing Supervision
AAAI 2025
LightBSR: Towards Lightweight Blind Super-Resolution via Discriminative Implicit Degradation Representation Learning
ICCV 2025
SSTrack: Sample-interval Scheduling for Lightweight Visual Object Tracking
IJCAI 2025
Height-Fidelity Dense Global Fusion for Multi-modal 3D Object Detection
ICCV 2025
VisionMath: Vision-Form Mathematical Problem-Solving
ICCV 2025
PromptIQA: Boosting the Performance and Generalization for No-Reference Image Quality Assessment via Prompts
ECCV 2024
VQ-Map: Bird's-Eye-View Map Layout Estimation in Tokenized Discrete Space via Vector Quantization
NIPS 2024
Animate3D: Animating Any 3D Model with Multi-view Video Diffusion
NIPS 2024
Set Prediction Guided by Semantic Concepts for Diverse Video Captioning
AAAI 2024
Self-Training with Pseudo-Label Scorer for Aspect Sentiment Quad Prediction
ACL 2024
Semantics-enhanced Cross-modal Masked Image Modeling for Vision-Language Pre-training
COLING 2024
Unifying Latent and Lexicon Representations for Effective Video-Text Retrieval
COLING 2024
A-Teacher: Asymmetric Network for 3D Semi-Supervised Object Detection
CVPR 2024
How to Make Cross Encoder a Good Teacher for Efficient Image-Text Retrieval?
CVPR 2024
STAG4D: Spatial-Temporal Anchored Generative 4D Gaussians
ECCV 2024
EA-VTR: Event-Aware Video-Text Retrieval
ECCV 2024
MIBench: Evaluating Multimodal Large Language Models over Multiple Images
EMNLP 2024
Consistent4D: Consistent 360Β° Dynamic Object Generation from Monocular Video
ICLR 2024
ZoomTrack: Target-aware Non-uniform Resizing for Efficient Visual Tracking
NIPS 2023
Exploiting Contextual Objects and Relations for 3D Visual Grounding
NIPS 2023
AUNet: Learning Relations Between Action Units for Face Forgery Detection
CVPR 2023
Learning To Exploit the Sequence-Specific Prior Knowledge for Image Processing Pipelines Optimization
CVPR 2023
ViLEM: Visual-Language Error Modeling for Image-Text Retrieval
CVPR 2023
A Closer Look at Self-Supervised Lightweight Vision Transformers
ICML 2023
PolarFormer: Multi-Camera 3D Object Detection with Polar Transformer
AAAI 2023
Order-Prompted Tag Sequence Generation for Video Tagging
ICCV 2023
Learning Target-aware Representation for Visual Tracking via Informative Interactions
IJCAI 2022
Attention-Aware Learning for Hyperparameter Prediction in Image Processing Pipelines
ECCV 2022
One More Check: Making βFake Backgroundβ Be Tracked Again
AAAI 2022
Open-Vocabulary One-Stage Detection With Hierarchical Visual-Language Knowledge Distillation
CVPR 2022
Improving Visual Grounding With Visual-Linguistic Verification and Iterative Reasoning
CVPR 2022
EMScore: Evaluating Video Captioning via Coarse-Grained and Fine-Grained Embedding Matching
CVPR 2022
Long-Short Term Cross-Transformer in Compressed Domain for Few-Shot Video Classification
IJCAI 2022
Channel-Wise Topology Refinement Graph Convolution for Skeleton-Based Action Recognition
ICCV 2021
Open-Book Video Captioning With Retrieve-Copy-Generate Network
CVPR 2021
Learn To Match: Automatic Matching Network Design for Visual Tracking
ICCV 2021
DPFPS: Dynamic and Progressive Filter Pruning for Compressing Convolutional Neural Networks from Scratch
AAAI 2021
Differentiable Convolution Search for Point Cloud Processing
ICCV 2021
Object Relational Graph With Teacher-Recommended Learning for Video Captioning
CVPR 2020
Recursive Least-Squares Estimator-Aided Online Learning for Visual Tracking
CVPR 2020
RDSNet: A New Deep Architecture forReciprocal Object Detection and Instance Segmentation
AAAI 2020
Ocean: Object-aware Anchor-free Tracking
ECCV 2020
Learning to Predict Salient Faces: A Novel Visual-Audio Saliency Model
ECCV 2020
Knowledge Distillation via Instance Relationship Graph
CVPR 2019
Anchor Diffusion for Unsupervised Video Object Segmentation
ICCV 2019
Fast Online Object Tracking and Segmentation: A Unifying Approach
CVPR 2019
Deep Cost-Sensitive and Order-Preserving Feature Learning for Cross-Population Age Estimation
CVPR 2018
Do not Lose the Details: Reinforced Representation Learning for High Performance Visual Tracking
IJCAI 2018
Learning Attentions: Residual Attentional Siamese Network for High Performance Online Visual Tracking
CVPR 2018
Visual Tracking via Spatially Aligned Correlation Filters Network
ECCV 2018
Interaction-aware Spatio-temporal Pyramid Attention Networks for Action Classification
ECCV 2018
Distractor-aware Siamese Networks for Visual Object Tracking
ECCV 2018
Spatio-Temporal Self-Organizing Map Deep Network for Dynamic Object Detection From Videos
CVPR 2017
Tensor Power Iteration for Multi-Graph Matching
CVPR 2016
Optimizing Locally Linear Classifiers with Supervised Anchor Point Learning
IJCAI 2015
Multi-Feature Max-Margin Hierarchical Bayesian Model for Action Recognition
CVPR 2015
Local Subspace Collaborative Tracking
ICCV 2015
Human Action Recognition Based on Context-Dependent Graph Kernels
CVPR 2014
Towards Multi-view and Partially-Occluded Face Alignment
CVPR 2014
Multi-target Tracking with Motion Context in Tensor Power Iteration
CVPR 2014
3D R Transform on Spatio-temporal Interest Points for Action Recognition
CVPR 2013
Illumination Estimation Based on Bilayer Sparse Coding
CVPR 2013
Multi-target Tracking by Rank-1 Tensor Approximation
CVPR 2013
Multi-task Sparse Learning with Beta Process Prior for Action Recognition
CVPR 2013
Robust Object Tracking with Online Multi-lifespan Dictionary Learning
ICCV 2013
Discriminant Tracking Using Tensor Representation with Semi-supervised Improvement
ICCV 2013