Huchuan Lu
166 papers · 2013–2026 · 11 top CS/AI conferences
Achievements
🐝 Cross-Pollinator (9)
🏃 Academic Marathon (12)
🌍 Conference Polyglot (11)
🧭 Keyword Pioneer
🌈 Renaissance Researcher (7)
🌟 Keyword Trendsetter Combo (3)
🏠 Conference Loyalist (35)
🏆 Grand Slam
🔬 Deep Specialist (41)
👑 Triple Crown
🏆 Keyword Champion (13)
🤝 Dynamic Duo (40)
❓ The Questioner
⚡ Prolific Year (18)
🗃️ Keyword Collector (556)
💎 Century Club (163)
🔥 Unstoppable (13)
📈 Trend Setter
🚀 Conference Pioneer
Conferences
CVPR (69)
ICCV (35)
ECCV (20)
AAAI (19)
ICLR (5)
NIPS (5)
WACV (4)
IJCAI (3)
ACL (2)
ICML (2)
MICCAI (2)
Keywords
salient object detection (21)
convolutional neural network (19)
object tracking (18)
semantic segmentation (18)
visual tracking (15)
saliency detection (14)
image segmentation (13)
attention mechanism (12)
object detection (11)
neural network (10)
multi-modal learning (10)
feature fusion (8)
multimodal learning (8)
feature extraction (8)
depth estimation (7)
feature learning (7)
person re-identification (7)
visual object tracking (6)
image restoration (5)
self-supervised learning (5)
Papers
X-ReID: Multi-granularity Information Interaction for Video-Based Visible-Infrared Person Re-Identification
AAAI 2026
CADTrack: Learning Contextual Aggregation with Deformable Alignment for Robust RGBT Tracking
AAAI 2026
SAM3-I: Segment Anything with Instructions
ACL 2026
IDEA: Inverted Text with Cooperative Deformable Aggregation for Multi-modal Object Re-Identification
CVPR 2025
Streaming Video Understanding and Multi-round Interaction with Memory-enhanced Knowledge
ICLR 2025
Learning Spatial-Semantic Features for Robust Video Object Segmentation
ICLR 2025
Autoregressive Video Generation without Vector Quantization
ICLR 2025
High-Precision Dichotomous Image Segmentation via Probing Diffusion Capacity
ICLR 2025
TrackDiffusion: Tracklet-Conditioned Video Generation via Diffusion Models
WACV 2025
CAT: A Unified Click-and-Track Framework for Realistic Tracking
ICCV 2025
EVEv2: Improved Baselines for Encoder-Free Vision-Language Models
ICCV 2025
VLIPP: Towards Physically Plausible Video Generation with Vision and Language Informed Physical Prior
ICCV 2025
CCL-LGS: Contrastive Codebook Learning for 3D Language Gaussian Splatting
ICCV 2025
DefMamba: Deformable Visual State Space Model
CVPR 2025
SUTrack: Towards Simple and Unified Single Object Tracking
AAAI 2025
MambaPro: Multi-Modal Object Re-identification with Mamba Aggregation and Synergistic Prompt
AAAI 2025
CLIMB-ReID: A Hybrid CLIP-Mamba Framework for Person Re-Identification
AAAI 2025
Bootstraping Clustering of Gaussians for View-consistent 3D Scene Understanding
AAAI 2025
Two-stream Beats One-stream: Asymmetric Siamese Network for Efficient Visual Tracking
AAAI 2025
The Devil is in Temporal Token: High Quality Video Reasoning Segmentation
CVPR 2025
Automated Evaluation of Large Vision-Language Models on Self-Driving Corner Cases
WACV 2025
ReNeg: Learning Negative Embedding with Reward Guidance
CVPR 2025
Mono2Stereo: A Benchmark and Empirical Study for Stereo Conversion
CVPR 2025
UniSegDiff: Boosting Unified Lesion Segmentation via a Staged Diffusion Model
MICCAI 2025
Towards Real-Time Open-Vocabulary Video Instance Segmentation
WACV 2025
Efficient Motion Prompt Learning for Robust Visual Tracking
ICML 2025
Magic Tokens: Select Diverse Tokens for Multi-modal Object Re-Identification
CVPR 2024
LLMs Can Evolve Continually on Modality for $\mathbb{X}$-Modal Reasoning
NIPS 2024
TOP-ReID: Multi-Spectral Object Re-identification with Token Permutation
AAAI 2024
Hybrid-SORT: Weak Cues Matter for Online Multi-Object Tracking
AAAI 2024
TF-CLIP: Learning Text-Free CLIP for Video-Based Person Re-identification
AAAI 2024
DME: Unveiling the Bias for Better Generalized Monocular Depth Estimation
AAAI 2024
Large Occluded Human Image Completion via Image-Prior Cooperating
AAAI 2024
PsySafe: A Comprehensive Framework for Psychological-based Attack, Defense, and Evaluation of Multi-agent System Safety
ACL 2024
Leveraging the Power of Data Augmentation for Transformer-Based Tracking
WACV 2024
CriDiff: Criss-cross Injection Diffusion Framework via Generative Pre-train for Prostate Segmentation
MICCAI 2024
MAS-SAM: Segment Any Marine Animal with Aggregated Features
IJCAI 2024
Spider: A Unified Framework for Context-dependent Concept Segmentation
ICML 2024
PixArt-$\alpha$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
ICLR 2024
Spatial-Temporal Multi-level Association for Video Object Segmentation
ECCV 2024
Open-Vocabulary Camouflaged Object Segmentation
ECCV 2024
SHERL: Synthesizing High Accuracy and Efficient Memory for Resource-Limited Transfer Learning
ECCV 2024
PixArt-Sigma: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
ECCV 2024
EvSign: Sign Language Recognition and Translation with Streaming Events
ECCV 2024
Multi-modal Instruction Tuned LLMs with Fine-grained Visual Perception
CVPR 2024
UniPT: Universal Parallel Tuning for Transfer Learning with Efficient Parameter and Memory
CVPR 2024
Boosting Continual Learning of Vision-Language Models via Mixture-of-Experts Adapters
CVPR 2024
Multi-view Aggregation Network for Dichotomous Image Segmentation
CVPR 2024
Towards Automatic Power Battery Detection: New Challenge Benchmark Dataset and Baseline
CVPR 2024
Fantastic Animals and Where to Find Them: Segment Any Marine Animal with Dual SAM
CVPR 2024
Unveiling Encoder-Free Vision-Language Models
NIPS 2024
Representation Learning for Visual Object Tracking by Masked Appearance Transfer
CVPR 2023
Dual Memory Aggregation Network for Event-Based Object Detection with Learnable Representation
AAAI 2023
Universal Instance Perception As Object Discovery and Retrieval
CVPR 2023
MetaFusion: Infrared and Visible Image Fusion via Meta-Feature Embedding From Object Detection
CVPR 2023
Compression-Aware Video Super-Resolution
CVPR 2023
CiteTracker: Correlating Image and Text for Visual Tracking
ICCV 2023
Exploring Transformers for Open-world Instance Segmentation
ICCV 2023
Isomer: Isomerous Transformer for Zero-shot Video Object Segmentation
ICCV 2023
Adaptive Illumination Mapping for Shadow Detection in Raw Images
ICCV 2023
Segment Every Reference Object in Spatial and Temporal Spaces
ICCV 2023
Exploring Lightweight Hierarchical Vision Transformers for Efficient Visual Tracking
ICCV 2023
MetaBEV: Solving Sensor Failures for 3D Detection and Map Segmentation
ICCV 2023
Towards Deeply Unified Depth-aware Panoptic Segmentation with Bi-directional Guidance Learning
ICCV 2023
GM-NeRF: Learning Generalizable Model-Based Neural Radiance Fields From Multi-View Images
CVPR 2023
Visual Prompt Multi-Modal Tracking
CVPR 2023
ARKitTrack: A New Diverse Dataset for Tracking Using Mobile RGB-D Data
CVPR 2023
SeqTrack: Sequence to Sequence Learning for Visual Object Tracking
CVPR 2023
Adaptive Co-Teaching for Unsupervised Monocular Depth Estimation
ECCV 2022
Semi-Supervised Video Salient Object Detection Based on Uncertainty-Guided Pseudo Labels
NIPS 2022
You Only Infer Once: Cross-Modal Meta-Transfer for Referring Video Object Segmentation
AAAI 2022
Self-Supervised Pretraining for RGB-D Salient Object Detection
AAAI 2022
Multi-Source Uncertainty Mining for Deep Unsupervised Saliency Detection
CVPR 2022
Multi-Object Tracking Meets Moving UAV
CVPR 2022
TimeReplayer: Unlocking the Potential of Event Cameras for Video Interpolation
CVPR 2022
Look Back and Forth: Video Super-Resolution With Explicit Temporal Difference Modeling
CVPR 2022
Visible-Thermal UAV Tracking: A Large-Scale Benchmark and New Baseline
CVPR 2022
Zoom in and Out: A Mixed-Scale Triplet Network for Camouflaged Object Detection
CVPR 2022
Towards Grand Unification of Object Tracking
ECCV 2022
MVSalNet: Multi-View Augmentation for RGB-D Salient Object Detection
ECCV 2022
United Defocus Blur Detection and Deblurring via Adversarial Promoting Learning
ECCV 2022
Learning Spatio-Temporal Transformer for Visual Tracking
ICCV 2021
Learning Motion-Appearance Co-Attention for Zero-Shot Video Object Segmentation
ICCV 2021
Encoder Fusion Network With Co-Attention Embedding for Referring Image Segmentation
CVPR 2021
Watching You: Global-Guided Reciprocal Learning for Video-Based Person Re-Identification
CVPR 2021
Neighbor2Neighbor: Self-Supervised Denoising From Single Noisy Images
CVPR 2021
Multi-Target Domain Adaptation With Collaborative Consistency Learning
CVPR 2021
Can Scale-Consistent Monocular Depth Be Learned in a Self-Supervised Scale-Invariant Manner?
ICCV 2021
MFNet: Multi-Filter Directive Network for Weakly Supervised Salient Object Detection
ICCV 2021
CR-Fill: Generative Image Inpainting With Auxiliary Contextual Reconstruction
ICCV 2021
Alpha-Refine: Boosting Tracking Performance by Precise Bounding Box Estimation
CVPR 2021
LightTrack: Finding Lightweight Neural Networks for Object Tracking via One-Shot Architecture Search
CVPR 2021
Joint Semantic Mining for Weakly Supervised RGB-D Salient Object Detection
NIPS 2021
Transformer Tracking
CVPR 2021
Calibrated RGB-D Salient Object Detection
CVPR 2021
Similarity Reasoning and Filtration for Image-Text Matching
AAAI 2021
Self-Generated Defocus Blur Detection via Dual Adversarial Discriminators
CVPR 2021
Video Annotation for Visual Tracking via Selection and Refinement
ICCV 2021
Dynamic Context-Sensitive Filtering Network for Video Salient Object Detection
ICCV 2021
High-Resolution Image Inpainting with Iterative Confidence Feedback and Guided Upsampling
ECCV 2020
A Single Stream Network for Robust and Real-time RGB-D Salient Object Detection
ECCV 2020
Hierarchical Dynamic Filtering Network for RGB-D Salient Object Detection
ECCV 2020
Pose-Guided Visible Part Matching for Occluded Person ReID
CVPR 2020
Multi-Scale Interactive Network for Salient Object Detection
CVPR 2020
High-Performance Long-Term Tracking With Meta-Updater
CVPR 2020
A2dele: Adaptive and Attentive Depth Distiller for Efficient RGB-D Salient Object Detection
CVPR 2020
SDC-Depth: Semantic Divide-and-Conquer Network for Monocular Depth Estimation
CVPR 2020
Asymmetric Two-Stream Architecture for Accurate RGB-D Saliency Detection
ECCV 2020
Bi-Directional Relationship Inferring Network for Referring Image Segmentation
CVPR 2020
Exploit and Replace: An Asymmetrical Two-Stream Architecture for Versatile Light Field Saliency Detection
AAAI 2020
Cooling-Shrinking Attack: Blinding the Tracker With Imperceptible Noises
CVPR 2020
Multi-Type Self-Attention Guided Degraded Saliency Detection
AAAI 2020
Select, Supplement and Focus for RGB-D Saliency Detection
CVPR 2020
Suppress and Balance: A Simple Gated Network for Salient Object Detection
ECCV 2020
CLIFFNet for Monocular Depth Estimation with Hierarchical Embedding Loss
ECCV 2020
Unsupervised Video Object Segmentation with Joint Hotspot Tracking
ECCV 2020
Accurate RGB-D Salient Object Detection via Collaborative Learning
ECCV 2020
Deep Light-field-driven Saliency Detection from a Single View
IJCAI 2019
Memory-oriented Decoder for Light Field Salient Object Detection
NIPS 2019
Deep Embedding Features for Salient Object Detection
AAAI 2019
Enhancing Diversity of Defocus Blur Detectors via Cross-Ensemble Network
CVPR 2019
A Mutual Learning Method for Salient Object Detection With Intertwined Multi-Supervision
CVPR 2019
'Skimming-Perusal' Tracking: A Framework for Real-Time and Robust Long-Term Tracking
ICCV 2019
Fast Video Object Segmentation via Dynamic Targeting Network
ICCV 2019
Deep Reinforcement Active Learning for Human-in-the-Loop Person Re-Identification
ICCV 2019
GradNet: Gradient-Guided Network for Visual Object Tracking
ICCV 2019
Joint Learning of Saliency Detection and Weakly Supervised Semantic Segmentation
ICCV 2019
Towards High-Resolution Salient Object Detection
ICCV 2019
Depth-Induced Multi-Scale Recurrent Attention Network for Saliency Detection
ICCV 2019
Cascaded Context Pyramid for Full-Resolution 3D Semantic Scene Completion
ICCV 2019
Deep Learning for Light Field Saliency Detection
ICCV 2019
Multi-Source Weak Supervision for Saliency Detection
CVPR 2019
CapSal: Leveraging Captioning to Boost Semantics for Salient Object Detection
CVPR 2019
ROI Pooled Correlation Filters for Visual Tracking
CVPR 2019
Visual Tracking via Adaptive Spatially-Regularized Correlation Filters
CVPR 2019
Attentive Feedback Network for Boundary-Aware Salient Object Detection
CVPR 2019
Deep Cross-Modal Projection Learning for Image-Text Matching
ECCV 2018
Real-time 'Actor-Critic' Tracking
ECCV 2018
Structured Siamese Network for Real-Time Visual Tracking
ECCV 2018
Salient Object Detection by Lossless Feature Reflection
IJCAI 2018
Learning Spatial-Aware Regressions for Visual Tracking
CVPR 2018
Deep Mutual Learning
CVPR 2018
Detect Globally, Refine Locally: A Novel Approach to Saliency Detection
CVPR 2018
Defocus Blur Detection via Multi-Stream Bottom-Top-Bottom Fully Convolutional Network
CVPR 2018
Learning Dual Convolutional Neural Networks for Low-Level Vision
CVPR 2018
A Bi-Directional Message Passing Model for Salient Object Detection
CVPR 2018
Learning to Promote Saliency Detectors
CVPR 2018
Progressive Attention Guided Recurrent Network for Salient Object Detection
CVPR 2018
Correlation Tracking via Joint Discrimination and Reliability Learning
CVPR 2018
A Stagewise Refinement Model for Detecting Salient Objects in Images
ICCV 2017
Learning Uncertain Convolutional Features for Accurate Saliency Detection
ICCV 2017
Amulet: Aggregating Multi-Level Convolutional Features for Salient Object Detection
ICCV 2017
Learning to Detect Salient Objects With Image-Level Supervision
CVPR 2017
Stepwise Metric Promotion for Unsupervised Video Person Re-Identification
ICCV 2017
STCT: Sequentially Training Convolutional Networks for Visual Tracking
CVPR 2016
Sample-Specific SVM Learning for Person Re-Identification
CVPR 2016
Salient Object Detection via Bootstrap Learning
CVPR 2015
Saliency Detection via Cellular Automata
CVPR 2015
Deep Networks for Saliency Detection via Local Estimation and Global Search
CVPR 2015
Visual Tracking With Fully Convolutional Networks
ICCV 2015
Subspace Clustering by Mixture of Gaussian Regression
CVPR 2015
Visual Tracking via Probability Continuous Outlier Model
CVPR 2014
Saliency Detection via Dense and Sparse Reconstruction
ICCV 2013
Least Soft-Threshold Squares Tracking
CVPR 2013
Saliency Detection via Graph-Based Manifold Ranking
CVPR 2013
Saliency Detection via Absorbing Markov Chain
ICCV 2013