Guanbin Li
120 papers · 2015–2026 · 11 conferences · across top CS/AI conferences
Achievements
Conference Polyglot (11)
Keyword Pioneer
Interdisciplinary Bridge
Taxonomy Completionist (11)
Academic Marathon (10)
Cross-Pollinator (9)
Renaissance Researcher (12)
Conference Loyalist (31)
Keyword Champion (3)
Grand Slam
Deep Specialist (23)
Topic Pioneer
Dynamic Duo (47)
Century Club (119)
Unstoppable (11)
Prolific Year (13)
Conference Pioneer
Keyword Collector (508)
Trend Setter
Conferences
CVPR (41)
ICCV (31)
AAAI (20)
ECCV (9)
IJCAI (8)
MICCAI (3)
NIPS (3)
ICML (2)
COLING (1)
ICLR (1)
WACV (1)
Keywords
semantic segmentation (11)
semi-supervised learning (11)
multimodal learning (9)
domain adaptation (9)
graph neural network (7)
large language model (7)
convolutional neural network (6)
vision-language model (6)
transfer learning (6)
contrastive learning (5)
scene understanding (5)
semi-supervised object detection (5)
attention mechanism (5)
pseudo label (5)
pseudo labeling (4)
knowledge distillation (4)
visual grounding (4)
unsupervised learning (4)
point cloud (4)
image segmentation (4)
Papers
Mobile-Agent-RAG: Driving Smart Multi-Agent Coordination with Contextual Knowledge Empowerment for Long-Horizon Mobile Automation
AAAI 2026
Pseudo-Label Reconstruction for Partial Multi-Label Learning
IJCAI 2025
Screening, Rectifying, and Re-Screening: A Unified Framework for Tuning Vision-Language Models with Noisy Labels
IJCAI 2025
Bridging Knowledge Gap Between Image Inpainting and Large-Area Visible Watermark Removal
AAAI 2025
Hierarchically Controlled Deformable 3D Gaussians for Talking Head Synthesis
AAAI 2025
Beyond the Destination: A Novel Benchmark for Exploration-Aware Embodied Question Answering
ICCV 2025
DreamFuse: Adaptive Image Fusion with Diffusion Transformer
ICCV 2025
LaneDiffusion: Improving Centerline Graph Learning via Prior Injected BEV Feature Generation
ICCV 2025
VLDrive: Vision-Augmented Lightweight MLLMs for Efficient Language-grounded Autonomous Driving
ICCV 2025
DeepShield: Fortifying Deepfake Video Detection with Local and Global Forgery Analysis
ICCV 2025
AdaDrive: Self-Adaptive Slow-Fast System for Language-Grounded Autonomous Driving
ICCV 2025
Free-MoRef: Instantly Multiplexing Context Perception Capabilities of Video-MLLMs within Single Inference
ICCV 2025
GeoSplatting: Towards Geometry Guided Gaussian Splatting for Physically-based Inverse Rendering
ICCV 2025
DreamLayer: Simultaneous Multi-Layer Generation via Diffusion Model
ICCV 2025
FakeRadar: Probing Forgery Outliers to Detect Unknown Deepfake Videos
ICCV 2025
Sim-DETR: Unlock DETR for Temporal Sentence Grounding
ICCV 2025
Towards Long-Horizon Vision-Language Navigation: Platform, Benchmark and Method
CVPR 2025
VTON 360: High-Fidelity Virtual Try-On from Any Viewing Direction
CVPR 2025
DSPNet: Dual-vision Scene Perception for Robust 3D Question Answering
CVPR 2025
Rethinking Query-based Transformer for Continual Image Segmentation
CVPR 2025
Empowering Large Language Models with 3D Situation Awareness
CVPR 2025
PDC-Net: Pattern Divide-and-Conquer Network for Pelvic Radiation Injury Segmentation
MICCAI 2025
LLM-driven Multimodal and Multi-Identity Listening Head Generation
CVPR 2025
Pattern-Anchored Adaptive Prototype Learning for Gastroscopic Lesion Detection and Beyond
MICCAI 2025
DAGSM: Disentangled Avatar Generation with GS-enhanced Mesh
CVPR 2025
ReferSplat: Referring Segmentation in 3D Gaussian Splatting
ICML 2025
GlassWizard: Harvesting Diffusion Priors for Glass Surface Detection
ICCV 2025
AlignSAM: Aligning Segment Anything Model to Open Context via Reinforcement Learning
CVPR 2024
Decoupled Pseudo-labeling for Semi-Supervised Monocular 3D Object Detection
CVPR 2024
OVER-NAV: Elevating Iterative Vision-and-Language Navigation with Open-Vocabulary Detection and StructurEd Representation
CVPR 2024
NeRF-HuGS: Improved Neural Radiance Fields in Non-static Scenes Using Heuristics-Guided Segmentation
CVPR 2024
Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training
CVPR 2024
Learning Background Prompts to Discover Implicit Knowledge for Open Vocabulary Object Detection
CVPR 2024
MarvelOVD: Marrying Object Recognition and Vision-Language Models for Robust Open-Vocabulary Object Detection
ECCV 2024
Universal Semi-Supervised Model Adaptation via Collaborative Consistency Training
WACV 2024
VersVideo: Leveraging Enhanced Temporal Diffusion Models for Versatile Video Generation
ICLR 2024
Mask-Enhanced Segment Anything Model for Tumor Lesion Semantic Segmentation
MICCAI 2024
UniFL: Improve Latent Diffusion Model via Unified Feedback Learning
NIPS 2024
WhodunitBench: Evaluating Large Multimodal Agents via Murder Mystery Games
NIPS 2024
Variance-Insensitive and Target-Preserving Mask Refinement for Interactive Image Segmentation
AAAI 2024
UniCell: Universal Cell Nucleus Classification via Prompt Learning
AAAI 2024
Removing Interference and Recovering Content Imaginatively for Visible Watermark Removal
AAAI 2024
FedDiv: Collaborative Noise Filtering for Federated Learning with Noisy Labels
AAAI 2024
Cell Graph Transformer for Nuclei Classification
AAAI 2024
Open-Vocabulary Segmentation with Semantic-Assisted Calibration
CVPR 2024
Interactive 3D Object Detection with Prompts
ECCV 2024
MMAPS: End-to-End Multi-Grained Multi-Modal Attribute-Aware Product Summarization
COLING 2024
WildVidFit: Video Virtual Try-On in the Wild via Image-Based Controlled Diffusion Models
ECCV 2024
Improved Distribution Matching for Dataset Condensation
CVPR 2023
Adapting Object Size Variance and Class Imbalance for Semi-supervised Object Detection
AAAI 2023
De-biased Teacher: Rethinking IoU Matching for Semi-supervised Object Detection
AAAI 2023
Identity-Preserving Talking Face Generation With Landmark and Appearance Priors
CVPR 2023
Being Comes From Not-Being: Open-Vocabulary Text-to-Motion Generation With Wordless Training
CVPR 2023
Parametric Implicit Face Representation for Audio-Driven Facial Reenactment
CVPR 2023
SCoDA: Domain Adaptive Shape Completion for Real Scans
CVPR 2023
Semi-DETR: Semi-Supervised Object Detection With Detection Transformers
CVPR 2023
Divide and Adapt: Active Domain Adaptation via Customized Learning
CVPR 2023
Advancing Visual Grounding With Scene Knowledge: Benchmark and Method
CVPR 2023
Enhanced Soft Label for Semi-Supervised Semantic Segmentation
ICCV 2023
SkeletonMAE: Graph-based Masked Autoencoder for Skeleton Sequence Pre-training
ICCV 2023
Affine-Consistent Transformer for Multi-Class Cell Nuclei Detection
ICCV 2023
Bridging Vision and Language Encoders: Parameter-Efficient Tuning for Referring Image Segmentation
ICCV 2023
Gradient-based Sampling for Class Imbalanced Semi-supervised Object Detection
ICCV 2023
RankMatch: Fostering Confidence and Consistency in Learning with Noisy Labels
ICCV 2023
Towards Real-World Burst Image Super-Resolution: Benchmark and Method
ICCV 2023
Towards Unifying Medical Vision-and-Language Pre-Training via Soft Prompts
ICCV 2023
DenseLight: Efficient Control for Large-scale Traffic Signals with Dense Feedback
IJCAI 2023
Long-term Wind Power Forecasting with Hierarchical Spatial-Temporal Transformer
IJCAI 2023
Unsupervised Domain Adaptive Salient Object Detection through Uncertainty-Aware Pseudo-Label Learning
AAAI 2022
Double-Check Soft Teacher for Semi-Supervised Object Detection
IJCAI 2022
Divide and Contrast: Source-free Domain Adaptation via Adaptive Contrastive Learning
NIPS 2022
A Causal Inference Look at Unsupervised Video Anomaly Detection
AAAI 2022
X-Trans2Cap: Cross-Modal Knowledge Transfer Using Transformer for 3D Dense Captioning
CVPR 2022
Neighborhood Collective Estimation for Noisy Label Identification and Correction
ECCV 2022
Multi-level Consistency Learning for Semi-supervised Domain Adaptation
IJCAI 2022
Centrality and Consistency: Two-Stage Clean Samples Identification for Learning with Instance-Dependent Noisy Labels
ECCV 2022
A Causal Debiasing Framework for Unsupervised Salient Object Detection
AAAI 2022
Dual Adversarial Adaptation for Cross-Device Real-World Image Super-Resolution
CVPR 2022
Multi-Layer Networks for Ensemble Precipitation Forecasts Postprocessing
AAAI 2021
Collaborative Spatial-Temporal Modeling for Language-Queried Video Actor Segmentation
CVPR 2021
Bottom-Up Shift and Reasoning for Referring Image Segmentation
CVPR 2021
Weakly-Supervised Spatio-Temporal Anomaly Detection in Surveillance Video
IJCAI 2021
Towards Interpretable Deep Networks for Monocular Depth Estimation
ICCV 2021
LapsCore: Language-Guided Person Search via Color Reasoning
ICCV 2021
Trash To Treasure: Harvesting OOD Data With Cross-Modal Matching for Open-Set Semi-Supervised Learning
ICCV 2021
Scene-Intuitive Agent for Remote Embodied Visual Grounding
CVPR 2021
Cross-Modal Collaborative Representation Learning and a Large-Scale RGBT Benchmark for Crowd Counting
CVPR 2021
Cross-Domain Adaptive Clustering for Semi-Supervised Domain Adaptation
CVPR 2021
Collaborative Training between Region Proposal Localization and Classification for Domain Adaptive Object Detection
ECCV 2020
Graph-Structured Referring Expression Reasoning in the Wild
CVPR 2020
An Adversarial Perturbation Oriented Domain Adaptation Approach for Semantic Segmentation
AAAI 2020
Tree-Structured Policy Based Progressive Reinforcement Learning for Temporally Language Grounding in Video
AAAI 2020
Knowledge Graph Transfer Network for Few-Shot Recognition
AAAI 2020
Propagating Over Phrase Relations for One-Stage Visual Grounding
ECCV 2020
A Real-Time Cross-Modality Correlation Filtering Method for Referring Expression Comprehension
CVPR 2020
Referring Image Segmentation via Cross-Modal Progressive Comprehension
CVPR 2020
Peeking into occluded joints: A novel framework for crowd pose estimation
ECCV 2020
Linguistic Structure Guided Context Modeling for Referring Image Segmentation
ECCV 2020
ClusterNet: Deep Hierarchical Cluster Network With Rigorously Rotation-Invariant Representation for Point Cloud Analysis
CVPR 2019
Larger Norm More Transferable: An Adaptive Feature Norm Approach for Unsupervised Domain Adaptation
ICCV 2019
Crowd Counting With Deep Structured Scale Integration Network
ICCV 2019
Semi-Supervised Skin Detection by Network With Mutual Guidance
ICCV 2019
Fashion Retrieval via Graph Reasoning Networks on a Similarity Pyramid
ICCV 2019
Dynamic Graph Attention for Referring Expression Comprehension
ICCV 2019
Motion Guided Attention for Video Salient Object Detection
ICCV 2019
Semi-Supervised Video Salient Object Detection Using Pseudo-Labels
ICCV 2019
Non-Local Context Encoder: Robust Biomedical Image Segmentation against Adversarial Attacks
AAAI 2019
FRAME Revisited: An Interpretation View Based on Particle Evolution
AAAI 2019
Semantic Relationships Guided Representation Learning for Facial Action Unit Recognition
AAAI 2019
Cross-Modal Relationship Inference for Grounding Referring Expressions
CVPR 2019
Multivariate-Information Adversarial Ensemble for Scalable Joint Distribution Matching
ICML 2019
Visual Question Reasoning on General Dependency Tree
CVPR 2018
Flow Guided Recurrent Neural Encoder for Video Salient Object Detection
CVPR 2018
Interpretable Video Captioning via Trajectory Structured Localization
CVPR 2018
Crowd Counting using Deep Recurrent Spatial-Aware Network
IJCAI 2018
Multi-Label Image Recognition by Recurrently Discovering Attentional Regions
ICCV 2017
Attention-Aware Face Hallucination via Deep Reinforcement Learning
CVPR 2017
Instance-Level Salient Object Segmentation
CVPR 2017
Deep Contrast Learning for Salient Object Detection
CVPR 2016
Visual Saliency Based on Multiscale Deep Features
CVPR 2015