Guanbin Li

120 papers · 2015–2026 · 11 top CS/AI conferences

Achievements


🌍 Conference Polyglot (11) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (11) 🏃 Academic Marathon (10)

🐝 Cross-Pollinator (9) 🌈 Renaissance Researcher (12) 🏠 Conference Loyalist (31) 🏆 Keyword Champion (3) 🏆 Grand Slam 🔬 Deep Specialist (23) 🌱 Topic Pioneer 🤝 Dynamic Duo (47) 💎 Century Club (119) 🔥 Unstoppable (11) ⚡ Prolific Year (13) 🚀 Conference Pioneer 🗃️ Keyword Collector (508) 📈 Trend Setter

Conferences

CVPR (41) ICCV (31) AAAI (20) ECCV (9) IJCAI (8) MICCAI (3) NIPS (3) ICML (2) COLING (1) ICLR (1) WACV (1)

Top co-authors

Liang Lin (47) Yizhou Yu (19) Chaowei Fang (11) Weikai Chen (10) Xiang Wan (10) Sibei Yang (10) Yipeng Qin (9) Jichang Li (9) Si Liu (8) Wei Zhang (8)

Keywords

semantic segmentation (11) semi-supervised learning (11) multimodal learning (9) domain adaptation (9) graph neural network (7) large language model (7) convolutional neural network (6) vision-language model (6) transfer learning (6) contrastive learning (5) scene understanding (5) semi-supervised object detection (5) attention mechanism (5) pseudo label (5) pseudo labeling (4) knowledge distillation (4) visual grounding (4) unsupervised learning (4) point cloud (4) image segmentation (4)

Papers

Mobile-Agent-RAG: Driving Smart Multi-Agent Coordination with Contextual Knowledge Empowerment for Long-Horizon Mobile Automation AAAI 2026
Pseudo-Label Reconstruction for Partial Multi-Label Learning IJCAI 2025
Screening, Rectifying, and Re-Screening: A Unified Framework for Tuning Vision-Language Models with Noisy Labels IJCAI 2025
Bridging Knowledge Gap Between Image Inpainting and Large-Area Visible Watermark Removal AAAI 2025
Hierarchically Controlled Deformable 3D Gaussians for Talking Head Synthesis AAAI 2025
Beyond the Destination: A Novel Benchmark for Exploration-Aware Embodied Question Answering ICCV 2025
DreamFuse: Adaptive Image Fusion with Diffusion Transformer ICCV 2025
LaneDiffusion: Improving Centerline Graph Learning via Prior Injected BEV Feature Generation ICCV 2025
VLDrive: Vision-Augmented Lightweight MLLMs for Efficient Language-grounded Autonomous Driving ICCV 2025
DeepShield: Fortifying Deepfake Video Detection with Local and Global Forgery Analysis ICCV 2025
AdaDrive: Self-Adaptive Slow-Fast System for Language-Grounded Autonomous Driving ICCV 2025
Free-MoRef: Instantly Multiplexing Context Perception Capabilities of Video-MLLMs within Single Inference ICCV 2025
GeoSplatting: Towards Geometry Guided Gaussian Splatting for Physically-based Inverse Rendering ICCV 2025
DreamLayer: Simultaneous Multi-Layer Generation via Diffusion Model ICCV 2025
FakeRadar: Probing Forgery Outliers to Detect Unknown Deepfake Videos ICCV 2025
Sim-DETR: Unlock DETR for Temporal Sentence Grounding ICCV 2025
Towards Long-Horizon Vision-Language Navigation: Platform, Benchmark and Method CVPR 2025
VTON 360: High-Fidelity Virtual Try-On from Any Viewing Direction CVPR 2025
DSPNet: Dual-vision Scene Perception for Robust 3D Question Answering CVPR 2025
Rethinking Query-based Transformer for Continual Image Segmentation CVPR 2025
Empowering Large Language Models with 3D Situation Awareness CVPR 2025
PDC-Net: Pattern Divide-and-Conquer Network for Pelvic Radiation Injury Segmentation MICCAI 2025
LLM-driven Multimodal and Multi-Identity Listening Head Generation CVPR 2025
Pattern-Anchored Adaptive Prototype Learning for Gastroscopic Lesion Detection and Beyond MICCAI 2025
DAGSM: Disentangled Avatar Generation with GS-enhanced Mesh CVPR 2025
ReferSplat: Referring Segmentation in 3D Gaussian Splatting ICML 2025
GlassWizard: Harvesting Diffusion Priors for Glass Surface Detection ICCV 2025
AlignSAM: Aligning Segment Anything Model to Open Context via Reinforcement Learning CVPR 2024
Decoupled Pseudo-labeling for Semi-Supervised Monocular 3D Object Detection CVPR 2024
OVER-NAV: Elevating Iterative Vision-and-Language Navigation with Open-Vocabulary Detection and StructurEd Representation CVPR 2024
NeRF-HuGS: Improved Neural Radiance Fields in Non-static Scenes Using Heuristics-Guided Segmentation CVPR 2024
Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training CVPR 2024
Learning Background Prompts to Discover Implicit Knowledge for Open Vocabulary Object Detection CVPR 2024
MarvelOVD: Marrying Object Recognition and Vision-Language Models for Robust Open-Vocabulary Object Detection ECCV 2024
Universal Semi-Supervised Model Adaptation via Collaborative Consistency Training WACV 2024
VersVideo: Leveraging Enhanced Temporal Diffusion Models for Versatile Video Generation ICLR 2024
Mask-Enhanced Segment Anything Model for Tumor Lesion Semantic Segmentation MICCAI 2024
UniFL: Improve Latent Diffusion Model via Unified Feedback Learning NIPS 2024
WhodunitBench: Evaluating Large Multimodal Agents via Murder Mystery Games NIPS 2024
Variance-Insensitive and Target-Preserving Mask Refinement for Interactive Image Segmentation AAAI 2024
UniCell: Universal Cell Nucleus Classification via Prompt Learning AAAI 2024
Removing Interference and Recovering Content Imaginatively for Visible Watermark Removal AAAI 2024
FedDiv: Collaborative Noise Filtering for Federated Learning with Noisy Labels AAAI 2024
Cell Graph Transformer for Nuclei Classification AAAI 2024
Open-Vocabulary Segmentation with Semantic-Assisted Calibration CVPR 2024
Interactive 3D Object Detection with Prompts ECCV 2024
MMAPS: End-to-End Multi-Grained Multi-Modal Attribute-Aware Product Summarization COLING 2024
WildVidFit: Video Virtual Try-On in the Wild via Image-Based Controlled Diffusion Models ECCV 2024
Improved Distribution Matching for Dataset Condensation CVPR 2023
Adapting Object Size Variance and Class Imbalance for Semi-supervised Object Detection AAAI 2023
De-biased Teacher: Rethinking IoU Matching for Semi-supervised Object Detection AAAI 2023
Identity-Preserving Talking Face Generation With Landmark and Appearance Priors CVPR 2023
Being Comes From Not-Being: Open-Vocabulary Text-to-Motion Generation With Wordless Training CVPR 2023
Parametric Implicit Face Representation for Audio-Driven Facial Reenactment CVPR 2023
SCoDA: Domain Adaptive Shape Completion for Real Scans CVPR 2023
Semi-DETR: Semi-Supervised Object Detection With Detection Transformers CVPR 2023
Divide and Adapt: Active Domain Adaptation via Customized Learning CVPR 2023
Advancing Visual Grounding With Scene Knowledge: Benchmark and Method CVPR 2023
Enhanced Soft Label for Semi-Supervised Semantic Segmentation ICCV 2023
SkeletonMAE: Graph-based Masked Autoencoder for Skeleton Sequence Pre-training ICCV 2023
Affine-Consistent Transformer for Multi-Class Cell Nuclei Detection ICCV 2023
Bridging Vision and Language Encoders: Parameter-Efficient Tuning for Referring Image Segmentation ICCV 2023
Gradient-based Sampling for Class Imbalanced Semi-supervised Object Detection ICCV 2023
RankMatch: Fostering Confidence and Consistency in Learning with Noisy Labels ICCV 2023
Towards Real-World Burst Image Super-Resolution: Benchmark and Method ICCV 2023
Towards Unifying Medical Vision-and-Language Pre-Training via Soft Prompts ICCV 2023
DenseLight: Efficient Control for Large-scale Traffic Signals with Dense Feedback IJCAI 2023
Long-term Wind Power Forecasting with Hierarchical Spatial-Temporal Transformer IJCAI 2023
Unsupervised Domain Adaptive Salient Object Detection through Uncertainty-Aware Pseudo-Label Learning AAAI 2022
Double-Check Soft Teacher for Semi-Supervised Object Detection IJCAI 2022
Divide and Contrast: Source-free Domain Adaptation via Adaptive Contrastive Learning NIPS 2022
A Causal Inference Look at Unsupervised Video Anomaly Detection AAAI 2022
X-Trans2Cap: Cross-Modal Knowledge Transfer Using Transformer for 3D Dense Captioning CVPR 2022
Neighborhood Collective Estimation for Noisy Label Identification and Correction ECCV 2022
Multi-level Consistency Learning for Semi-supervised Domain Adaptation IJCAI 2022
Centrality and Consistency: Two-Stage Clean Samples Identification for Learning with Instance-Dependent Noisy Labels ECCV 2022
A Causal Debiasing Framework for Unsupervised Salient Object Detection AAAI 2022
Dual Adversarial Adaptation for Cross-Device Real-World Image Super-Resolution CVPR 2022
Multi-Layer Networks for Ensemble Precipitation Forecasts Postprocessing AAAI 2021
Collaborative Spatial-Temporal Modeling for Language-Queried Video Actor Segmentation CVPR 2021
Bottom-Up Shift and Reasoning for Referring Image Segmentation CVPR 2021
Weakly-Supervised Spatio-Temporal Anomaly Detection in Surveillance Video IJCAI 2021
Towards Interpretable Deep Networks for Monocular Depth Estimation ICCV 2021
LapsCore: Language-Guided Person Search via Color Reasoning ICCV 2021
Trash To Treasure: Harvesting OOD Data With Cross-Modal Matching for Open-Set Semi-Supervised Learning ICCV 2021
Scene-Intuitive Agent for Remote Embodied Visual Grounding CVPR 2021
Cross-Modal Collaborative Representation Learning and a Large-Scale RGBT Benchmark for Crowd Counting CVPR 2021
Cross-Domain Adaptive Clustering for Semi-Supervised Domain Adaptation CVPR 2021
Collaborative Training between Region Proposal Localization and Classification for Domain Adaptive Object Detection ECCV 2020
Graph-Structured Referring Expression Reasoning in the Wild CVPR 2020
An Adversarial Perturbation Oriented Domain Adaptation Approach for Semantic Segmentation AAAI 2020
Tree-Structured Policy Based Progressive Reinforcement Learning for Temporally Language Grounding in Video AAAI 2020
Knowledge Graph Transfer Network for Few-Shot Recognition AAAI 2020
Propagating Over Phrase Relations for One-Stage Visual Grounding ECCV 2020
A Real-Time Cross-Modality Correlation Filtering Method for Referring Expression Comprehension CVPR 2020
Referring Image Segmentation via Cross-Modal Progressive Comprehension CVPR 2020
Peeking into occluded joints: A novel framework for crowd pose estimation ECCV 2020
Linguistic Structure Guided Context Modeling for Referring Image Segmentation ECCV 2020
ClusterNet: Deep Hierarchical Cluster Network With Rigorously Rotation-Invariant Representation for Point Cloud Analysis CVPR 2019
Larger Norm More Transferable: An Adaptive Feature Norm Approach for Unsupervised Domain Adaptation ICCV 2019
Crowd Counting With Deep Structured Scale Integration Network ICCV 2019
Semi-Supervised Skin Detection by Network With Mutual Guidance ICCV 2019
Fashion Retrieval via Graph Reasoning Networks on a Similarity Pyramid ICCV 2019
Dynamic Graph Attention for Referring Expression Comprehension ICCV 2019
Motion Guided Attention for Video Salient Object Detection ICCV 2019
Semi-Supervised Video Salient Object Detection Using Pseudo-Labels ICCV 2019
Non-Local Context Encoder: Robust Biomedical Image Segmentation against Adversarial Attacks AAAI 2019
FRAME Revisited: An Interpretation View Based on Particle Evolution AAAI 2019
Semantic Relationships Guided Representation Learning for Facial Action Unit Recognition AAAI 2019
Cross-Modal Relationship Inference for Grounding Referring Expressions CVPR 2019
Multivariate-Information Adversarial Ensemble for Scalable Joint Distribution Matching ICML 2019
Visual Question Reasoning on General Dependency Tree CVPR 2018
Flow Guided Recurrent Neural Encoder for Video Salient Object Detection CVPR 2018
Interpretable Video Captioning via Trajectory Structured Localization CVPR 2018
Crowd Counting using Deep Recurrent Spatial-Aware Network IJCAI 2018
Multi-Label Image Recognition by Recurrently Discovering Attentional Regions ICCV 2017
Attention-Aware Face Hallucination via Deep Reinforcement Learning CVPR 2017
Instance-Level Salient Object Segmentation CVPR 2017
Deep Contrast Learning for Salient Object Detection CVPR 2016
Visual Saliency Based on Multiscale Deep Features CVPR 2015