Jianbing Shen
89 papers · 2015–2026 · 6 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+13 more ↓ Show less ↑
🌍 Conference Polyglot (6) 🏃 Academic Marathon (10) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐝 Cross-Pollinator (8)
🌈
Renaissance Researcher
(8)
🐝
Cross-Pollinator
(8)
🌍
Conference Polyglot
(6)
🏠
Conference Loyalist
(37)
🤝
Dynamic Duo
(31)
🔬
Deep Specialist
(13)
🏆
Keyword Champion
(9)
🗃️
Keyword Collector
(327)
📈
Trend Setter
🚀
Conference Pioneer
⚡
Prolific Year
(7)
🔥
Unstoppable
(9)
💎
Century Club
(84)
Conferences
CVPR (37)
ICCV (17)
ECCV (16)
AAAI (11)
ACL (6)
ICLR (2)
Top co-authors
Keywords
semantic segmentation
(10)
video object segmentation
(9)
autonomous driving
(8)
attention mechanism
(6)
3d object detection
(6)
video understanding
(5)
vision-language model
(5)
object tracking
(4)
multimodal learning
(4)
salient object detection
(4)
graph neural network
(4)
weakly supervised learning
(3)
representation learning
(3)
instance segmentation
(3)
knowledge distillation
(3)
diffusion model
(3)
object detection
(3)
depth estimation
(3)
domain adaptation
(3)
reinforcement learning
(3)
Papers
Less Is More: Vision Representation Compression for Efficient Video Generation with Large Language Models
AAAI 2026
Multimodal Large Language Models for Multi-Subject In-Context Image Generation
ACL 2026
Compatibility-Aware Dynamic Fine-Tuning for Large Language Models
ACL 2026
Sim4Seg: Boosting Multimodal Multi-disease Medical Diagnosis Segmentation with Region-Aware Vision-Language Similarity Masks
AAAI 2026
Towards High-Fidelity 3D Portrait Generation with Rich Details by Cross-View Prior-Aware Diffusion
AAAI 2026
Self-Rewarding Large Vision-Language Models for Optimizing Prompts in Text-to-Image Generation
ACL 2025
MAM: Modular Multi-Agent Framework for Multi-Modal Medical Diagnosis via Role-Specialized Collaboration
ACL 2025
DrivingSphere: Building a High-fidelity 4D World for Closed-loop Simulation
CVPR 2025
DME-Driver: Integrating Human Decision Logic and 3D Scene Perception in Autonomous Driving
AAAI 2025
Language Prompt for Autonomous Driving
AAAI 2025
OLiDM: Object-aware LiDAR Diffusion Models for Autonomous Driving
AAAI 2025
Improving Medical Large Vision-Language Models with Abnormal-Aware Feedback
ACL 2025
Weak to Strong Generalization for Large Language Models with Multi-capabilities
ICLR 2025
ALOcc: Adaptive Lifting-Based 3D Semantic Occupancy and Cost Volume-Based Flow Predictions
ICCV 2025
RAGNet: Large-scale Reasoning-based Affordance Segmentation Benchmark towards General Grasping
ICCV 2025
DC-ControlNet: Decoupling Inter- and Intra-Element Conditions in Image Generation with Diffusion Models
ICCV 2025
Semantic Causality-Aware Vision-Based 3D Occupancy Prediction
ICCV 2025
Rethinking Temporal Fusion with a Unified Gradient Descent View for 3D Semantic Occupancy Prediction
CVPR 2025
Decoupling Fine Detail and Global Geometry for Compressed Depth Map Super-Resolution
CVPR 2025
LOGICZSL: Exploring Logic-induced Representation for Compositional Zero-shot Learning
CVPR 2025
Visual In-Context Learning for Large Vision-Language Models
ACL 2024
Fine-Grained Distillation for Long Document Retrieval
AAAI 2024
DI-V2X: Learning Domain-Invariant Representation for Vehicle-Infrastructure Collaborative 3D Object Detection
AAAI 2024
Leveraging Frame Affinity for sRGB-to-RAW Video De-rendering
CVPR 2024
IS-Fusion: Instance-Scene Collaborative Fusion for Multimodal 3D Object Detection
CVPR 2024
High-Precision Self-Supervised Monocular Depth Estimation with Rich-Resource Prior
ECCV 2024
TopoMLP: A Simple yet Strong Pipeline for Driving Topology Reasoning
ICLR 2024
RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception
ECCV 2024
LWSIS: LiDAR-Guided Weakly Supervised Instance Segmentation for Autonomous Driving
AAAI 2023
Exposing the Self-Supervised Space-Time Correspondence Learning via Graph Kernels
AAAI 2023
SSDA3D: Semi-supervised Domain Adaptation for 3D Object Detection from Point Cloud
AAAI 2023
Weakly Supervised Monocular 3D Object Detection Using Multi-View Projection and Direction Consistency
CVPR 2023
Referring Multi-Object Tracking
CVPR 2023
Self-Supervised Monocular Depth Estimation by Direction-aware Cumulative Convolution Network
ICCV 2023
OnlineRefer: A Simple Online Baseline for Referring Video Object Segmentation
ICCV 2023
ProposalContrast: Unsupervised Pre-training for LiDAR-Based 3D Object Detection
ECCV 2022
Rethinking Clustering-Based Pseudo-Labeling for Unsupervised Meta-Learning
ECCV 2022
Learning Disentanglement with Decoupled Labels for Vision-Language Navigation
ECCV 2022
BRNet: Exploring Comprehensive Features for Monocular Depth Estimation
ECCV 2022
Semi-Supervised 3D Object Detection with Proficient Teachers
ECCV 2022
Multi-Level Representation Learning With Semantic Alignment for Referring Video Object Segmentation
CVPR 2022
A Graph Matching Perspective With Transformers on Video Instance Segmentation
CVPR 2022
Tree Energy Loss: Towards Sparsely Annotated Semantic Segmentation
CVPR 2022
Counterfactual Cycle-Consistent Learning for Instruction Following and Generation in Vision-Language Navigation
CVPR 2022
Modality Synergy Complement Learning with Cascaded Aggregation for Visible-Infrared Person Re-identification
ECCV 2022
Video Object Segmentation Using Global and Instance Embedding Learning
CVPR 2021
Structured Scene Memory for Vision-Language Navigation
CVPR 2021
Learning To Fuse Asymmetric Feature Maps in Siamese Trackers
CVPR 2021
Cross-Modality Person Re-Identification via Modality Confusion and Center Aggregation
ICCV 2021
Full-Duplex Strategy for Video Object Segmentation
ICCV 2021
Face Forensics in the Wild
CVPR 2021
Probabilistic Structural Latent Representation for Unsupervised Embedding
CVPR 2020
NETNet: Neighbor Erasing and Transferring Network for Better Single Shot Object Detection
CVPR 2020
Learning Video Object Segmentation From Unlabeled Videos
CVPR 2020
A Unified Object Motion and Affinity Model for Online Multi-Object Tracking
CVPR 2020
Hierarchical Human Parsing With Typed Part-Relation Reasoning
CVPR 2020
Self-Learning With Rectification Strategy for Human Parsing
CVPR 2020
Video Object Segmentation with Episodic Graph Memory Networks
ECCV 2020
Weakly Supervised 3D Object Detection from Lidar Point Cloud
ECCV 2020
Dynamic Dual-Attentive Aggregation Learning for Visible-Infrared Person Re-Identification
ECCV 2020
CLNet: A Compact Latent Network for Fast Adjusting Siamese Trackers
ECCV 2020
Active Visual Information Gathering for Vision-Language Navigation
ECCV 2020
Cascaded Human-Object Interaction Recognition
CVPR 2020
Camouflaged Object Detection
CVPR 2020
Multi-Mutual Consistency Induced Transfer Subspace Learning for Human Motion Segmentation
CVPR 2020
LiDAR-Based Online 3D Video Object Detection With Graph-Based Message Passing and Spatiotemporal Transformer Attention
CVPR 2020
Shifting More Attention to Video Salient Object Detection
CVPR 2019
An Iterative and Cooperative Top-Down and Bottom-Up Inference Network for Salient Object Detection
CVPR 2019
See More, Know More: Unsupervised Video Object Segmentation With Co-Attention Siamese Networks
CVPR 2019
Learning Unsupervised Video Object Segmentation Through Visual Attention
CVPR 2019
Human-Aware Motion Deblurring
ICCV 2019
Learning Compositional Neural Information Fusion for Human Parsing
ICCV 2019
Gaussian Affinity for Max-Margin Class Imbalanced Learning
ICCV 2019
Zero-Shot Video Object Segmentation via Attentive Graph Neural Networks
ICCV 2019
Striking the Right Balance With Uncertainty
CVPR 2019
Adversarial Defense by Restricting the Hidden Space of Deep Neural Networks
ICCV 2019
Towards Bridging Semantic Gap to Improve Semantic Segmentation
ICCV 2019
Salient Object Detection With Pyramid Attention and Salient Edges
CVPR 2019
Salient Object Detection Driven by Fixation Prediction
CVPR 2018
Learning Human-Object Interactions by Graph Parsing Neural Networks
ECCV 2018
Pyramid Dilated Deeper ConvLSTM for Video Salient Object Detection
ECCV 2018
Hyperparameter Optimization for Tracking With Continuous Deep Q-Learning
CVPR 2018
Triplet Loss in Siamese Network for Object Tracking
ECCV 2018
Attentive Fashion Grammar Network for Fashion Landmark Detection and Clothing Category Classification
CVPR 2018
Revisiting Video Saliency: A Large-Scale Benchmark and a New Model
CVPR 2018
Deep Cropping via Attention Box Prediction and Aesthetics Assessment
ICCV 2017
Super-Trajectory for Video Segmentation
ICCV 2017
Linearization to Nonlinear Learning for Visual Tracking
ICCV 2015
Saliency-Aware Geodesic Video Object Segmentation
CVPR 2015