Chunhua Shen
205 papers · 2008–2026 · 11 top CS/AI conferences
Achievements
Keyword Pioneer
Hot Topic Early Bird
Taxonomy Completionist (15)
Interdisciplinary Bridge
Conference Polyglot (11)
Academic Marathon (17)
Conference Loyalist (21)
Keyword Trendsetter Combo (5)
Grand Slam
Triple Crown
Dynamic Duo (44)
Topic Pioneer
Deep Specialist (26)
Topic Evolution
Keyword Champion (13)
Keyword Collector (707)
Prolific Year (18)
The Questioner (4)
Century Club (202)
Trend Setter
Unstoppable (14)
Conference Pioneer
Conferences
CVPR (85)
ICCV (37)
NIPS (21)
ECCV (19)
AAAI (11)
ICLR (11)
IJCAI (8)
ICML (6)
ACL (4)
WACV (2)
JMLR (1)
Keywords
semantic segmentation (33)
convolutional neural network (23)
object detection (18)
attention mechanism (14)
instance segmentation (13)
representation learning (9)
image segmentation (9)
metric learning (9)
visual question answering (9)
depth estimation (9)
image classification (8)
few-shot learning (8)
model compression (8)
neural network (7)
knowledge distillation (6)
point cloud (6)
conditional random field (6)
neural architecture search (6)
multimodal learning (6)
self-supervised learning (6)
Papers
ODYSSEY: Open-World Quadrupeds Exploration and Manipulation for Long-Horizon Tasks
AAAI 2026
Efficient Self-Evaluation for Diffusion Language Models via Sequence Regeneration
ACL 2026
Beyond Hard Masks: Progressive Token Evolution for Diffusion Language Models
ACL 2026
Revisiting Convolution Architecture in the Realm of DNA Foundation Models
ICLR 2025
Depth Any Video with Scalable Synthetic Data
ICLR 2025
MovieBench: A Hierarchical Movie Level Dataset for Long Video Generation
CVPR 2025
Boltzmann-Aligned Inverse Folding Model as a Predictor of Mutational Effects on Protein-Protein Interactions
ICLR 2025
TG-LLaVA: Text Guided LLaVA via Learnable Latent Embeddings
AAAI 2025
Framer: Interactive Frame Interpolation
ICLR 2025
What Matters When Repurposing Diffusion Models for General Dense Perception Tasks?
ICLR 2025
Aether: Geometric-Aware Unified World Modeling
ICCV 2025
SurfaceSplat: Connecting Surface Reconstruction and Gaussian Splatting
ICCV 2025
Fine-grained Abnormality Prompt Learning for Zero-shot Anomaly Detection
ICCV 2025
Unified Open-World Segmentation with Multi-Modal Prompts
ICCV 2025
SMSTracker: Tri-path Score Mask Sigma Fusion for Multi-Modal Tracking
ICCV 2025
MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequences
ICLR 2025
Seeing the Unseen: Composing Outliers for Compositional Zero-Shot Learning
IJCAI 2025
POMATO: Marrying Pointmap Matching with Temporal Motions for Dynamic 3D Reconstruction
ICCV 2025
SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories
CVPR 2025
PerturboLLaVA: Reducing Multimodal Hallucinations with Perturbative Visual Training
ICLR 2025
Physics Aware Neural Networks for Unsupervised Binding Energy Prediction
ICML 2025
Generative Active Learning for Long-tailed Instance Segmentation
ICML 2024
DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative Data
CVPR 2024
VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks
ECCV 2024
FreeCompose: Generic Zero-Shot Image Composition with Diffusion Prior
ECCV 2024
FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition
CVPR 2024
Traffic Scene Parsing through the TSP6K Dataset
CVPR 2024
A Simple Image Segmentation Framework via In-Context Examples
NIPS 2024
Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation
NIPS 2024
Retrieval-Augmented Primitive Representations for Compositional Zero-Shot Learning
AAAI 2024
PointAttN: You Only Need Attention for Point Cloud Completion
AAAI 2024
LoRAPrune: Structured Pruning Meets Low-Rank Parameter-Efficient Fine-Tuning
ACL 2024
Enhanced Visual Instruction Tuning with Synthesized Image-Dialogue Data
ACL 2024
Floating Anchor Diffusion Model for Multi-motif Scaffolding
ICML 2024
On the Trajectory Regularity of ODE-based Diffusion Sampling
ICML 2024
De novo Protein Design Using Geometric Vector Field Networks
ICLR 2024
Matcher: Segment Anything with One Shot Using All-Purpose Feature Matching
ICLR 2024
Object-Aware Inversion and Reassembly for Image Editing
ICLR 2024
Zolly: Zoom Focal Length Correctly for Perspective-Distorted Human Mesh Reconstruction
ICCV 2023
Images Speak in Images: A Generalist Painter for In-Context Visual Learning
CVPR 2023
Learning Conditional Attributes for Compositional Zero-Shot Learning
CVPR 2023
DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models
NIPS 2023
Generative Prompt Model for Weakly Supervised Object Localization
ICCV 2023
SegPrompt: Boosting Open-World Segmentation via Category-Level Prompt Learning
ICCV 2023
FrozenRecon: Pose-free 3D Scene Reconstruction with Frozen Depth Models
ICCV 2023
Conditional Positional Encodings for Vision Transformers
ICLR 2023
DiffuMask: Synthesizing Images with Pixel-level Annotations for Semantic Segmentation Using Diffusion Models
ICCV 2023
SegGPT: Towards Segmenting Everything in Context
ICCV 2023
Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image
ICCV 2023
Robust Geometry-Preserving Depth Estimation Using Differentiable Rendering
ICCV 2023
CTVIS: Consistent Training for Online Video Instance Segmentation
ICCV 2023
FoPro: Few-Shot Guided Robust Webly-Supervised Prototypical Learning
AAAI 2023
Point-Teaching: Weakly Semi-supervised Object Detection with Point Annotations
AAAI 2023
A Survey on Efficient Training of Transformers
IJCAI 2023
Poseur: Direct Human Pose Regression with Transformers
ECCV 2022
Text-Adaptive Multiple Visual Prototype Matching for Video-Text Retrieval
NIPS 2022
SegViT: Semantic Segmentation with Plain Vision Transformers
NIPS 2022
Hierarchical Normalization for Robust Monocular Depth Estimation
NIPS 2022
Multi-dataset Training of Transformers for Robust Action Recognition
NIPS 2022
DENSE: Data-Free One-Shot Federated Learning
NIPS 2022
Adv-Attribute: Inconspicuous and Transferable Adversarial Attack on Face Recognition
NIPS 2022
Fully Convolutional One-Stage 3D Object Detection on LiDAR Range Images
NIPS 2022
FreeSOLO: Learning To Segment Objects Without Annotations
CVPR 2022
RigidFlow: Self-Supervised Scene Flow Learning on Point Clouds by Local Rigidity Prior
CVPR 2022
Catching Both Gray and Black Swans: Open-Set Supervised Anomaly Detection
CVPR 2022
Retrieval Augmented Classification for Long-Tail Visual Recognition
CVPR 2022
Boosting Robustness of Image Matting With Context Assembling and Strong Data Augmentation
CVPR 2022
TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation
CVPR 2022
PyramidCLIP: Hierarchical Feature Alignment for Vision-language Model Pretraining
NIPS 2022
DisCo: Remedying Self-Supervised Learning on Lightweight Models with Distilled Contrastive Learning
ECCV 2022
Efficient Decoder-Free Object Detection with Transformers
ECCV 2022
PointInst3D: Segmenting 3D Instances by Points
ECCV 2022
Generic Perceptual Loss for Modeling Structured Output Dependencies
CVPR 2021
Learning Spatial-Semantic Relationship for Facial Attribute Recognition With Limited Labeled Data
CVPR 2021
Feature Decomposition and Reconstruction Learning for Effective Facial Expression Recognition
CVPR 2021
End-to-End Video Instance Segmentation With Transformers
CVPR 2021
Dense Contrastive Learning for Self-Supervised Visual Pre-Training
CVPR 2021
FCPose: Fully Convolutional Multi-Person Pose Estimation With Dynamic Instance-Aware Convolutions
CVPR 2021
BoxInst: High-Performance Instance Segmentation With Box Annotations
CVPR 2021
HCRF-Flow: Scene Flow From Point Clouds With Continuous High-Order CRFs and Position-Aware Flow Embedding
CVPR 2021
Learning Affinity-Aware Upsampling for Deep Image Matting
CVPR 2021
SA-BNN: State-Aware Binary Neural Network
AAAI 2021
Diverse Knowledge Distillation for End-to-End Person Search
AAAI 2021
Occluded Person Re-Identification With Single-Scale Global Representations
ICCV 2021
Meta Navigator: Search for a Good Adaptation Policy for Few-Shot Learning
ICCV 2021
A Simple Baseline for Semi-Supervised Semantic Segmentation With Strong Data Augmentation
ICCV 2021
Channel-Wise Knowledge Distillation for Dense Prediction
ICCV 2021
BV-Person: A Large-Scale Dataset for Bird-View Person Re-Identification
ICCV 2021
FATNN: Fast and Accurate Ternary Neural Networks
ICCV 2021
Twins: Revisiting the Design of Spatial Attention in Vision Transformers
NIPS 2021
Dynamic Neural Representational Decoders for High-Resolution Semantic Segmentation
NIPS 2021
DoDNet: Learning To Segment Multi-Organ and Tumors From Multiple Partially Labeled Datasets
CVPR 2021
Learning To Recover 3D Scene Shape From a Single Image
CVPR 2021
Graph Attention Tracking
CVPR 2021
AQD: Towards Accurate Quantized Object Detection
CVPR 2021
DyCo3D: Robust Instance Segmentation of 3D Point Clouds Through Dynamic Convolution
CVPR 2021
AE TextSpotter: Learning Visual and Linguistic Representation for Ambiguous Text Spotting
ECCV 2020
SOLOv2: Dynamic and Fast Instance Segmentation
NIPS 2020
V-PROM: A Benchmark for Visual Reasoning Using Visual Progressive Matrices
AAAI 2020
Task-Aware Monocular Depth Estimation for 3D Object Detection
AAAI 2020
Training Quantized Neural Networks With a Full-Precision Auxiliary Module
CVPR 2020
Memory-Efficient Hierarchical Neural Architecture Search for Image Denoising
CVPR 2020
BlendMask: Top-Down Meets Bottom-Up for Instance Segmentation
CVPR 2020
DeepEMD: Few-Shot Image Classification With Differentiable Earth Mover's Distance and Structured Classifiers
CVPR 2020
ABCNet: Real-Time Scene Text Spotting With Adaptive Bezier-Curve Network
CVPR 2020
Context Prior for Scene Segmentation
CVPR 2020
Mask Encoding for Single Shot Instance Segmentation
CVPR 2020
NAS-FCOS: Fast Neural Architecture Search for Object Detection
CVPR 2020
On the General Value of Evidence, and Bilingual Scene-Text Visual Question Answering
CVPR 2020
REVERIE: Remote Embodied Visual Referring Expression in Real Indoor Environments
CVPR 2020
Self-Trained Deep Ordinal Regression for End-to-End Video Anomaly Detection
CVPR 2020
PolarMask: Single Shot Instance Segmentation With Polar Representation
CVPR 2020
Conditional Convolutions for Instance Segmentation
ECCV 2020
Representative Graph Neural Network
ECCV 2020
Soft Expert Reward Learning for Vision-and-Language Navigation
ECCV 2020
Weighing Counts: Sequential Crowd Counting by Reinforcement Learning
ECCV 2020
Efficient Semantic Video Segmentation with Per-frame Inference
ECCV 2020
Scene Text Image Super-Resolution in the Wild
ECCV 2020
Segmenting Transparent Objects in the Wild
ECCV 2020
Learning and Memorizing Representative Prototypes for 3D Point Cloud Semantic and Instance Segmentation
ECCV 2020
SOLO: Segmenting Objects by Locations
ECCV 2020
Instance-Aware Embedding for Point Cloud Instance Segmentation
ECCV 2020
Unsupervised Representation Learning by Predicting Random Distances
IJCAI 2020
Architecture Search of Dynamic Cells for Semantic Video Segmentation
WACV 2020
Template-Based Automatic Search of Compact Semantic Segmentation Architectures
WACV 2020
Exploiting Temporal Consistency for Real-Time Video Depth Estimation
ICCV 2019
Indices Matter: Learning to Index for Deep Image Matting
ICCV 2019
Enforcing Geometric Constraints of Virtual Normal for Depth Prediction
ICCV 2019
Self-Training With Progressive Augmentation for Unsupervised Cross-Domain Person Re-Identification
ICCV 2019
From Open Set to Closed Set: Counting Objects by Spatial Divide-and-Conquer
ICCV 2019
Efficient and Accurate Arbitrary-Shaped Text Detection With Pixel Aggregation Network
ICCV 2019
FCOS: Fully Convolutional One-Stage Object Detection
ICCV 2019
Multi-marginal Wasserstein GAN
NIPS 2019
Show, Attend and Read: A Simple and Strong Baseline for Irregular Text Recognition
AAAI 2019
Mind Your Neighbours: Image Annotation With Metadata Neighbourhood Graph Co-Attention Networks
CVPR 2019
Neighbourhood Watch: Referring Expression Comprehension via Language-Guided Graph Attention Networks
CVPR 2019
Attention-Guided Network for Ghost-Free High Dynamic Range Imaging
CVPR 2019
Decoders Matter for Semantic Segmentation: Data-Dependent Decoding Enables Flexible Feature Aggregation
CVPR 2019
Associatively Segmenting Instances and Semantics in Point Clouds
CVPR 2019
CANet: Class-Agnostic Segmentation Networks With Iterative Refinement and Attentive Few-Shot Learning
CVPR 2019
Visual Question Answering as Reading Comprehension
CVPR 2019
Fast Neural Architecture Search of Compact Semantic Segmentation Models via Auxiliary Cells
CVPR 2019
Light-Weight Hybrid Convolutional Network for Liver Tumor Segmentation
IJCAI 2019
Knowledge Adaptation for Efficient Semantic Segmentation
CVPR 2019
Unsupervised Scale-consistent Depth and Ego-motion Learning from Monocular Video
NIPS 2019
Structured Binary Neural Networks for Accurate Image Classification and Semantic Segmentation
CVPR 2019
Bootstrapping the Performance of Webly Supervised Semantic Segmentation
CVPR 2018
Adversarial Learning with Local Coordinate Coding
ICML 2018
VITAL: VIsual Tracking via Adversarial Learning
CVPR 2018
Towards Effective Low-Bitwidth Convolutional Neural Networks
CVPR 2018
Salient Object Detection by Lossless Feature Reflection
IJCAI 2018
Learning to Predict Crisp Boundaries
ECCV 2018
Goal-Oriented Visual Question Generation via Intermediate Rewards
ECCV 2018
Repulsion Loss: Detecting Pedestrians in a Crowd
CVPR 2018
Visual Question Answering With Memory-Augmented Networks
CVPR 2018
Are You Talking to Me? Reasoned Visual Dialog Generation Through Adversarial Learning
CVPR 2018
An End-to-End TextSpotter With Explicit Alignment and Attention
CVPR 2018
Parallel Attention: A Unified Framework for Visual Object Discovery Through Dialogs and Queries
CVPR 2018
FSRNet: End-to-End Learning Face Super-Resolution With Facial Priors
CVPR 2018
Monocular Relative Depth Perception With Web Stereo Data Supervision
CVPR 2018
Sequential Person Recognition in Photo Albums With a Recurrent Network
CVPR 2017
Towards Context-Aware Interaction Recognition for Visual Relationship Detection
ICCV 2017
When Unsupervised Domain Adaptation Meets Tensor Representations
ICCV 2017
Adversarial PoseNet: A Structure-Aware Convolutional Network for Human Pose Estimation
ICCV 2017
Towards End-To-End Text Spotting With Convolutional Recurrent Neural Networks
ICCV 2017
Multi-Attention Network for One Shot Learning
CVPR 2017
From Motion Blur to Motion Flow: A Deep Learning Solution for Removing Heterogeneous Motion Blur
CVPR 2017
RefineNet: Multi-Path Refinement Networks for High-Resolution Semantic Segmentation
CVPR 2017
Attend in Groups: A Weakly-Supervised Deep Learning Framework for Learning From Web Data
CVPR 2017
The VQA-Machine: Learning How to Use Existing Vision Algorithms to Answer New Questions
CVPR 2017
Explicit Knowledge-based Reasoning for Visual Question Answering
IJCAI 2017
Learning Multi-level Region Consistency with Dense Multi-label Networks for Semantic Segmentation
IJCAI 2017
Deep Descriptor Transforming for Image Co-Localization
IJCAI 2017
What Value Do Explicit High Level Concepts Have in Vision to Language Problems?
CVPR 2016
Efficient Piecewise Training of Deep Structured Models for Semantic Segmentation
CVPR 2016
Ask Me Anything: Free-Form Visual Question Answering Based on Knowledge From External Sources
CVPR 2016
Fast Training of Triplet-Based Deep Binary Embedding Networks
CVPR 2016
Image Restoration Using Very Deep Convolutional Encoder-Decoder Networks with Symmetric Skip Connections
NIPS 2016
Less Is More: Zero-Shot Learning From Online Textual Documents With Noise Suppression
CVPR 2016
What's Wrong With That Object? Identifying Images of Unusual Objects by Modelling the Detection Score Distribution
CVPR 2016
Mid-Level Deep Pattern Mining
CVPR 2015
Deep Convolutional Neural Fields for Depth Estimation From a Single Image
CVPR 2015
Deeply Learning the Messages in Message Passing Inference
NIPS 2015
Hyperspectral Compressive Sensing Using Manifold-Structured Sparsity Prior
ICCV 2015
The Treasure Beneath Convolutional Layers: Cross-Convolutional-Layer Pooling for Image Classification
CVPR 2015
Learning Graph Structure for Multi-Label Image Classification via Clique Generation
CVPR 2015
Efficient SDP Inference for Fully-Connected CRFs Based on Low-Rank Decomposition
CVPR 2015
Learning to Rank in Person Re-Identification With Metric Ensembles
CVPR 2015
Depth and Surface Normal Estimation From Monocular Images Using Regression on Deep Features and Hierarchical CRFs
CVPR 2015
Supervised Discrete Hashing
CVPR 2015
Encoding High Dimensional Local Features by Sparse Coding Based Fisher Vectors
NIPS 2014
Fast Supervised Hashing with Decision Trees for High-Dimensional Data
CVPR 2014
Efficient Pedestrian Detection by Directly Optimizing the Partial Area under the ROC Curve
ICCV 2013
Dictionary Learning and Sparse Coding on Grassmann Manifolds: An Extrinsic Solution
ICCV 2013
Contextual Hypergraph Modeling for Salient Object Detection
ICCV 2013
A General Two-Step Approach to Learning-Based Hashing
ICCV 2013
Learning Hash Functions Using Column Generation
ICML 2013
Inductive Hashing on Manifolds
CVPR 2013
Bilinear Programming for Human Activity Recognition with Unknown MRF Graphs
CVPR 2013
A Fast Semidefinite Approach to Solving Binary Quadratic Problems
CVPR 2013
Part-Based Visual Tracking with Online Latent Structural Learning
CVPR 2013
Learning Compact Binary Codes for Visual Tracking
CVPR 2013
Positive Semidefinite Metric Learning Using Boosting-like Algorithms
JMLR 2012
Positive Semidefinite Metric Learning with Boosting
NIPS 2009
PSDBoost: Matrix-Generation Linear Programming for Positive Semidefinite Matrices Learning
NIPS 2008