Ming-Ming Cheng
110 papers · 2013–2026 · 10 conferences · across top CS/AI conferences
Achievements
Academic Marathon (12)
Conference Polyglot (10)
Keyword Pioneer
Interdisciplinary Bridge
Cross-Pollinator (10)
Renaissance Researcher (7)
Taxonomy Completionist (96)
Conference Loyalist (25)
Dynamic Duo (31)
Grand Slam
Keyword Champion (5)
Triple Crown
Deep Specialist (32)
Prolific Year (13)
Conference Pioneer
Keyword Collector (416)
Unstoppable (9)
Trend Setter
Century Club (107)
Conferences
CVPR (47)
ICCV (25)
ECCV (9)
NIPS (9)
AAAI (7)
ICLR (6)
IJCAI (4)
AISTATS (1)
ICML (1)
UAI (1)
Keywords
semantic segmentation (20)
object detection (17)
convolutional neural network (11)
salient object detection (9)
neural network (7)
image segmentation (6)
knowledge distillation (6)
catastrophic forgetting (6)
zero-shot learning (5)
class incremental learning (5)
domain adaptation (5)
diffusion model (5)
continual learning (4)
deep learning (4)
remote sensing (4)
attention mechanism (4)
semi-supervised learning (3)
image synthesis (3)
representation learning (3)
depth estimation (3)
Papers
DenoDet V2: Phase-Amplitude Cross Denoising for SAR Object Detection
AAAI 2026
SM3Det: A Unified Model for Multi-Modal Remote Sensing Object Detection
AAAI 2026
Strip R-CNN: Large Strip Convolution for Remote Sensing Object Detection
AAAI 2026
TAR3D: Creating High-Quality 3D Assets via Next-Part Prediction
ICCV 2025
VisualCloze: A Universal Image Generation Framework via Visual In-Context Learning
ICCV 2025
DISTA-Net: Dynamic Closely-Spaced Infrared Small Target Unmixing
ICCV 2025
KAC: Kolmogorov-Arnold Classifier for Continual Learning
CVPR 2025
RSAR: Restricted State Angle Resolver and Rotated SAR Benchmark
CVPR 2025
Towards RAW Object Detection in Diverse Conditions
CVPR 2025
DFormerv2: Geometry Self-Attention for RGBD Semantic Segmentation
CVPR 2025
GET: Unlocking the Multi-modal Potential of CLIP for Generalized Category Discovery
CVPR 2025
From Words to Worth: Newborn Article Impact Prediction with LLM
AAAI 2025
InterLCM: Low-Quality Images as Intermediate States of Latent Consistency Models for Effective Blind Face Restoration
ICLR 2025
One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt
ICLR 2025
Multimodality Helps Few-shot 3D Point Cloud Semantic Segmentation
ICLR 2025
Re-Aligning Language to Visual Objects with an Agentic Workflow
ICLR 2025
Unbiased Region-Language Alignment for Open-Vocabulary Dense Prediction
ICCV 2025
Revisiting Efficient Semantic Segmentation: Learning Offsets for Better Spatial and Class Feature Alignment
ICCV 2025
AR-1-to-3: Single Image to Consistent 3D Object via Next-View Prediction
ICCV 2025
Advancing Textual Prompt Learning with Anchored Attributes
ICCV 2025
Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing
ICCV 2025
DFormer: Rethinking RGBD Representation Learning for Semantic Segmentation
ICLR 2024
CrossKD: Cross-Head Knowledge Distillation for Object Detection
CVPR 2024
CorrMatch: Label Propagation via Correlation Matching for Semi-Supervised Semantic Segmentation
CVPR 2024
PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding
CVPR 2024
TeMO: Towards Text-Driven 3D Stylization for Multi-Object Meshes
CVPR 2024
OPUS: Occupancy Prediction Using a Sparse Set
NIPS 2024
SARDet-100K: Towards Open-Source Benchmark and ToolKit for Large-Scale SAR Object Detection
NIPS 2024
Token Merging for Training-Free Semantic Binding in Text-to-Image Synthesis
NIPS 2024
Let's Start Over: Retraining with Selective Samples for Generalized Category Discovery
IJCAI 2024
Fine-Grained Knowledge Selection and Restoration for Non-exemplar Class Incremental Learning
AAAI 2024
Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation
ICML 2024
Towards Stable 3D Object Detection
ECCV 2024
Task-Adaptive Saliency Guidance for Exemplar-free Class Incremental Learning
CVPR 2024
Generative Multi-modal Models are Good Class Incremental Learners
CVPR 2024
Traffic Scene Parsing through the TSP6K Dataset
CVPR 2024
Faster Diffusion: Rethinking the Role of the Encoder for Diffusion Model Inference
NIPS 2024
StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation
NIPS 2024
AMT: All-Pairs Multi-Field Transforms for Efficient Frame Interpolation
CVPR 2023
Multi-Space Neural Radiance Fields
CVPR 2023
Endpoints Weight Fusion for Class Incremental Semantic Segmentation
CVPR 2023
Looking Through the Glass: Neural Surface Reconstruction Against High Specular Reflections
CVPR 2023
Masked Autoencoders are Efficient Class Incremental Learners
ICCV 2023
Masked Diffusion Transformer is a Strong Image Synthesizer
ICCV 2023
SLAN: Self-Locator Aided Network for Vision-Language Understanding
ICCV 2023
Large Selective Kernel Network for Remote Sensing Object Detection
ICCV 2023
SRFormer: Permuted Self-Attention for Single Image Super-Resolution
ICCV 2023
Long-Tailed Class Incremental Learning
ECCV 2022
SegNeXt: Rethinking Convolutional Attention Design for Semantic Segmentation
NIPS 2022
FocusCut: Diving Into a Focus View in Interactive Segmentation
CVPR 2022
Representation Compensation Networks for Continual Semantic Segmentation
CVPR 2022
Towards an End-to-End Framework for Flow-Guided Video Inpainting
CVPR 2022
Localization Distillation for Dense Object Detection
CVPR 2022
VQFR: Blind Face Restoration with Vector-Quantized Dictionary and Parallel Decoder
ECCV 2022
Restore Globally, Refine Locally: A Mask-Guided Scheme to Accelerate Super-Resolution Networks
ECCV 2022
On the Connection between Local Attention and Dynamic Depth-wise Convolution
ICLR 2022
iNAS: Integral NAS for Device-Aware Salient Object Detection
ICCV 2021
Structured sparsification with joint optimization of group convolution and channel shuffle
UAI 2021
DOTS: Decoupling Operation and Topology in Differentiable Architecture Search
CVPR 2021
Semi-Supervised Learning with Meta-Gradient
AISTATS 2021
Temporal Modulation Network for Controllable Space-Time Video Super-Resolution
CVPR 2021
Representative Batch Normalization With Feature Calibration
CVPR 2021
Global2Local: Efficient Structure Search for Video Action Segmentation
CVPR 2021
Personalized Image Semantic Segmentation
ICCV 2021
VecRoad: Point-Based Iterative Graph Exploration for Road Graphs Extraction
CVPR 2020
Rethinking Computer-Aided Tuberculosis Diagnosis
CVPR 2020
Improving Convolutional Networks With Self-Calibrated Convolutions
CVPR 2020
Image Formation Model Guided Deep Image Super-Resolution
AAAI 2020
Interactive Image Segmentation With First Click Attention
CVPR 2020
Highly Efficient Salient Object Detection with 100K Parameters
ECCV 2020
Deep Hough Transform for Semantic Line Detection
ECCV 2020
Gradient-Induced Co-Saliency Detection
ECCV 2020
Pyramid Constrained Self-Attention Network for Fast Video Salient Object Detection
AAAI 2020
ICNet: Intra-saliency Correlation Network for Co-Saliency Detection
NIPS 2020
Taking a Deeper Look at Co-Salient Object Detection
CVPR 2020
Camouflaged Object Detection
CVPR 2020
Strip Pooling: Rethinking Spatial Pooling for Scene Parsing
CVPR 2020
Optimizing the F-Measure for Threshold-Free Salient Object Detection
ICCV 2019
Image Inpainting With Learnable Bidirectional Attention Maps
ICCV 2019
Joint Acne Image Grading and Counting via Label Distribution Learning
ICCV 2019
Zero-Shot Emotion Recognition via Affective Structural Embedding
ICCV 2019
Unsupervised Scale-consistent Depth and Ego-motion Learning from Monocular Video
NIPS 2019
Multi-Level Context Ultra-Aggregation for Stereo Matching
CVPR 2019
RegularFace: Deep Face Recognition via Exclusive Regularization
CVPR 2019
IP102: A Large-Scale Benchmark Dataset for Insect Pest Recognition
CVPR 2019
Shifting More Attention to Video Salient Object Detection
CVPR 2019
S4Net: Single Stage Salient-Instance Segmentation
CVPR 2019
An Iterative and Cooperative Top-Down and Bottom-Up Inference Network for Salient Object Detection
CVPR 2019
Contrast Prior and Fluid Pyramid Integration for RGBD Salient Object Detection
CVPR 2019
A Simple Pooling-Based Design for Real-Time Salient Object Detection
CVPR 2019
Integral Object Mining via Online Attention Accumulation
ICCV 2019
Scoot: A Perceptual Metric for Facial Sketches
ICCV 2019
EGNet: Edge Guidance Network for Salient Object Detection
ICCV 2019
DEL: Deep Embedding Learning for Efficient Image Segmentation
IJCAI 2018
Salient Objects in Clutter: Bringing Salient Object Detection to the Foreground
ECCV 2018
Associating Inter-Image Salient Instances for Weakly Supervised Semantic Segmentation
ECCV 2018
Self-Erasing Network for Integral Object Attention
NIPS 2018
Crowd Counting With Deep Negative Correlation Learning
CVPR 2018
Revisiting Video Saliency: A Large-Scale Benchmark and a New Model
CVPR 2018
Enhanced-alignment Measure for Binary Foreground Map Evaluation
IJCAI 2018
Hi-Fi: Hierarchical Feature Integration for Skeleton Detection
IJCAI 2018
Object Region Mining With Adversarial Erasing: A Simple Classification to Semantic Segmentation Approach
CVPR 2017
Richer Convolutional Features for Edge Detection
CVPR 2017
Deeply Supervised Salient Object Detection With Short Connections
CVPR 2017
GMS: Grid-based Motion Statistics for Fast, Ultra-Robust Feature Correspondence
CVPR 2017
Structure-Measure: A New Way to Evaluate Foreground Maps
ICCV 2017
BING: Binarized Normed Gradients for Objectness Estimation at 300fps
CVPR 2014
Dense Semantic Image Segmentation with Objects and Attributes
CVPR 2014
Robust Non-parametric Data Fitting for Correspondence Modeling
ICCV 2013
Efficient Salient Region Detection with Soft Image Abstraction
ICCV 2013