Ming Yang
72 papers · 2013–2026 · 13 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+12 more ↓ Show less ↑
🐣 Hot Topic Early Bird 🧭 Keyword Pioneer 🌈 Renaissance Researcher (5) 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (13)
🧭
Keyword Pioneer
🐣
Hot Topic Early Bird
🏃
Academic Marathon
(12)
🤝
Dynamic Duo
(14)
🏆
Grand Slam
🧬
Topic Evolution
🔥
Unstoppable
(8)
🚀
Conference Pioneer
📈
Trend Setter
🗃️
Keyword Collector
(328)
⚡
Prolific Year
(5)
💎
Century Club
(66)
Conferences
CVPR (18)
AAAI (12)
ICCV (10)
ECCV (6)
IJCAI (6)
NIPS (5)
ICML (4)
ACML (3)
EMNLP (3)
ICLR (2)
ACL (1)
AISTATS (1)
MICCAI (1)
Top co-authors
Research topics
Keywords
object detection
(6)
diffusion model
(6)
multimodal large language model
(5)
vision-language model
(5)
convolutional neural network
(5)
semantic segmentation
(4)
transfer learning
(4)
multimodal learning
(3)
video understanding
(3)
large language model
(3)
multi-modal learning
(3)
multi-view clustering
(3)
pedestrian detection
(3)
active learning
(2)
zero-shot learning
(2)
reinforcement learning
(2)
attention mechanism
(2)
image captioning
(2)
contrastive learning
(2)
model compression
(2)
Papers
Distributional Priors Guided Diffusion for Generating 3D Molecules in Low Data Regimes
AAAI 2026
Learning Diffusion Policy from Primitive Skills for Robot Manipulation
AAAI 2026
DSAP: Enhancing Generalization in Goal-Conditioned Reinforcement Learning
AAAI 2026
Unified View Extraction with Low-Rankness and Smoothness Fusion for Multi-View Subspace Clustering
AAAI 2026
SCAN: Self-Calibrated AutoregressioN for High-Quality Visual Generation
AAAI 2026
Tensorized Label Learning via Balanced Tensor Regression
AAAI 2026
Reversing Flow for Image Restoration
CVPR 2025
DynFocus: Dynamic Cooperative Network Empowers LLMs with Video Understanding
CVPR 2025
SkySense-O: Towards Open-World Remote Sensing Interpretation with Vision-Centric Visual-Language Modeling
CVPR 2025
Unified Video Generation via Next-Set Prediction in Continuous Domain
ICCV 2025
CasP: Improving Semi-Dense Feature Matching Pipeline Leveraging Cascaded Correspondence Priors for Guidance
ICCV 2025
Engage for All: Making Ordinary Image Descriptions Appealing Again!
ICCV 2025
GAP: a Global Adaptive Pruning Method for Large Language Models
EMNLP 2025
FedSaaS: Class-Consistency Federated Semantic Segmentation via Global Prototype Supervision and Local Adversarial Harmonization
IJCAI 2025
BMIP: Bi-directional Modality Interaction Prompt Learning for VLM
IJCAI 2025
Social Debiasing for Fair Multi-modal LLMs
ICCV 2025
Animate-X: Universal Character Image Animation with Enhanced Motion Representation
ICLR 2025
VCSearch: Bridging the Gap Between Well-Defined and Ill-Defined Problems in Mathematical Reasoning
EMNLP 2025
HomoMatcher: Achieving Dense Feature Matching with Semi-Dense Efficiency by Homography Estimation
AAAI 2025
VQAGuider: Guiding Multimodal Large Language Models to Answer Complex Video Questions
ACL 2025
MotionStone: Decoupled Motion Intensity Modulation with Diffusion Transformer for Image-to-Video Generation
CVPR 2025
SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories
CVPR 2025
Mimir: Improving Video Diffusion Models for Precise Text Understanding
CVPR 2025
POA: Pre-training Once for Models of All Sizes
ECCV 2024
EcoMatcher: Efficient Clustering Oriented Matcher for Detector-free Image Matching
ECCV 2024
Referencing Where to Focus: Improving Visual Grounding with Referential Query
NIPS 2024
Accelerating Pre-training of Multimodal LLMs via Chain-of-Sight
NIPS 2024
CSPG: Crossing Sparse Proximity Graphs for Approximate Nearest Neighbor Search
NIPS 2024
WSSADN: A Weakly Supervised Spherical Age-Disentanglement Network for Detecting Developmental Disorders with Structural MRI
MICCAI 2024
EVE: Efficient Zero-Shot Text-Based Video Editing With Depth Map Guidance and Temporal Consistency Constraints
IJCAI 2024
DeCoOp: Robust Prompt Tuning with Out-of-Distribution Detection
ICML 2024
Stability and Generalization of Stochastic Compositional Gradient Descent Algorithms
ICML 2024
SyCoCa: Symmetrizing Contrastive Captioners with Attentive Masking for Multimodal Alignment
ICML 2024
Towards Better Vision-Inspired Vision-Language Models
CVPR 2024
Learning Dynamic Tetrahedra for High-Quality Talking Head Synthesis
CVPR 2024
Pink: Unveiling the Power of Referential Comprehension for Multi-modal LLMs
CVPR 2024
SkySense: A Multi-Modal Remote Sensing Foundation Model Towards Universal Interpretation for Earth Observation Imagery
CVPR 2024
StyleTokenizer: Defining Image Style by a Single Instance for Controlling Diffusion Models
ECCV 2024
Orthogonal Non-negative Tensor Factorization based Multi-view Clustering
NIPS 2023
Efficient Potential-based Exploration in Reinforcement Learning using Inverse Dynamic Bisimulation Metric
NIPS 2023
Centerless Multi-View K-means Based on the Adjacency Matrix
AAAI 2023
High-Level Semantic Feature Matters Few-Shot Unsupervised Domain Adaptation
AAAI 2023
Long-Range Graph U-Nets: Node and Edge Clustering Pooling Model For Stroke Classification in Online Handwritten Documents
ACML 2023
Momentum Accelerates the Convergence of Stochastic AUPRC Maximization
AISTATS 2022
Towards Accurate Facial Motion Retargeting with Identity-Consistent and Expression-Exclusive Constraints
AAAI 2022
Group-based Interleaved Pipeline Parallelism for Large-scale DNN Training
ICLR 2022
Stacked Homography Transformations for Multi-View Pedestrian Detection
ICCV 2021
Recall and Learn: A Memory-augmented Solver for Math Word Problems
EMNLP 2021
Back-Tracing Representative Points for Voting-Based 3D Object Detection in Point Clouds
CVPR 2021
Robust Knowledge Transfer via Hybrid Forward on the Teacher-Student Model
AAAI 2021
Track To Detect and Segment: An Online Multi-Object Tracker
CVPR 2021
Toward A Thousand Lights: Decentralized Deep Reinforcement Learning for Large-Scale Traffic Signal Control
AAAI 2020
Temporal-Context Enhanced Detection of Heavily Occluded Pedestrians
CVPR 2020
Robust Document Distance with Wasserstein-Fisher-Rao metric
ACML 2020
Bi-Directional Cascade Network for Perceptual Edge Detection
CVPR 2019
SSAP: Single-Shot Instance Segmentation With Affinity Pyramid
ICCV 2019
Discriminative Feature Transformation for Occluded Pedestrian Detection
ICCV 2019
Resolution-invariant Person Re-Identification
IJCAI 2019
Deep Reinforcement Learning with Iterative Shift for Visual Tracking
ECCV 2018
Feature Integration with Adaptive Importance Maps for Visual Tracking
IJCAI 2018
Image Blind Denoising With Generative Adversarial Network Based Noise Modeling
CVPR 2018
Conditional Generative Adversarial Network for Structured Domain Adaptation
CVPR 2018
Deep Correlation Structure Preserved Label Space Embedding for Multi-label Classification
ACML 2018
Instance-level Human Parsing via Part Grouping Network
ECCV 2018
BSN: Boundary Sensitive Network for Temporal Action Proposal Generation
ECCV 2018
Multi-View Learning with Limited and Noisy Tagging
IJCAI 2016
Web-Scale Training for Face Identification
CVPR 2015
DeepFace: Closing the Gap to Human-Level Performance in Face Verification
CVPR 2014
Regionlets for Generic Object Detection
ICCV 2013
Semantic-Aware Co-indexing for Image Retrieval
ICCV 2013
Multi-Task Learning with Gaussian Matrix Generalized Inverse Gaussian Model
ICML 2013
Collaborative Active Learning of a Kernel Machine Ensemble for Recognition
ICCV 2013