Ming Yang

72 papers · 2013–2026 · 13 top CS/AI conferences

Achievements


🐣 Hot Topic Early Bird 🧭 Keyword Pioneer 🌈 Renaissance Researcher (5) 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (13)

🏃 Academic Marathon (12) 🤝 Dynamic Duo (14) 🏆 Grand Slam 🧬 Topic Evolution 🔥 Unstoppable (8) 🚀 Conference Pioneer 📈 Trend Setter 🗃️ Keyword Collector (328) ⚡ Prolific Year (5) 💎 Century Club (66)

Conferences

CVPR (18) AAAI (12) ICCV (10) ECCV (6) IJCAI (6) NIPS (5) ICML (4) ACML (3) EMNLP (3) ICLR (2) ACL (1) AISTATS (1) MICCAI (1)

Top co-authors

Jingdong Chen (15) Qingpei Guo (11) Shiliang Zhang (6) Lei Yu (6) Junsong Yuan (6) Jian Wang (6) Yingying Zhang (5) Lixiang Ru (5) Biao Gong (5) Liheng Zhong (4)

Research topics

Architectures (1)

Keywords

object detection (6) diffusion model (6) multimodal large language model (5) vision-language model (5) convolutional neural network (5) semantic segmentation (4) transfer learning (4) multimodal learning (3) video understanding (3) large language model (3) multi-modal learning (3) multi-view clustering (3) pedestrian detection (3) active learning (2) zero-shot learning (2) reinforcement learning (2) attention mechanism (2) image captioning (2) contrastive learning (2) model compression (2)

Papers

Distributional Priors Guided Diffusion for Generating 3D Molecules in Low Data Regimes · AAAI 2026
Learning Diffusion Policy from Primitive Skills for Robot Manipulation · AAAI 2026
DSAP: Enhancing Generalization in Goal-Conditioned Reinforcement Learning · AAAI 2026
Unified View Extraction with Low-Rankness and Smoothness Fusion for Multi-View Subspace Clustering · AAAI 2026
SCAN: Self-Calibrated AutoregressioN for High-Quality Visual Generation · AAAI 2026
Tensorized Label Learning via Balanced Tensor Regression · AAAI 2026
Reversing Flow for Image Restoration · CVPR 2025
DynFocus: Dynamic Cooperative Network Empowers LLMs with Video Understanding · CVPR 2025
SkySense-O: Towards Open-World Remote Sensing Interpretation with Vision-Centric Visual-Language Modeling · CVPR 2025
Unified Video Generation via Next-Set Prediction in Continuous Domain · ICCV 2025
CasP: Improving Semi-Dense Feature Matching Pipeline Leveraging Cascaded Correspondence Priors for Guidance · ICCV 2025
Engage for All: Making Ordinary Image Descriptions Appealing Again! · ICCV 2025
GAP: a Global Adaptive Pruning Method for Large Language Models · EMNLP 2025
FedSaaS: Class-Consistency Federated Semantic Segmentation via Global Prototype Supervision and Local Adversarial Harmonization · IJCAI 2025
BMIP: Bi-directional Modality Interaction Prompt Learning for VLM · IJCAI 2025
Social Debiasing for Fair Multi-modal LLMs · ICCV 2025
Animate-X: Universal Character Image Animation with Enhanced Motion Representation · ICLR 2025
VCSearch: Bridging the Gap Between Well-Defined and Ill-Defined Problems in Mathematical Reasoning · EMNLP 2025
HomoMatcher: Achieving Dense Feature Matching with Semi-Dense Efficiency by Homography Estimation · AAAI 2025
VQAGuider: Guiding Multimodal Large Language Models to Answer Complex Video Questions · ACL 2025
MotionStone: Decoupled Motion Intensity Modulation with Diffusion Transformer for Image-to-Video Generation · CVPR 2025
SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories · CVPR 2025
Mimir: Improving Video Diffusion Models for Precise Text Understanding · CVPR 2025
POA: Pre-training Once for Models of All Sizes · ECCV 2024
EcoMatcher: Efficient Clustering Oriented Matcher for Detector-free Image Matching · ECCV 2024
Referencing Where to Focus: Improving Visual Grounding with Referential Query · NIPS 2024
Accelerating Pre-training of Multimodal LLMs via Chain-of-Sight · NIPS 2024
CSPG: Crossing Sparse Proximity Graphs for Approximate Nearest Neighbor Search · NIPS 2024
WSSADN: A Weakly Supervised Spherical Age-Disentanglement Network for Detecting Developmental Disorders with Structural MRI · MICCAI 2024
EVE: Efficient Zero-Shot Text-Based Video Editing With Depth Map Guidance and Temporal Consistency Constraints · IJCAI 2024
DeCoOp: Robust Prompt Tuning with Out-of-Distribution Detection · ICML 2024
Stability and Generalization of Stochastic Compositional Gradient Descent Algorithms · ICML 2024
SyCoCa: Symmetrizing Contrastive Captioners with Attentive Masking for Multimodal Alignment · ICML 2024
Towards Better Vision-Inspired Vision-Language Models · CVPR 2024
Learning Dynamic Tetrahedra for High-Quality Talking Head Synthesis · CVPR 2024
Pink: Unveiling the Power of Referential Comprehension for Multi-modal LLMs · CVPR 2024
SkySense: A Multi-Modal Remote Sensing Foundation Model Towards Universal Interpretation for Earth Observation Imagery · CVPR 2024
StyleTokenizer: Defining Image Style by a Single Instance for Controlling Diffusion Models · ECCV 2024
Orthogonal Non-negative Tensor Factorization based Multi-view Clustering · NIPS 2023
Efficient Potential-based Exploration in Reinforcement Learning using Inverse Dynamic Bisimulation Metric · NIPS 2023
Centerless Multi-View K-means Based on the Adjacency Matrix · AAAI 2023
High-Level Semantic Feature Matters Few-Shot Unsupervised Domain Adaptation · AAAI 2023
Long-Range Graph U-Nets: Node and Edge Clustering Pooling Model For Stroke Classification in Online Handwritten Documents · ACML 2023
Momentum Accelerates the Convergence of Stochastic AUPRC Maximization · AISTATS 2022
Towards Accurate Facial Motion Retargeting with Identity-Consistent and Expression-Exclusive Constraints · AAAI 2022
Group-based Interleaved Pipeline Parallelism for Large-scale DNN Training · ICLR 2022
Stacked Homography Transformations for Multi-View Pedestrian Detection · ICCV 2021
Recall and Learn: A Memory-augmented Solver for Math Word Problems · EMNLP 2021
Back-Tracing Representative Points for Voting-Based 3D Object Detection in Point Clouds · CVPR 2021
Robust Knowledge Transfer via Hybrid Forward on the Teacher-Student Model · AAAI 2021
Track To Detect and Segment: An Online Multi-Object Tracker · CVPR 2021
Toward A Thousand Lights: Decentralized Deep Reinforcement Learning for Large-Scale Traffic Signal Control · AAAI 2020
Temporal-Context Enhanced Detection of Heavily Occluded Pedestrians · CVPR 2020
Robust Document Distance with Wasserstein-Fisher-Rao metric · ACML 2020
Bi-Directional Cascade Network for Perceptual Edge Detection · CVPR 2019
SSAP: Single-Shot Instance Segmentation With Affinity Pyramid · ICCV 2019
Discriminative Feature Transformation for Occluded Pedestrian Detection · ICCV 2019
Resolution-invariant Person Re-Identification · IJCAI 2019
Deep Reinforcement Learning with Iterative Shift for Visual Tracking · ECCV 2018
Feature Integration with Adaptive Importance Maps for Visual Tracking · IJCAI 2018
Image Blind Denoising With Generative Adversarial Network Based Noise Modeling · CVPR 2018
Conditional Generative Adversarial Network for Structured Domain Adaptation · CVPR 2018
Deep Correlation Structure Preserved Label Space Embedding for Multi-label Classification · ACML 2018
Instance-level Human Parsing via Part Grouping Network · ECCV 2018
BSN: Boundary Sensitive Network for Temporal Action Proposal Generation · ECCV 2018
Multi-View Learning with Limited and Noisy Tagging · IJCAI 2016
Web-Scale Training for Face Identification · CVPR 2015
DeepFace: Closing the Gap to Human-Level Performance in Face Verification · CVPR 2014
Regionlets for Generic Object Detection · ICCV 2013
Semantic-Aware Co-indexing for Image Retrieval · ICCV 2013
Multi-Task Learning with Gaussian Matrix Generalized Inverse Gaussian Model · ICML 2013
Collaborative Active Learning of a Kernel Machine Ensemble for Recognition · ICCV 2013