Ming Hu

31 papers · 2023–2026 · 11 top CS/AI conferences

Achievements

🐝 Cross-Pollinator (11) 🌍 Conference Polyglot (11) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌈 Renaissance Researcher (7)

🤝 Dynamic Duo (16) 👥 Mega-Team (21) 💎 Century Club (29) 🚀 Conference Pioneer ⚡ Prolific Year (21) 🗃️ Keyword Collector (121)

Conferences

AAAI (7) MICCAI (7) CVPR (6) ACL (2) ICCV (2) NIPS (2) COLING (1) ECCV (1) ICML (1) IJCAI (1) WACV (1)

Top co-authors

Zongyuan Ge (16) Feilong Tang (10) Junjun He (9) Peng Xia (7) Tianbin Li (5) Lie Ju (5) Zhongxing Xu (5) Jin Ye (5) Siyuan Yan (5) Imran Razzak (4)

Keywords

few-shot learning (3) vision language model (3) semi-supervised learning (3) image classification (3) visual question answering (3) semantic segmentation (3) attention mechanism (3) vision-language model (3) video understanding (2) multimodal large language model (2) data heterogeneity (2) self-supervised learning (2) medical image segmentation (2) medical imaging (2) multimodal learning (2) class imbalance (2) hierarchical classification (1) computer vision (1) action recognition (1) zero-shot learning (1)

Papers

S2-UniSeg: Fast Universal Agglomerative Pooling for Scalable Segment Anything Without Supervision · AAAI 2026
GMAI-VL & GMAI-VL-5.5M: A Large Vision-Language Model and a Comprehensive Multimodal Dataset Towards General Medical AI · AAAI 2026
SlideChat: A Large Vision-Language Assistant for Whole-Slide Pathology Image Understanding · CVPR 2025
DONIS: Importance Sampling for Training Physics-Informed DeepONet · IJCAI 2025
MAKE: Multi-Aspect Knowledge-Enhanced Vision-Language Pretraining for Zero-shot Dermatological Assessment · MICCAI 2025
MSWAL: 3D Multi-class Segmentation of Whole Abdominal Lesions Dataset · MICCAI 2025
Ophora: A Large-Scale Data-Driven Text-Guided Ophthalmic Surgical Video Generation Model · MICCAI 2025
RetinaLogos: Fine-Grained Synthesis of High-Resolution Retinal Images Through Captions · MICCAI 2025
Robust Multimodal Learning for Ophthalmic Disease Grading via Disentangled Representation · MICCAI 2025
Temporal Model-Based Federated Active Medical Image Classification · MICCAI 2025
Local Masked Reconstruction for Efficient Self-Supervised Learning on High-Resolution Images · WACV 2025
MultiSFL: Towards Accurate Split Federated Learning via Multi-Model Aggregation and Knowledge Replay · AAAI 2025
Towards Realistic Semi-supervised Medical Image Classification · AAAI 2025
Neighbor Does Matter: Density-Aware Contrastive Learning for Medical Semi-supervised Segmentation · AAAI 2025
MMRC: A Large-Scale Benchmark for Understanding Multimodal Large Language Model in Real-World Conversation · ACL 2025
HGCLIP: Exploring Vision-Language Models with Graph Representations for Hierarchical Understanding · COLING 2025
Star with Bilinear Mapping · CVPR 2025
Towards Visual Discrimination and Reasoning of Real-World Physical Dynamics: Physics-Grounded Anomaly Detection · CVPR 2025
beta-FFT: Nonlinear Interpolation and Differentiated Training Strategies for Semi-Supervised Medical Image Segmentation · CVPR 2025
Seeing Far and Clearly: Mitigating Hallucinations in MLLMs with Attention Causal Decoding · CVPR 2025
OphCLIP: Hierarchical Retrieval-Augmented Learning for Ophthalmic Surgical Video-Language Pretraining · ICCV 2025
Derm1M: A Million-scale Vision-Language Dataset Aligned with Clinical Ontology Knowledge for Dermatology · ICCV 2025
One Arrow, Two Hawks: Sharpness-aware Minimization for Federated Learning via Global Model Trajectory · ICML 2025
Personalization as a Shortcut for Few-Shot Backdoor Attack against Text-to-Image Diffusion Models · AAAI 2024
OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding · ECCV 2024
FedMut: Generalized Federated Learning via Stochastic Mutation · AAAI 2024
Generalizing to Unseen Domains in Diabetic Retinopathy with Disentangled Representations · MICCAI 2024
LMPT: Prompt Tuning with Class-Specific Embedding Loss for Long-Tailed Multi-Label Visual Recognition · ACL 2024
SampDetox: Black-box Backdoor Defense via Perturbation-based Sample Detoxification · NIPS 2024
NurViD: A Large Expert-Level Video Database for Nursing Procedure Activity Understanding · NIPS 2023
MammalNet: A Large-Scale Video Benchmark for Mammal Recognition and Behavior Understanding · CVPR 2023