Ming Hu
31 papers · 2023–2026 · 11 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+7 more ↓ Show less ↑
🐝 Cross-Pollinator (11) 🌍 Conference Polyglot (11) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌈 Renaissance Researcher (7)
🐝
Cross-Pollinator
(11)
🤝
Dynamic Duo
(16)
👥
Mega-Team
(21)
💎
Century Club
(29)
🚀
Conference Pioneer
⚡
Prolific Year
(21)
🗃️
Keyword Collector
(121)
Conferences
AAAI (7)
MICCAI (7)
CVPR (6)
ACL (2)
ICCV (2)
NIPS (2)
COLING (1)
ECCV (1)
ICML (1)
IJCAI (1)
WACV (1)
Top co-authors
Keywords
few-shot learning
(3)
vision language model
(3)
semi-supervised learning
(3)
image classification
(3)
visual question answering
(3)
semantic segmentation
(3)
attention mechanism
(3)
vision-language model
(3)
video understanding
(2)
multimodal large language model
(2)
data heterogeneity
(2)
self-supervised learning
(2)
medical image segmentation
(2)
medical imaging
(2)
multimodal learning
(2)
class imbalance
(2)
hierarchical classification
(1)
computer vision
(1)
action recognition
(1)
zero-shot learning
(1)
Papers
S2-UniSeg: Fast Universal Agglomerative Pooling for Scalable Segment Anything Without Supervision
AAAI 2026
GMAI-VL & GMAI-VL-5.5M: A Large Vision-Language Model and a Comprehensive Multimodal Dataset Towards General Medical AI
AAAI 2026
SlideChat: A Large Vision-Language Assistant for Whole-Slide Pathology Image Understanding
CVPR 2025
DONIS: Importance Sampling for Training Physics-Informed DeepONet
IJCAI 2025
MAKE: Multi-Aspect Knowledge-Enhanced Vision-Language Pretraining for Zero-shot Dermatological Assessment
MICCAI 2025
MSWAL: 3D Multi-class Segmentation of Whole Abdominal Lesions Dataset
MICCAI 2025
Ophora: A Large-Scale Data-Driven Text-Guided Ophthalmic Surgical Video Generation Model
MICCAI 2025
RetinaLogos: Fine-Grained Synthesis of High-Resolution Retinal Images Through Captions
MICCAI 2025
Robust Multimodal Learning for Ophthalmic Disease Grading via Disentangled Representation
MICCAI 2025
Temporal Model-Based Federated Active Medical Image Classification
MICCAI 2025
Local Masked Reconstruction for Efficient Self-Supervised Learning on High-Resolution Images
WACV 2025
MultiSFL: Towards Accurate Split Federated Learning via Multi-Model Aggregation and Knowledge Replay
AAAI 2025
Towards Realistic Semi-supervised Medical Image Classification
AAAI 2025
Neighbor Does Matter: Density-Aware Contrastive Learning for Medical Semi-supervised Segmentation
AAAI 2025
MMRC: A Large-Scale Benchmark for Understanding Multimodal Large Language Model in Real-World Conversation
ACL 2025
HGCLIP: Exploring Vision-Language Models with Graph Representations for Hierarchical Understanding
COLING 2025
Star with Bilinear Mapping
CVPR 2025
Towards Visual Discrimination and Reasoning of Real-World Physical Dynamics: Physics-Grounded Anomaly Detection
CVPR 2025
beta-FFT: Nonlinear Interpolation and Differentiated Training Strategies for Semi-Supervised Medical Image Segmentation
CVPR 2025
Seeing Far and Clearly: Mitigating Hallucinations in MLLMs with Attention Causal Decoding
CVPR 2025
OphCLIP: Hierarchical Retrieval-Augmented Learning for Ophthalmic Surgical Video-Language Pretraining
ICCV 2025
Derm1M: A Million-scale Vision-Language Dataset Aligned with Clinical Ontology Knowledge for Dermatology
ICCV 2025
One Arrow, Two Hawks: Sharpness-aware Minimization for Federated Learning via Global Model Trajectory
ICML 2025
Personalization as a Shortcut for Few-Shot Backdoor Attack against Text-to-Image Diffusion Models
AAAI 2024
OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding
ECCV 2024
FedMut: Generalized Federated Learning via Stochastic Mutation
AAAI 2024
Generalizing to Unseen Domains in Diabetic Retinopathy with Disentangled Representations
MICCAI 2024
LMPT: Prompt Tuning with Class-Specific Embedding Loss for Long-Tailed Multi-Label Visual Recognition
ACL 2024
SampDetox: Black-box Backdoor Defense via Perturbation-based Sample Detoxification
NIPS 2024
NurViD: A Large Expert-Level Video Database for Nursing Procedure Activity Understanding
NIPS 2023
MammalNet: A Large-Scale Video Benchmark for Mammal Recognition and Behavior Understanding
CVPR 2023