Qingming Huang
122 papers · 2013–2026 · 11 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+16 more ↓ Show less ↑
🗺️ Taxonomy Completionist (14) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (5) 🐣 Hot Topic Early Bird
🧭
Keyword Pioneer
🐣
Hot Topic Early Bird
🏃
Academic Marathon
(12)
🏠
Conference Loyalist
(20)
🔬
Deep Specialist
(17)
🏆
Grand Slam
🏆
Keyword Champion
(2)
🤝
Dynamic Duo
(48)
👑
Triple Crown
🗃️
Keyword Collector
(444)
❓
The Questioner
⚡
Prolific Year
(15)
🚀
Conference Pioneer
💎
Century Club
(117)
🔥
Unstoppable
(11)
📈
Trend Setter
Conferences
CVPR (30)
AAAI (24)
NIPS (19)
ICML (17)
ICCV (11)
ECCV (7)
IJCAI (7)
ACL (3)
ICLR (2)
EACL (1)
EMNLP (1)
Top co-authors
Research topics
Keywords
representation learning
(8)
convolutional neural network
(8)
domain adaptation
(7)
diffusion model
(6)
semantic segmentation
(6)
auc optimization
(6)
graph neural network
(5)
salient object detection
(5)
image generation
(5)
contrastive learning
(5)
deep learning
(5)
feature learning
(5)
transfer learning
(4)
attention mechanism
(4)
weakly supervised learning
(4)
adversarial learning
(4)
message passing
(4)
link prediction
(4)
multimodal learning
(3)
binary classification
(3)
Papers
The Devil is in the Distributions: Explicit Modeling of Scene Content is Key in Zero-Shot Video Captioning
EACL 2026
TuckA: Hierarchical Compact Tensor Experts for Efficient Fine-Tuning
AAAI 2026
Quantifying the Potential to Escape Filter Bubbles: A Behavior-Aware Measure via Contrastive Simulation
AAAI 2026
HiGFA: Hierarchical Guidance for Fine-grained Data Augmentation with Diffusion Models
AAAI 2026
DMGINE: Day-Memory Guided Nighttime Image Enhancement for Dynamic Traffic Scenes
AAAI 2026
Divide and Conquer: Heterogeneous Noise Integration for Diffusion-based Adversarial Purification
CVPR 2025
Image-to-video Adaptation with Outlier Modeling and Robust Self-learning
AAAI 2025
Change Entity-guided Heterogeneous Representation Disentangling for Change Captioning
ACL 2025
Dis²Booth: Learning Image Distribution with Disentangled Features for Text-to-Image Diffusion Models
AAAI 2025
Query-centric Audio-Visual Cognition Network for Moment Retrieval, Segmentation and Step-Captioning
AAAI 2025
Bidirectional Logits Tree: Pursuing Granularity Reconcilement in Fine-Grained Classification
AAAI 2025
SSE-SAM: Balancing Head and Tail Classes Gradually Through Stage-Wise SAM
AAAI 2025
Separation of Powers: On Segregating Knowledge from Observation in LLM-enabled Knowledge-based Visual Question Answering
CVPR 2025
EmoDubber: Towards High Quality and Emotion Controllable Movie Dubbing
CVPR 2025
When the Future Becomes the Past: Taming Temporal Correspondence for Self-supervised Video Representation Learning
CVPR 2025
Cannot See the Forest for the Trees: Invoking Heuristics and Biases to Elicit Irrational Choices of LLMs
ICML 2025
ABKD: Pursuing a Proper Allocation of the Probability Mass in Knowledge Distillation via $α$-$β$-Divergence
ICML 2025
MixBridge: Heterogeneous Image-to-Image Backdoor Attack through Mixture of Schrödinger Bridges
ICML 2025
Focal-SAM: Focal Sharpness-Aware Minimization for Long-Tailed Classification
ICML 2025
Diffusion-based Adversarial Purification from the Perspective of the Frequency Domain
ICML 2025
One Image is Worth a Thousand Words: A Usability Preservable Text-Image Collaborative Erasing Framework
ICML 2025
OpenworldAUC: Towards Unified Evaluation and Optimization for Open-world Prompt Tuning
ICML 2025
Enhancing Pre-trained Representation Classifiability can Boost its Interpretability
ICLR 2025
Video Language Model Pretraining with Spatio-temporal Masking
CVPR 2025
R&B: Region and Boundary Aware Zero-shot Grounded Text-to-image Generation
ICLR 2024
Prompt-Enhanced Multiple Instance Learning for Weakly Supervised Video Anomaly Detection
CVPR 2024
Weakly Supervised Video Individual Counting
CVPR 2024
Suppress Content Shift: Better Diffusion Features via Off-the-Shelf Generation Techniques
NIPS 2024
Not All Diffusion Model Activations Have Been Evaluated as Discriminative Features
NIPS 2024
Expanding Sparse Tuning for Low Memory Usage
NIPS 2024
Towards Dynamic Message Passing on Graphs
NIPS 2024
Leveraging Catastrophic Forgetting to Develop Safe Diffusion Models against Malicious Finetuning
NIPS 2024
AUCSeg: AUC-oriented Pixel-level Long-tail Semantic Segmentation
NIPS 2024
Harnessing Hierarchical Label Distribution Variations in Test Agnostic Long-tail Recognition
ICML 2024
Data-free Neural Representation Compression with Riemannian Neural Dynamics
ICML 2024
Modeling Language Tokens as Functionals of Semantic Fields
ICML 2024
Bias-Conflict Sample Synthesis and Adversarial Removal Debias Strategy for Temporal Sentence Grounding in Video
AAAI 2024
ADA-GAD: Anomaly-Denoised Autoencoders for Graph Anomaly Detection
AAAI 2024
Context-aware Difference Distilling for Multi-change Captioning
ACL 2024
StyleDubber: Towards Multi-Scale Style Learning for Movie Dubbing
ACL 2024
ESNet: Evolution and Succession Network for High-Resolution Salient Object Detection
ICML 2024
Size-invariance Matters: Rethinking Metrics and Losses for Imbalanced Multi-object Salient Object Detection
ICML 2024
ReconBoost: Boosting Can Achieve Modality Reconcilement
ICML 2024
Distractors-Immune Representation Learning with Cross-modal Contrastive Regularization for Change Captioning
ECCV 2024
DRAUC: An Instance-wise Distributionally Robust AUC Optimization Framework
NIPS 2023
All in a Row: Compressed Convolution Networks for Graphs
ICML 2023
Feature Directions Matter: Long-Tailed Learning via Rotated Balanced Representation
ICML 2023
Learning To Dub Movies via Hierarchical Prosody Models
CVPR 2023
Exploiting Completeness and Uncertainty of Pseudo Labels for Weakly Supervised Video Anomaly Detection
CVPR 2023
Towards Decision-Friendly AUC: Learning Multi-Classifier with AUCµ
AAAI 2023
Text-Driven Generative Domain Adaptation with Spectral Consistency Regularization
ICCV 2023
Building Bridge Across the Time: Disruption and Restoration of Murals In the Wild
ICCV 2023
A Unified Generalization Analysis of Re-Weighting and Logit-Adjustment for Imbalanced Learning
NIPS 2023
Self-supervised Cross-view Representation Reconstruction for Change Captioning
ICCV 2023
Weighted ROC Curve in Cost Space: Extending AUC to Cost-Sensitive Learning
NIPS 2023
Geometry Interaction Knowledge Graph Embeddings
AAAI 2022
The Minority Matters: A Diversity-Promoting Collaborative Metric Learning Algorithm
NIPS 2022
OpenAUC: Towards AUC-Oriented Open-Set Recognition
NIPS 2022
Exploring the Algorithm-Dependent Generalization of AUPRC Optimization with List Stability
NIPS 2022
Asymptotically Unbiased Instance-wise Regularized Partial AUC Optimization: Theory and Algorithm
NIPS 2022
OTKGE: Multi-modal Knowledge Graph Embeddings via Optimal Transport
NIPS 2022
ER: Equivariance Regularizer for Knowledge Graph Completion
AAAI 2022
Few Shot Generative Model Adaption via Relaxed Spatial Structural Alignment
CVPR 2022
Automatic Relation-Aware Graph Network Proliferation
CVPR 2022
Hierarchical Modular Network for Video Captioning
CVPR 2022
Dist-PU: Positive-Unlabeled Learning From a Label Distribution Perspective
CVPR 2022
Attribute Group Editing for Reliable Few-Shot Image Generation
CVPR 2022
Learning Linguistic Association towards Efficient Text-Video Retrieval
ECCV 2022
Think Beyond Words: Exploring Context-Relevant Visual Commonsense for Diverse Dialogue Generation
EMNLP 2022
AdAUC: End-to-end Adversarial AUC Optimization Against Long-tail Problems
ICML 2022
Quaternion Ordinal Embedding
IJCAI 2022
A Sparse-Motif Ensemble Graph Convolutional Network against Over-smoothing
IJCAI 2022
Greedy Gradient Ensemble for Robust Visual Question Answering
ICCV 2021
Dual Quaternion Knowledge Graph Embeddings
AAAI 2021
Seeking the Shape of Sound: An Adaptive Framework for Learning Voice-Face Association
CVPR 2021
What to Select: Pursuing Consistent Motion Segmentation from Multiple Geometric Models
AAAI 2021
Deep Partial Rank Aggregation for Personalized Attributes
AAAI 2021
When False Positive is Intolerant: End-to-End Optimization with Low FPR for Multipartite Ranking
NIPS 2021
When All We Need is a Piece of the Pie: A Generic Framework for Optimizing Two-way Partial AUC
ICML 2021
Nearest Neighbor Classifier Embedded Network for Active Learning
AAAI 2021
Exploiting Sample Correlation for Crowd Counting With Multi-Expert Network
ICCV 2021
Rethinking Graph Neural Architecture Search From Message-Passing
CVPR 2021
Towards Discriminability and Diversity: Batch Nuclear-Norm Maximization Under Label Insufficient Situations
CVPR 2020
Parsing-Based View-Aware Embedding Network for Vehicle Re-Identification
CVPR 2020
State-Relabeling Adversarial Active Learning
CVPR 2020
Heuristic Domain Adaptation
NIPS 2020
Gradually Vanishing Bridge for Adversarial Domain Adaptation
CVPR 2020
Weakly-Supervised Crowd Counting Learns from Sorting rather than Locations
ECCV 2020
Who Likes What? — SplitLBI in Exploring Preferential Diversity of Ratings
AAAI 2020
Global Context-Aware Progressive Aggregation Network for Salient Object Detection
AAAI 2020
Label Decoupling Framework for Salient Object Detection
CVPR 2020
Release the Power of Online-Training for Robust Visual Tracking
AAAI 2020
F³Net: Fusion, Feedback and Focus for Salient Object Detection
AAAI 2020
Corner Proposal Network for Anchor-free, Two-stage Object Detection
ECCV 2020
A Structured Latent Variable Recurrent Network With Stochastic Attention For Generating Weibo Comments
IJCAI 2020
Interpretable Visual Reasoning via Probabilistic Formulation under Natural Supervision
ECCV 2020
Reverse Perspective Network for Perspective-Aware Object Counting
CVPR 2020
DM2C: Deep Mixed-Modal Clustering
NIPS 2019
Learning Personalized Attribute Preference via Multi-Task AUC Optimization
AAAI 2019
Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding
ICCV 2019
CenterNet: Keypoint Triplets for Object Detection
ICCV 2019
Stacked Cross Refinement Network for Edge-Aware Salient Object Detection
ICCV 2019
Deep Robust Subjective Visual Property Prediction in Crowdsourcing
CVPR 2019
Cascaded Partial Decoder for Fast and Accurate Salient Object Detection
CVPR 2019
Spatiotemporal CNN for Video Object Segmentation
CVPR 2019
Unsupervised Open Domain Recognition by Semantic Discrepancy Minimization
CVPR 2019
Generalized Block-Diagonal Structure Pursuit: Learning Soft Latent Task Assignment against Negative Transfer
NIPS 2019
Learning Attribute-Specific Representations for Visual Tracking
AAAI 2019
iSplit LBI: Individualized Partial Ranking with Ties via Split LBI
NIPS 2019
Less is More: Picking Informative Frames for Video Captioning
ECCV 2018
Affective Image Content Analysis: A Comprehensive Survey
IJCAI 2018
The Unmanned Aerial Vehicle Benchmark: Object Detection and Tracking
ECCV 2018
Dependency Exploitation: A Unified CNN-RNN Approach for Visual Emotion Recognition
IJCAI 2017
Online Asymmetric Similarity Learning for Cross-Modal Retrieval
CVPR 2017
A Graph Regularized Deep Neural Network for Unsupervised Image Representation Learning
CVPR 2017
Multimodal Gaussian Process Latent Variable Models With Harmonization
ICCV 2017
Adaptively Unified Semi-supervised Learning for Cross-Modal Retrieval
IJCAI 2017
Hedged Deep Tracking
CVPR 2016
Adaptive Sharing for Image Classification
IJCAI 2015
Similarity Gaussian Process Latent Variable Model for Multi-Modal Data Analysis
ICCV 2015
Multi-level Discriminative Dictionary Learning towards Hierarchical Visual Categorization
CVPR 2013
Semantically-Based Human Scanpath Estimation with HMMs
ICCV 2013