Huaibo Huang
30 papers · 2017–2026 · 9 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+9 more ↓ Show less ↑
π Academic Marathon (8) π Conference Polyglot (9) π Interdisciplinary Bridge π§ Keyword Pioneer π Cross-Pollinator (9)
π
Renaissance Researcher
(6)
πΊοΈ
Taxonomy Completionist
(55)
π
Interdisciplinary Bridge
π€
Dynamic Duo
(25)
β‘
Prolific Year
(9)
π
Conference Pioneer
ποΈ
Keyword Collector
(131)
π₯
Unstoppable
(9)
π
Century Club
(29)
Conferences
CVPR (8)
NIPS (8)
ICCV (4)
AAAI (3)
ECCV (3)
ACL (1)
EMNLP (1)
ICLR (1)
IJCAI (1)
Top co-authors
Keywords
vision transformer
(7)
multimodal learning
(5)
image classification
(5)
domain adaptation
(4)
semantic segmentation
(3)
diffusion model
(3)
vision-language model
(3)
neural network
(2)
large language model
(2)
heterogeneous face recognition
(2)
image generation
(2)
variational inference
(2)
variational autoencoder
(2)
latent space
(2)
efficient computing
(2)
self-supervised learning
(2)
image restoration
(2)
disentangled representation
(2)
linear attention
(2)
transfer learning
(1)
Papers
T2Agent: A Tool-augmented Multimodal Misinformation Detection Agent with Monte Carlo Tree Search
AAAI 2026
MMFakeBench: A Mixed-Source Multimodal Misinformation Detection Benchmark for LVLMs
ICLR 2025
Semantic Equitable Clustering: A Simple and Effective Strategy for Clustering Vision Tokens
ICCV 2025
InfiMM-WebMath-40B: Advancing Multimodal Pre-Training for Enhanced Mathematical Reasoning
EMNLP 2025
Rectifying Magnitude Neglect in Linear Attention
ICCV 2025
Breaking the Low-Rank Dilemma of Linear Attention
CVPR 2025
Multimodal Prompt Perceiver: Empower Adaptiveness Generalizability and Fidelity for All-in-One Image Restoration
CVPR 2024
Visual Anchors Are Strong Information Aggregators For Multimodal Large Language Model
NIPS 2024
DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation
NIPS 2024
Hallo3D: Multi-Modal Hallucination Detection and Mitigation for Consistent 3D Content Generation
NIPS 2024
Heterogeneous Test-Time Training for Multi-Modal Person Re-identification
AAAI 2024
DeVAn: Dense Video Annotation for Video-Language Models
ACL 2024
RMT: Retentive Networks Meet Vision Transformers
CVPR 2024
Uncertainty-Aware Source-Free Adaptive Image Super-Resolution with Wavelet Augmentation Transformer
CVPR 2024
InstaStyle: Inversion Noise of a Stylized Image is Secretly a Style Adviser
ECCV 2024
Lightweight Vision Transformer with Bidirectional Interaction
NIPS 2023
Pluralistic Aging Diffusion Autoencoder
ICCV 2023
Learning-to-Rank Meets Language: Boosting Language-Driven Ordering Alignment for Ordinal Classification
NIPS 2023
Orthogonal Transformer: An Efficient Vision Transformer Backbone with Token Orthogonalization
NIPS 2022
Artistic Style Discovery With Independent Components
CVPR 2022
Rethinking Image Cropping: Exploring Diverse Compositions From Global Views
CVPR 2022
Information Bottleneck Disentanglement for Identity Swapping
CVPR 2021
Memory Oriented Transfer Learning for Semi-Supervised Image Deraining
CVPR 2021
Hierarchical Face Aging through Disentangled Latent Characteristics
ECCV 2020
Arbitrary Talking Face Generation via Attentional Audio-Visual Coherence Learning
IJCAI 2020
Informative Sample Mining Network for Multi-Domain Image-to-Image Translation
ECCV 2020
Disentangled Variational Representation for Heterogeneous Face Recognition
AAAI 2019
Dual Variational Generation for Low Shot Heterogeneous Face Recognition
NIPS 2019
IntroVAE: Introspective Variational Autoencoders for Photographic Image Synthesis
NIPS 2018
Wavelet-SRNet: A Wavelet-Based CNN for Multi-Scale Face Super Resolution
ICCV 2017