Hongfa Wang
12 papers · 2020–2026 · 5 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+7 more ↓ Show less ↑
π Interdisciplinary Bridge π Conference Polyglot (5) π Academic Marathon (5) π Renaissance Researcher (6) πΊοΈ Taxonomy Completionist (28)
π
Interdisciplinary Bridge
π£
Hot Topic Early Bird
πΊοΈ
Taxonomy Completionist
(28)
π§¬
Topic Evolution
ποΈ
Keyword Collector
(56)
β‘
Prolific Year
(6)
π
Century Club
(11)
Conferences
AAAI (4)
CVPR (3)
ICCV (3)
ICLR (1)
ICML (1)
Top co-authors
Keywords
multimodal learning
(2)
vision-language pre-training
(2)
text detection
(2)
vision-language model
(2)
video generation
(2)
semantic segmentation
(2)
image-text retrieval
(2)
graph convolutional network
(2)
question answering
(1)
visual grounding
(1)
uncertainty modeling
(1)
relational reasoning
(1)
boundary detection
(1)
gaussian mixture model
(1)
hierarchical clustering
(1)
referring expression
(1)
object detection
(1)
cross-modal alignment
(1)
multi-modal large language model
(1)
recurrent neural network
(1)
Papers
VCapsBench: A Large-scale Fine-grained Benchmark for Video Caption Quality Evaluation
AAAI 2026
Infinite-Canvas: Higher-Resolution Video Outpainting with Extensive Content Generation
AAAI 2025
Follow-Your-Click: Open-domain Regional Image Animation via Motion Prompts
AAAI 2025
Towards Multiple Character Image Animation Through Enhancing Implicit Decoupling
ICLR 2025
CoHD: A Counting-Aware Hierarchical Decoding Framework for Generalized Referring Expression Segmentation
ICCV 2025
InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language Models
ICCV 2025
Efficient Quantification of Multimodal Interaction at Sample Level
ICML 2025
Seeing What You Miss: Vision-Language Pre-Training With Semantic Completion Learning
CVPR 2023
MAP: Multimodal Uncertainty-Aware Vision-Language Pre-Training Model
CVPR 2023
Deep Unsupervised Hashing with Latent Semantic Components
AAAI 2022
Adaptive Boundary Proposal Network for Arbitrary Shape Text Detection
ICCV 2021
Deep Relational Reasoning Graph Network for Arbitrary Shape Text Detection
CVPR 2020