Haoyu Cao
9 papers · 2022–2026 · 6 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+3 more ↓ Show less ↑
π Cross-Pollinator (15) π§ Keyword Pioneer π Conference Polyglot (5) π Renaissance Researcher (6) π Interdisciplinary Bridge
πΊοΈ
Taxonomy Completionist
(23)
π₯
Unstoppable
(5)
ποΈ
Keyword Collector
(51)
Conferences
AAAI (2)
CVPR (2)
ICCV (2)
ACL (1)
COLING (1)
NAACL (1)
Top co-authors
Keywords
multimodal learning
(3)
multimodal large language model
(2)
visual document understanding
(2)
multi-modal learning
(2)
generative model
(2)
text generation
(1)
speech processing
(1)
reinforcement learning
(1)
document information extraction
(1)
embedding learning
(1)
wavelet transform
(1)
instruction tuning
(1)
model pruning
(1)
bayesian optimization
(1)
vision-language model
(1)
document understanding
(1)
diffusion model
(1)
curriculum learning
(1)
embedding alignment
(1)
contrastive learning
(1)
Papers
Frequency-Aligned Cross-Modal Learning with Top-K Wavelet Fusion and Dynamic Expert Routing for Enhanced Retinal Disease Diagnosis
AAAI 2026
Multimodal Table Understanding with Difficulty-aware Reinforcement Learning
AAAI 2026
BASIC: Boosting Visual Alignment with Intrinsic Refined Embeddings in Multimodal Large Language Models
ICCV 2025
HRVDA: High-Resolution Visual Document Assistant
CVPR 2024
Talk With Human-like Agents: Empathetic Dialogue Through Perceptible Acoustic Reception and Reaction
ACL 2024
Few-shot Temporal Pruning Accelerates Diffusion Models for Text Generation
COLING 2024
Enhancing Visual Document Understanding with Contrastive Learning in Large Visual-Language Models
CVPR 2024
Attention Where It Matters: Rethinking Visual Document Understanding with Selective Region Concentration
ICCV 2023
GMN: Generative Multi-modal Network for Practical Document Information Extraction
NAACL 2022