Sihang Cai
4 papers · 2025–2026 · 4 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+1 more ↓ Show less ↑
π Conference Polyglot (3) π Renaissance Researcher (7) π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (20) π§ Keyword Pioneer
π
Cross-Pollinator
(15)
Conferences
AAAI (1)
CVPR (1)
EMNLP (1)
NAACL (1)
Top co-authors
Keywords
multimodal large language model
(2)
domain generalization
(1)
video segmentation
(1)
image retrieval
(1)
mathematical reasoning
(1)
chain-of-thought reasoning
(1)
text generation
(1)
multi-modal learning
(1)
video understanding
(1)
visual reasoning
(1)
text-to-image generation
(1)
action localization
(1)
spatiotemporal modeling
(1)
multi-turn dialogue
(1)
vision encoder
(1)
pseudo-label generation
(1)
cross-attention map
(1)
attribute binding
(1)
feature normalization
(1)
image-text matching
(1)
Papers
Scene-Aware Spatiotemporal Generalization: Towards Robust Temporal Action Detection Across Domains
AAAI 2026
Towards Transformer-Based Aligned Generation with Self-Coherence Guidance
CVPR 2025
Chat-Driven Text Generation and Interaction for Person Retrieval
EMNLP 2025
Omni-Chart-600K: A Comprehensive Dataset of Chart Types for Chart Understanding
NAACL 2025