Yicong Li

21 papers · 2022–2026 · 9 conferences · across top CS/AI conferences

Achievements

+10 more ↓

🗺️ Taxonomy Completionist (34) 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (6) 🌍 Conference Polyglot (9) 🧭 Keyword Pioneer

🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🏆 Keyword Champion (3) 🤝 Dynamic Duo (10) ❓ The Questioner ⚡ Prolific Year (8) 🚀 Conference Pioneer 💎 Century Club (19) 🗃️ Keyword Collector (85) 🔥 Unstoppable (5)

Conferences

CVPR (5) ICCV (5) AAAI (3) ICLR (2) MICCAI (2) ACL (1) EMNLP (1) IJCAI (1) NIPS (1)

Top co-authors

Junbin Xiao (10) Tat-Seng Chua (7) Angela Yao (7) Xiang Wang (5) Wei Ji (4) Han Fang (2) Wanhua Li (2) Na Zhao (2) Hanspeter Pfister (2) Zhiyi Shi (2)

Keywords

multimodal learning (6) video question answering (6) cross-modal interaction (3) video understanding (3) graph neural network (3) temporal grounding (2) self-supervised learning (2) visual question answering (2) affordance segmentation (2) domain generalization (2) visual grounding (2) vision-language model (2) egocentric vision (2) temporal reasoning (1) vision transformer (1) link prediction (1) 3d reconstruction (1) hierarchical learning (1) temporal dynamics (1) semantic segmentation (1)

Papers

AnchorDS: Anchoring Dynamic Sources for Semantically Consistent Text-to-3D Generation AAAI 2026 DRSoRec: Dual-Rectification of Social Networks for Recommendation AAAI 2026 Visual Intention Grounding for Egocentric Assistants ICCV 2025 EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering CVPR 2025 SynTag: Enhancing the Geometric Robustness of Inversion-based Generative Image Watermarking ICCV 2025 Intermediate Connectors and Geometric Priors for Language-Guided Affordance Segmentation on Unseen Object Categories ICCV 2025 Geometric Alignment and Prior Modulation for View-Guided Point Cloud Completion on Unseen Categories ICCV 2025 Factor Graph-based Interpretable Neural Networks ICLR 2025 Generalized Video Moment Retrieval ICLR 2025 MSCI: Addressing CLIP's Inherent Limitations for Compositional Zero-Shot Learning IJCAI 2025 Can I Trust Your Answer? Visually Grounded Video Question Answering CVPR 2024 Video-Language Understanding: A Survey from Model Architecture, Model Training, and Data Perspectives ACL 2024 Multimodal Learning for Embryo Viability Prediction in Clinical IVF MICCAI 2024 MoRA: LoRA Guided Multi-Modal Disease Diagnosis with Missing Modality MICCAI 2024 LASO: Language-guided Affordance Segmentation on 3D Object CVPR 2024 An Empirical Study Towards Prompt-Tuning for Graph Contrastive Pre-Training in Recommendations NIPS 2023 Discovering Spatio-Temporal Rationales for Video Question Answering ICCV 2023 Invariant Grounding for Video Question Answering CVPR 2022 Video as Conditional Graph Hierarchy for Multi-Granular Question Answering AAAI 2022 Video Question Answering: Datasets, Algorithms and Challenges EMNLP 2022 Scaling Vision Transformers to Gigapixel Images via Hierarchical Self-Supervised Learning CVPR 2022