Yicong Li
21 papers · 2022–2026 · 9 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+10 more ↓ Show less ↑
πΊοΈ Taxonomy Completionist (34) π Interdisciplinary Bridge π Renaissance Researcher (6) π Conference Polyglot (9) π§ Keyword Pioneer
π§
Keyword Pioneer
π£
Hot Topic Early Bird
π
Keyword Champion
(3)
π€
Dynamic Duo
(10)
β
The Questioner
β‘
Prolific Year
(8)
π
Conference Pioneer
π
Century Club
(19)
ποΈ
Keyword Collector
(85)
π₯
Unstoppable
(5)
Conferences
CVPR (5)
ICCV (5)
AAAI (3)
ICLR (2)
MICCAI (2)
ACL (1)
EMNLP (1)
IJCAI (1)
NIPS (1)
Top co-authors
Keywords
multimodal learning
(6)
video question answering
(6)
cross-modal interaction
(3)
video understanding
(3)
graph neural network
(3)
temporal grounding
(2)
self-supervised learning
(2)
visual question answering
(2)
affordance segmentation
(2)
domain generalization
(2)
visual grounding
(2)
vision-language model
(2)
egocentric vision
(2)
temporal reasoning
(1)
vision transformer
(1)
link prediction
(1)
3d reconstruction
(1)
hierarchical learning
(1)
temporal dynamics
(1)
semantic segmentation
(1)
Papers
AnchorDS: Anchoring Dynamic Sources for Semantically Consistent Text-to-3D Generation
AAAI 2026
DRSoRec: Dual-Rectification of Social Networks for Recommendation
AAAI 2026
Visual Intention Grounding for Egocentric Assistants
ICCV 2025
EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering
CVPR 2025
SynTag: Enhancing the Geometric Robustness of Inversion-based Generative Image Watermarking
ICCV 2025
Intermediate Connectors and Geometric Priors for Language-Guided Affordance Segmentation on Unseen Object Categories
ICCV 2025
Geometric Alignment and Prior Modulation for View-Guided Point Cloud Completion on Unseen Categories
ICCV 2025
Factor Graph-based Interpretable Neural Networks
ICLR 2025
Generalized Video Moment Retrieval
ICLR 2025
MSCI: Addressing CLIP's Inherent Limitations for Compositional Zero-Shot Learning
IJCAI 2025
Can I Trust Your Answer? Visually Grounded Video Question Answering
CVPR 2024
Video-Language Understanding: A Survey from Model Architecture, Model Training, and Data Perspectives
ACL 2024
Multimodal Learning for Embryo Viability Prediction in Clinical IVF
MICCAI 2024
MoRA: LoRA Guided Multi-Modal Disease Diagnosis with Missing Modality
MICCAI 2024
LASO: Language-guided Affordance Segmentation on 3D Object
CVPR 2024
An Empirical Study Towards Prompt-Tuning for Graph Contrastive Pre-Training in Recommendations
NIPS 2023
Discovering Spatio-Temporal Rationales for Video Question Answering
ICCV 2023
Invariant Grounding for Video Question Answering
CVPR 2022
Video as Conditional Graph Hierarchy for Multi-Granular Question Answering
AAAI 2022
Video Question Answering: Datasets, Algorithms and Challenges
EMNLP 2022
Scaling Vision Transformers to Gigapixel Images via Hierarchical Self-Supervised Learning
CVPR 2022