Kun Yan
16 papers · 2019–2025 · 8 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+8 more ↓ Show less ↑
π§ Keyword Pioneer π Interdisciplinary Bridge π Conference Polyglot (8) π Academic Marathon (6) π Cross-Pollinator (10)
πΊοΈ
Taxonomy Completionist
(47)
π
Interdisciplinary Bridge
π
Conference Polyglot
(8)
π§¬
Topic Evolution
π
Century Club
(16)
ποΈ
Keyword Collector
(81)
π
Trend Setter
π₯
Unstoppable
(5)
Conferences
CVPR (4)
ACL (3)
NIPS (3)
AAAI (2)
ECCV (1)
IJCNLP (1)
MICCAI (1)
WACV (1)
Top co-authors
Keywords
multimodal learning
(4)
contrastive learning
(3)
attention mechanism
(3)
vision-language model
(3)
semi-supervised learning
(2)
image captioning
(2)
attention guidance
(2)
visual grounding
(2)
few-shot learning
(2)
autoregressive generation
(1)
visual question answering
(1)
semantic segmentation
(1)
video generation
(1)
prototype learning
(1)
metric learning
(1)
video understanding
(1)
multi-modal learning
(1)
domain adaptation
(1)
multi-label classification
(1)
image generation
(1)
Papers
Galaxy Walker: Geometry-aware VLMs For Galaxy-scale Understanding
CVPR 2025
Minimizing Labeled, Maximizing Unlabeled: An Image-Driven Approach for Video Instance Segmentation
CVPR 2025
Endplate3D-QCT: A High-Resolution Dataset and Benchmark for Automated 3D Segmentation of Lumbar Vertebral Endplates in QCT
MICCAI 2025
Taming Teacher Forcing for Masked Autoregressive Video Generation
CVPR 2025
HORIZON: High-Resolution Semantically Controlled Panorama Synthesis
AAAI 2024
A Large-Scale Human-Centric Benchmark for Referring Expression Comprehension in the LMM Era
NIPS 2024
Voila-A: Aligning Vision-Language Models with User's Gaze Attention
NIPS 2024
Two-Shot Video Object Segmentation
CVPR 2023
CONE: An Efficient COarse-to-fiNE Alignment Framework for Long Video Temporal Grounding
ACL 2023
KU-DMIS-MSRA at RadSum23: Pre-trained Vision-Language Model for Radiology Report Summarization
ACL 2023
Learning Temporal Video Procedure Segmentation From an Automatically Collected Large Dataset
WACV 2022
Inferring Prototypes for Multi-Label Few-Shot Image Classification with Word Vector Guided Attention
AAAI 2022
Trace Controlled Text to Image Generation
ECCV 2022
Control Image Captioning Spatially and Temporally
IJCNLP 2021
Control Image Captioning Spatially and Temporally
ACL 2021
Gate Decorator: Global Filter Pruning Method for Accelerating Deep Convolutional Neural Networks
NIPS 2019