Kun Yan

16 papers · 2019–2025 · 8 top CS/AI conferences

Achievements

🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (8) 🏃 Academic Marathon (6) 🐝 Cross-Pollinator (10)

🗺️ Taxonomy Completionist (47) 🧬 Topic Evolution 💎 Century Club (16) 🗃️ Keyword Collector (81) 📈 Trend Setter 🔥 Unstoppable (5)

Conferences

CVPR (4) ACL (3) NIPS (3) AAAI (2) ECCV (1) IJCNLP (1) MICCAI (1) WACV (1)

Top co-authors

Nan Duan (8) Lei Ji (8) Shuai Ma (5) Ping Wang (4) Ming Zhou (4) Chenfei Wu (3) Fangyun Wei (3) Chenbin Zhang (3) Chang Xu (2) Jinjing Zhao (2)

Keywords

multimodal learning (4) contrastive learning (3) attention mechanism (3) vision-language model (3) semi-supervised learning (2) image captioning (2) attention guidance (2) visual grounding (2) few-shot learning (2) autoregressive generation (1) visual question answering (1) semantic segmentation (1) video generation (1) prototype learning (1) metric learning (1) video understanding (1) multi-modal learning (1) domain adaptation (1) multi-label classification (1) image generation (1)

Papers

Galaxy Walker: Geometry-aware VLMs For Galaxy-scale Understanding · CVPR 2025
Minimizing Labeled, Maximizing Unlabeled: An Image-Driven Approach for Video Instance Segmentation · CVPR 2025
Endplate3D-QCT: A High-Resolution Dataset and Benchmark for Automated 3D Segmentation of Lumbar Vertebral Endplates in QCT · MICCAI 2025
Taming Teacher Forcing for Masked Autoregressive Video Generation · CVPR 2025
HORIZON: High-Resolution Semantically Controlled Panorama Synthesis · AAAI 2024
A Large-Scale Human-Centric Benchmark for Referring Expression Comprehension in the LMM Era · NIPS 2024
Voila-A: Aligning Vision-Language Models with User's Gaze Attention · NIPS 2024
Two-Shot Video Object Segmentation · CVPR 2023
CONE: An Efficient COarse-to-fiNE Alignment Framework for Long Video Temporal Grounding · ACL 2023
KU-DMIS-MSRA at RadSum23: Pre-trained Vision-Language Model for Radiology Report Summarization · ACL 2023
Learning Temporal Video Procedure Segmentation From an Automatically Collected Large Dataset · WACV 2022
Inferring Prototypes for Multi-Label Few-Shot Image Classification with Word Vector Guided Attention · AAAI 2022
Trace Controlled Text to Image Generation · ECCV 2022
Control Image Captioning Spatially and Temporally · IJCNLP 2021
Control Image Captioning Spatially and Temporally · ACL 2021
Gate Decorator: Global Filter Pruning Method for Accelerating Deep Convolutional Neural Networks · NIPS 2019