Kun Yao
10 papers · 2022–2025 · 8 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+4 more ↓ Show less ↑
π Cross-Pollinator (6) π Interdisciplinary Bridge π§ Keyword Pioneer πΊοΈ Taxonomy Completionist (13) π Renaissance Researcher (5)
π
Conference Polyglot
(8)
π
Grand Slam
β‘
Prolific Year
(5)
π
Century Club
(10)
Conferences
ICCV (2)
ICLR (2)
AAAI (1)
CVPR (1)
ECCV (1)
ICML (1)
IJCAI (1)
NIPS (1)
Top co-authors
Keywords
object detection
(2)
human pose estimation
(2)
vision transformer
(1)
visual question answering
(1)
multimodal learning
(1)
document understanding
(1)
person re-identification
(1)
cross-modal retrieval
(1)
multimodal large language model
(1)
multimodal fusion
(1)
feature aggregation
(1)
cross-domain generalization
(1)
pedestrian attribute recognition
(1)
efficient transformer
(1)
token merging
(1)
visual document understanding
(1)
multi-person pose estimation
(1)
keypoint detection
(1)
training convergence
(1)
masked image modeling
(1)
Papers
Interpretable Face Anti-Spoofing: Enhancing Generalization with Multimodal Large Language Models
AAAI 2025
Towards Unified Multi-granularity Text Detection with Interactive Attention
ICML 2024
Textual Grounding for Open-vocabulary Visual Information Extraction in Layout-diversified Documents
ECCV 2024
FROSTER: Frozen CLIP is A Strong Teacher for Open-Vocabulary Action Recognition
ICLR 2024
Fast-StrucTexT: An Efficient Hourglass Transformer with Modality-guided Dynamic Token Merge for Document Understanding
IJCAI 2023
Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment
ICCV 2023
Group Pose: A Simple Baseline for End-to-End Multi-Person Pose Estimation
ICCV 2023
StrucTexTv2: Masked Visual-Textual Prediction for Document Image Pre-training
ICLR 2023
HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception
NIPS 2023
ViSTA: Vision and Scene Text Aggregation for Cross-Modal Retrieval
CVPR 2022