conftrace_

Kun Yao

10 papers · 2022–2025 · 8 conferences · across top CS/AI conferences

Achievements

🐝 Cross-Pollinator (6) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🗺️ Taxonomy Completionist (13) 🌈 Renaissance Researcher (5)

🌍 Conference Polyglot (8) 🏆 Grand Slam ⚡ Prolific Year (5) 💎 Century Club (10)

Conferences

ICCV (2) ICLR (2) AAAI (1) CVPR (1) ECCV (1) ICML (1) IJCAI (1) NIPS (1)

Top co-authors

Jingdong Wang (7) Errui Ding (7) Junyu Han (5) Chengquan Zhang (4) Jian Wang (3) Qiang Chen (2) Xiameng Qin (2) Yulin Li (2) Mengjun Cheng (2) Hao Zhou (2)

Keywords

object detection (2) human pose estimation (2) vision transformer (1) visual question answering (1) multimodal learning (1) document understanding (1) person re-identification (1) cross-modal retrieval (1) multimodal large language model (1) multimodal fusion (1) feature aggregation (1) cross-domain generalization (1) pedestrian attribute recognition (1) efficient transformer (1) token merging (1) visual document understanding (1) multi-person pose estimation (1) keypoint detection (1) training convergence (1) masked image modeling (1)

Papers

Interpretable Face Anti-Spoofing: Enhancing Generalization with Multimodal Large Language Models · AAAI 2025
Towards Unified Multi-granularity Text Detection with Interactive Attention · ICML 2024
Textual Grounding for Open-vocabulary Visual Information Extraction in Layout-diversified Documents · ECCV 2024
FROSTER: Frozen CLIP is A Strong Teacher for Open-Vocabulary Action Recognition · ICLR 2024
Fast-StrucTexT: An Efficient Hourglass Transformer with Modality-guided Dynamic Token Merge for Document Understanding · IJCAI 2023
Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment · ICCV 2023
Group Pose: A Simple Baseline for End-to-End Multi-Person Pose Estimation · ICCV 2023
StrucTexTv2: Masked Visual-Textual Prediction for Document Image Pre-training · ICLR 2023
HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception · NIPS 2023
ViSTA: Vision and Scene Text Aggregation for Cross-Modal Retrieval · CVPR 2022