Bumsoo Kim

14 papers · 2020–2025 · 8 conferences · across top CS/AI conferences

Achievements

+8 more ↓

🏃 Academic Marathon (5) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (8) 🐝 Cross-Pollinator (12)

🗺️ Taxonomy Completionist (31) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer ⚡ Prolific Year (5) 🗃️ Keyword Collector (60) 🔥 Unstoppable (6) 💎 Century Club (14) ❓ The Questioner

Conferences

CVPR (4) AAAI (2) ECCV (2) WACV (2) ACL (1) EMNLP (1) ICCV (1) NIPS (1)

Top co-authors

Seung Hwan Kim (3) Yeonsik Jo (2) Jinhyung Kim (2) Jaewoo Kang (2) Gunhee Kim (2) Junhyun Lee (2) Eun-Sol Kim (2) Minjung Kim (2) Seunghwan Kim (2) Soonyoung Lee (2)

Keywords

contrastive learning (4) object detection (3) multimodal learning (2) image-text alignment (2) 3d vision (2) multimodal large language model (2) knowledge distillation (2) human-object interaction detection (2) vision-language model (2) in-context learning (2) pose estimation (1) image captioning (1) transfer learning (1) video understanding (1) metric learning (1) set prediction (1) action recognition (1) instruction tuning (1) efficient inference (1) human pose estimation (1)

Papers

Make VLM Recognize Visual Hallucination on Cartoon Character Image with Pose Information WACV 2025 ImagePiece: Content-aware Re-tokenization for Efficient Image Recognition AAAI 2025 Is `Right' Right? Enhancing Object Orientation Understanding in Multimodal Large Language Models through Egocentric Instruction Tuning CVPR 2025 Generative Modeling of Class Probability for Multi-Modal Representation Learning CVPR 2025 Retrieval Enhanced Feedback via In-context Neural Error-book EMNLP 2025 Expediting Contrastive Language-Image Pretraining via Self-Distilled Encoders AAAI 2024 UNSPAT: Uncertainty-Guided SpatioTemporal Transformer for 3D Human Pose and Shape Estimation on Videos WACV 2024 See It All: Contextualized Late Aggregation for 3D Dense Captioning ACL 2024 Bi-directional Contextual Attention for 3D Dense Captioning ECCV 2024 Misalign, Contrast then Distill: Rethinking Misalignments in Language-Image Pre-training ICCV 2023 MSTR: Multi-Scale Transformer for End-to-End Human-Object Interaction Detection CVPR 2022 UniCLIP: Unified Framework for Contrastive Language-Image Pre-training NIPS 2022 HOTR: End-to-End Human-Object Interaction Detection With Transformers CVPR 2021 UnionDet: Union-Level Detector Towards Real-Time Human-Object Interaction Detection ECCV 2020