Bumsoo Kim
14 papers · 2020–2025 · 8 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+8 more ↓ Show less ↑
π Academic Marathon (5) π Interdisciplinary Bridge π§ Keyword Pioneer π Conference Polyglot (8) π Cross-Pollinator (12)
πΊοΈ
Taxonomy Completionist
(31)
π
Interdisciplinary Bridge
π§
Keyword Pioneer
β‘
Prolific Year
(5)
ποΈ
Keyword Collector
(60)
π₯
Unstoppable
(6)
π
Century Club
(14)
β
The Questioner
Conferences
CVPR (4)
AAAI (2)
ECCV (2)
WACV (2)
ACL (1)
EMNLP (1)
ICCV (1)
NIPS (1)
Top co-authors
Keywords
contrastive learning
(4)
object detection
(3)
multimodal learning
(2)
image-text alignment
(2)
3d vision
(2)
multimodal large language model
(2)
knowledge distillation
(2)
human-object interaction detection
(2)
vision-language model
(2)
in-context learning
(2)
pose estimation
(1)
image captioning
(1)
transfer learning
(1)
video understanding
(1)
metric learning
(1)
set prediction
(1)
action recognition
(1)
instruction tuning
(1)
efficient inference
(1)
human pose estimation
(1)
Papers
Make VLM Recognize Visual Hallucination on Cartoon Character Image with Pose Information
WACV 2025
ImagePiece: Content-aware Re-tokenization for Efficient Image Recognition
AAAI 2025
Is `Right' Right? Enhancing Object Orientation Understanding in Multimodal Large Language Models through Egocentric Instruction Tuning
CVPR 2025
Generative Modeling of Class Probability for Multi-Modal Representation Learning
CVPR 2025
Retrieval Enhanced Feedback via In-context Neural Error-book
EMNLP 2025
Expediting Contrastive Language-Image Pretraining via Self-Distilled Encoders
AAAI 2024
UNSPAT: Uncertainty-Guided SpatioTemporal Transformer for 3D Human Pose and Shape Estimation on Videos
WACV 2024
See It All: Contextualized Late Aggregation for 3D Dense Captioning
ACL 2024
Bi-directional Contextual Attention for 3D Dense Captioning
ECCV 2024
Misalign, Contrast then Distill: Rethinking Misalignments in Language-Image Pre-training
ICCV 2023
MSTR: Multi-Scale Transformer for End-to-End Human-Object Interaction Detection
CVPR 2022
UniCLIP: Unified Framework for Contrastive Language-Image Pre-training
NIPS 2022
HOTR: End-to-End Human-Object Interaction Detection With Transformers
CVPR 2021
UnionDet: Union-Level Detector Towards Real-Time Human-Object Interaction Detection
ECCV 2020