Yutong Bai

23 papers · 2019–2026 · 11 conferences · across top CS/AI conferences

Achievements

+9 more ↓

🌈 Renaissance Researcher (7) 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (10) 🏃 Academic Marathon (6) 🗺️ Taxonomy Completionist (44)

🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🌍 Conference Polyglot (10) 🧬 Topic Evolution ⚡ Prolific Year (5) 💎 Century Club (22) ❓ The Questioner (2) 🔥 Unstoppable (7) 🗃️ Keyword Collector (88)

Conferences

CVPR (6) ICLR (4) NIPS (3) EMNLP (2) WACV (2) AAAI (1) ACL (1) CORL (1) ECCV (1) ICCV (1) MIDL (1)

Top co-authors

Alan L. Yuille (7) Alan Yuille (7) Cihang Xie (4) Trevor Darrell (3) Qihang Yu (3) Adam Kortylewski (3) Yuyin Zhou (2) Yixiao Zhang (2) Zongwei Zhou (2) Jitendra Malik (2)

Keywords

vision transformer (5) contrastive learning (4) convolutional neural network (3) masked autoencoder (2) image segmentation (2) medical imaging (2) knowledge distillation (2) semantic segmentation (1) pose estimation (1) transformer architecture (1) network architecture (1) feature extraction (1) object detection (1) self-supervised learning (1) adversarial robustness (1) image classification (1) data augmentation (1) attention mechanism (1) information retrieval (1) representation learning (1)

Papers

Probing Audio-Visual Reasoning in Multimodal Language Models through the Lens of Audio ACL 2026 KiVA: Kid-inspired Visual Analogies for Testing Large Multimodal Models ICLR 2025 AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time EMNLP 2025 Finding Visual Task Vectors ECCV 2024 Evaluating Multiview Object Consistency in Humans and Image Models NIPS 2024 LLARVA: Vision-Action Instruction Tuning Enhances Robot Learning CORL 2024 Sequential Modeling Enables Scalable Learning for Large Vision Models CVPR 2024 Learning Dynamic Multi-attribute Interest for Personalized Product Search EMNLP 2024 Discovering Failure Modes of Text-guided Diffusion Models via Adversarial Search ICLR 2024 Masked Autoencoders Enable Efficient Knowledge Distillers CVPR 2023 CoKe: Contrastive Learning for Robust Keypoint Detection WACV 2023 Delving Into Masked Autoencoders for Multi-Label Thorax Disease Classification WACV 2023 Can CNNs Be More Robust Than Transformers? ICLR 2023 Making Your First Choice: To Address Cold Start Problem in Medical Active Learning MIDL 2023 Point-Level Region Contrast for Object Detection Pre-Training CVPR 2022 TransFG: A Transformer Architecture for Fine-Grained Recognition AAAI 2022 Fast AdvProp ICLR 2022 Are Transformers more robust than CNNs? NIPS 2021 Mask Guided Matting via Progressive Refinement Network CVPR 2021 Glance-and-Gaze Vision Transformer NIPS 2021 C2FNAS: Coarse-to-Fine Neural Architecture Search for 3D Medical Image Segmentation CVPR 2020 CLEVR-Ref+: Diagnosing Visual Reasoning With Referring Expressions CVPR 2019 Semantic Part Detection via Matching: Learning to Generalize to Novel Viewpoints From Limited Training Data ICCV 2019