Yutong Bai
23 papers · 2019–2026 · 11 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+9 more ↓ Show less ↑
π Renaissance Researcher (7) π Interdisciplinary Bridge π Conference Polyglot (10) π Academic Marathon (6) πΊοΈ Taxonomy Completionist (44)
π§
Keyword Pioneer
π£
Hot Topic Early Bird
π
Conference Polyglot
(10)
π§¬
Topic Evolution
β‘
Prolific Year
(5)
π
Century Club
(22)
β
The Questioner
(2)
π₯
Unstoppable
(7)
ποΈ
Keyword Collector
(88)
Conferences
CVPR (6)
ICLR (4)
NIPS (3)
EMNLP (2)
WACV (2)
AAAI (1)
ACL (1)
CORL (1)
ECCV (1)
ICCV (1)
MIDL (1)
Top co-authors
Keywords
vision transformer
(5)
contrastive learning
(4)
convolutional neural network
(3)
masked autoencoder
(2)
image segmentation
(2)
medical imaging
(2)
knowledge distillation
(2)
semantic segmentation
(1)
pose estimation
(1)
transformer architecture
(1)
network architecture
(1)
feature extraction
(1)
object detection
(1)
self-supervised learning
(1)
adversarial robustness
(1)
image classification
(1)
data augmentation
(1)
attention mechanism
(1)
information retrieval
(1)
representation learning
(1)
Papers
Probing Audio-Visual Reasoning in Multimodal Language Models through the Lens of Audio
ACL 2026
KiVA: Kid-inspired Visual Analogies for Testing Large Multimodal Models
ICLR 2025
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time
EMNLP 2025
Finding Visual Task Vectors
ECCV 2024
Evaluating Multiview Object Consistency in Humans and Image Models
NIPS 2024
LLARVA: Vision-Action Instruction Tuning Enhances Robot Learning
CORL 2024
Sequential Modeling Enables Scalable Learning for Large Vision Models
CVPR 2024
Learning Dynamic Multi-attribute Interest for Personalized Product Search
EMNLP 2024
Discovering Failure Modes of Text-guided Diffusion Models via Adversarial Search
ICLR 2024
Masked Autoencoders Enable Efficient Knowledge Distillers
CVPR 2023
CoKe: Contrastive Learning for Robust Keypoint Detection
WACV 2023
Delving Into Masked Autoencoders for Multi-Label Thorax Disease Classification
WACV 2023
Can CNNs Be More Robust Than Transformers?
ICLR 2023
Making Your First Choice: To Address Cold Start Problem in Medical Active Learning
MIDL 2023
Point-Level Region Contrast for Object Detection Pre-Training
CVPR 2022
TransFG: A Transformer Architecture for Fine-Grained Recognition
AAAI 2022
Fast AdvProp
ICLR 2022
Are Transformers more robust than CNNs?
NIPS 2021
Mask Guided Matting via Progressive Refinement Network
CVPR 2021
Glance-and-Gaze Vision Transformer
NIPS 2021
C2FNAS: Coarse-to-Fine Neural Architecture Search for 3D Medical Image Segmentation
CVPR 2020
CLEVR-Ref+: Diagnosing Visual Reasoning With Referring Expressions
CVPR 2019
Semantic Part Detection via Matching: Learning to Generalize to Novel Viewpoints From Limited Training Data
ICCV 2019