Jianbo Yuan
16 papers · 2022–2026 · 7 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+6 more ↓ Show less ↑
π Cross-Pollinator (13) π Renaissance Researcher (5) π Conference Polyglot (6) π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (22)
π§
Keyword Pioneer
π
Conference Polyglot
(6)
π€
Dynamic Duo
(10)
π
Century Club
(15)
π₯
Unstoppable
(5)
β‘
Prolific Year
(5)
Conferences
ICLR (5)
ACL (3)
WACV (3)
ICML (2)
AAAI (1)
CVPR (1)
ECCV (1)
Top co-authors
Keywords
large language model
(3)
multimodal large language model
(3)
contrastive learning
(2)
direct preference optimization
(1)
attention mechanism
(1)
preference optimization
(1)
multimodal learning
(1)
instance segmentation
(1)
semantic alignment
(1)
adaptive sampling
(1)
model alignment
(1)
exploration policy
(1)
vision language model
(1)
cross-modal alignment
(1)
data selection
(1)
vision-language model
(1)
frequency domain
(1)
multimodal representation
(1)
image understanding
(1)
multimodal understanding
(1)
Papers
InfiGUI-G1: Advancing GUI Grounding with Adaptive Exploration Policy Optimization
AAAI 2026
Learning Compact Video Representations for Efficient Long-form Video Understanding in Large Multimodal Models
WACV 2026
DavIR: Data Selection via Implicit Reward for Large Language Models
ACL 2025
Learning Stackable and Skippable LEGO Bricks for Efficient, Reconfigurable, and Variable-Resolution Diffusion Modeling
ICLR 2024
InfiMM: Advancing Multimodal Understanding with an Open-Sourced Visual Language Model
ACL 2024
Let Models Speak Ciphers: Multiagent Debate through Embeddings
ICLR 2024
An Expert is Worth One Token: Synergizing Multiple Expert LLMs as Generalist via Expert Token Routing
ACL 2024
LEMON: Lossless model expansion
ICLR 2024
InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks
ICML 2024
Self-Infilling Code Generation
ICML 2024
HiCLIP: Contrastive Language-Image Pretraining with Hierarchy-aware Attention
ICLR 2023
Efficient Attention via Control Variates
ICLR 2023
Revisiting Multimodal Representation in Contrastive Learning: From Patch and Token Embeddings to Finite Discrete Tokens
CVPR 2023
Discrete Cosin TransFormer: Image Modeling From Frequency Domain
WACV 2023
More Than Just Attention: Improving Cross-Modal Attentions With Contrastive Constraints for Image-Text Matching
WACV 2023
Hierarchically Self-Supervised Transformer for Human Skeleton Representation Learning
ECCV 2022