Yuexiang Zhai
14 papers · 2019–2026 · 7 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+9 more ↓ Show less ↑
π Academic Marathon (6) π§ Keyword Pioneer π Interdisciplinary Bridge π Conference Polyglot (6) π Cross-Pollinator (13)
π
Cross-Pollinator
(13)
π
Renaissance Researcher
(6)
πΊοΈ
Taxonomy Completionist
(28)
π
Grand Slam
π€
Dynamic Duo
(10)
π
Century Club
(13)
ποΈ
Keyword Collector
(50)
π₯
Unstoppable
(7)
β
The Questioner
(2)
Conferences
ICLR (3)
ICML (3)
NIPS (3)
JMLR (2)
AAAI (1)
CVPR (1)
ICCV (1)
Top co-authors
Keywords
multimodal large language model
(2)
vision language model
(2)
reinforcement learning
(2)
transformer architecture
(1)
contrastive learning
(1)
self-supervised learning
(1)
3d reconstruction
(1)
policy optimization
(1)
matrix factorization
(1)
decision making
(1)
sparse representation
(1)
curriculum learning
(1)
sample complexity
(1)
batch normalization
(1)
multimodal learning
(1)
benchmark evaluation
(1)
visual reasoning
(1)
visual grounding
(1)
instruction following
(1)
representation learning
(1)
Papers
Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs
AAAI 2026
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training
ICML 2025
LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models
ICML 2025
RLIF: Interactive Imitation Learning as Reinforcement Learning
ICLR 2024
Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
NIPS 2024
Eyes Wide Shut? Exploring the Visual Shortcomings of Multimodal LLMs
CVPR 2024
White-Box Transformers via Sparse Rate Reduction: Compression Is All There Is?
JMLR 2024
Understanding the Complexity Gains of Single-Task RL with a Curriculum
ICML 2023
Unpacking Reward Shaping: Understanding the Benefits of Reward Engineering on Sample Complexity
NIPS 2022
Convolutional Normalization: Improving Deep Convolutional Network Robustness and Training
NIPS 2021
Geometric Analysis of Nonconvex Optimization Landscapes for Overcomplete Learning
ICLR 2020
Understanding l4-based Dictionary Learning: Interpretation, Stability, and Robustness
ICLR 2020
Complete Dictionary Learning via L4-Norm Maximization over the Orthogonal Group
JMLR 2020
Learning to Reconstruct 3D Manhattan Wireframes From a Single Image
ICCV 2019