Yuexiang Zhai

14 papers · 2019–2026 · 7 conferences · across top CS/AI conferences

Achievements

🏃 Academic Marathon (6) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (6) 🐝 Cross-Pollinator (13)

🌈 Renaissance Researcher (6) 🗺️ Taxonomy Completionist (28) 🏆 Grand Slam 🤝 Dynamic Duo (10) 💎 Century Club (13) 🗃️ Keyword Collector (50) 🔥 Unstoppable (7) ❓ The Questioner (2)

Conferences

ICLR (3) ICML (3) NIPS (3) JMLR (2) AAAI (1) CVPR (1) ICCV (1)

Top co-authors

Yi Ma (10) Sergey Levine (6) Shengbang Tong (5) Saining Xie (3) Tianzhe Chu (3) Zhihui Zhu (2) Yann LeCun (2) Xiao Li (2) Qing Qu (2) Hao Bai (2)

Keywords

multimodal large language model (2) vision language model (2) reinforcement learning (2) transformer architecture (1) contrastive learning (1) self-supervised learning (1) 3d reconstruction (1) policy optimization (1) matrix factorization (1) decision making (1) sparse representation (1) curriculum learning (1) sample complexity (1) batch normalization (1) multimodal learning (1) benchmark evaluation (1) visual reasoning (1) visual grounding (1) instruction following (1) representation learning (1)

Papers

Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs · AAAI 2026
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training · ICML 2025
LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models · ICML 2025
RLIF: Interactive Imitation Learning as Reinforcement Learning · ICLR 2024
Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning · NIPS 2024
Eyes Wide Shut? Exploring the Visual Shortcomings of Multimodal LLMs · CVPR 2024
White-Box Transformers via Sparse Rate Reduction: Compression Is All There Is? · JMLR 2024
Understanding the Complexity Gains of Single-Task RL with a Curriculum · ICML 2023
Unpacking Reward Shaping: Understanding the Benefits of Reward Engineering on Sample Complexity · NIPS 2022
Convolutional Normalization: Improving Deep Convolutional Network Robustness and Training · NIPS 2021
Geometric Analysis of Nonconvex Optimization Landscapes for Overcomplete Learning · ICLR 2020
Understanding l4-based Dictionary Learning: Interpretation, Stability, and Robustness · ICLR 2020
Complete Dictionary Learning via L4-Norm Maximization over the Orthogonal Group · JMLR 2020
Learning to Reconstruct 3D Manhattan Wireframes From a Single Image · ICCV 2019