Xiyao Wang

16 papers · 2022–2025 · 8 conferences · across top CS/AI conferences

Achievements

+8 more ↓

🐝 Cross-Pollinator (11) 🌈 Renaissance Researcher (5) 🌍 Conference Polyglot (8) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge

🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (29) 👑 Triple Crown 🤝 Dynamic Duo (14) ❓ The Questioner ⚡ Prolific Year (7) 🗃️ Keyword Collector (50) 💎 Century Club (16)

Conferences

ICLR (4) ICML (3) EMNLP (2) NAACL (2) NIPS (2) ACL (1) CVPR (1) ICCV (1)

Top co-authors

Furong Huang (14) Ruijie Zheng (7) Huazhe Xu (6) Yanchao Sun (4) Xiaoyu Liu (4) Yuhang Zhou (4) Hal Daume III (3) Huaxiu Yao (3) Yuancheng Xu (3) Jing Zhu (2)

Keywords

vision-language model (3) modality alignment (2) model-based reinforcement learning (2) sample efficiency (2) domain generalization (2) policy learning (2) preference learning (1) visual reinforcement learning (1) domain adaptation (1) large multimodal model (1) instruction following (1) preference optimization (1) reinforcement learning from human feedback (1) model alignment (1) state representation (1) continuous control (1) knowledge distillation (1) distribution shift (1) benchmark evaluation (1) contrastive learning (1)

Papers

Enhancing Visual-Language Modality Alignment in Large Vision Language Models via Self-Improvement NAACL 2025 LLaVA-Critic: Learning to Evaluate Multimodal Models CVPR 2025 World Models with Hints of Large Language Models for Goal Achieving NAACL 2025 DISCO Balances the Scales: Adaptive Domain- and Difficulty-Aware Reinforcement Learning on Imbalanced Data EMNLP 2025 Scaling Inference-Time Search with Vision Value Model for Improved Visual Comprehension ICCV 2025 COPlanner: Plan to Roll Out Conservatively but to Explore Optimistically for Model-Based RL ICLR 2024 Adapting Static Fairness to Sequential Decision-Making: Bias Mitigation Strategies towards Equal Long-term Benefit Rate ICML 2024 Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences ACL 2024 Premier-TACO is a Few-Shot Policy Learner: Pretraining Multitask Representation via Temporal Action-Driven Contrastive Loss ICML 2024 Multi-Stage Balanced Distillation: Addressing Long-Tail Challenges in Sequence-Level Knowledge Distillation EMNLP 2024 Calibrated Self-Rewarding Vision Language Models NIPS 2024 DrM: Mastering Visual Reinforcement Learning through Dormant Ratio Minimization ICLR 2024 Is Model Ensemble Necessary? Model-based RL via a Single Model with Lipschitz Regularized Value Function ICLR 2023 $\texttt{TACO}$: Temporal Latent Action-Driven Contrastive Loss for Visual Reinforcement Learning NIPS 2023 Live in the Moment: Learning Dynamics Model Adapted to Evolving Policy ICML 2023 Transfer RL across Observation Feature Spaces via Model-Based Regularization ICLR 2022