Xiyao Wang
16 papers · 2022–2025 · 8 top CS/AI conferences
Achievements
Cross-Pollinator (11)
Renaissance Researcher (5)
Conference Polyglot (8)
Keyword Pioneer
Interdisciplinary Bridge
Taxonomy Completionist (29)
Triple Crown
Dynamic Duo (14)
The Questioner
Prolific Year (7)
Keyword Collector (50)
Century Club (16)
Conferences
ICLR (4)
ICML (3)
EMNLP (2)
NAACL (2)
NeurIPS (2)
ACL (1)
CVPR (1)
ICCV (1)
Keywords
vision-language model (3)
modality alignment (2)
model-based reinforcement learning (2)
sample efficiency (2)
domain generalization (2)
policy learning (2)
preference learning (1)
visual reinforcement learning (1)
domain adaptation (1)
large multimodal model (1)
instruction following (1)
preference optimization (1)
reinforcement learning from human feedback (1)
model alignment (1)
state representation (1)
continuous control (1)
knowledge distillation (1)
distribution shift (1)
benchmark evaluation (1)
contrastive learning (1)
Papers
Enhancing Visual-Language Modality Alignment in Large Vision Language Models via Self-Improvement
NAACL 2025
LLaVA-Critic: Learning to Evaluate Multimodal Models
CVPR 2025
World Models with Hints of Large Language Models for Goal Achieving
NAACL 2025
DISCO Balances the Scales: Adaptive Domain- and Difficulty-Aware Reinforcement Learning on Imbalanced Data
EMNLP 2025
Scaling Inference-Time Search with Vision Value Model for Improved Visual Comprehension
ICCV 2025
COPlanner: Plan to Roll Out Conservatively but to Explore Optimistically for Model-Based RL
ICLR 2024
Adapting Static Fairness to Sequential Decision-Making: Bias Mitigation Strategies towards Equal Long-term Benefit Rate
ICML 2024
Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences
ACL 2024
Premier-TACO is a Few-Shot Policy Learner: Pretraining Multitask Representation via Temporal Action-Driven Contrastive Loss
ICML 2024
Multi-Stage Balanced Distillation: Addressing Long-Tail Challenges in Sequence-Level Knowledge Distillation
EMNLP 2024
Calibrated Self-Rewarding Vision Language Models
NeurIPS 2024
DrM: Mastering Visual Reinforcement Learning through Dormant Ratio Minimization
ICLR 2024
Is Model Ensemble Necessary? Model-based RL via a Single Model with Lipschitz Regularized Value Function
ICLR 2023
TACO: Temporal Latent Action-Driven Contrastive Loss for Visual Reinforcement Learning
NeurIPS 2023
Live in the Moment: Learning Dynamics Model Adapted to Evolving Policy
ICML 2023
Transfer RL across Observation Feature Spaces via Model-Based Regularization
ICLR 2022