Weitong ZHANG
20 papers · 2020–2026 · 7 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+10 more ↓ Show less ↑
🌍 Conference Polyglot (6) 🏃 Academic Marathon (5) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🐝 Cross-Pollinator (12)
🧭
Keyword Pioneer
🐝
Cross-Pollinator
(12)
🤝
Dynamic Duo
(11)
👑
Triple Crown
🏆
Grand Slam
💎
Century Club
(19)
🗃️
Keyword Collector
(52)
🚀
Conference Pioneer
⚡
Prolific Year
(8)
🔥
Unstoppable
(6)
Conferences
ICLR (6)
ICML (4)
NIPS (4)
MICCAI (3)
AAAI (1)
CVPR (1)
EMNLP (1)
Top co-authors
Keywords
sample complexity
(3)
reward-free exploration
(3)
diffusion model
(2)
linear mixture mdp
(2)
value-targeted regression
(2)
function approximation
(1)
image retrieval
(1)
imitation learning
(1)
model safety
(1)
image reconstruction
(1)
model-based reinforcement learning
(1)
magnetic resonance imaging
(1)
linear function approximation
(1)
finite-time analysis
(1)
policy gradient
(1)
regret bound
(1)
upper confidence bound
(1)
generative model
(1)
adversarial attack
(1)
epistemic uncertainty
(1)
Papers
Reinforcement Learning Without Explicit Rewards: Theory and Practice
AAAI 2026
Energy-Weighted Flow Matching for Offline Reinforcement Learning
ICLR 2025
Image Generation Diversity Issues and How to Tame Them
CVPR 2025
Phi: Preference Hijacking in Multi-modal Large Language Models at Inference Time
EMNLP 2025
Anyprefer: An Agentic Framework for Preference Data Synthesis
ICLR 2025
CREAM: Consistency Regularized Self-Rewarding Language Models
ICLR 2025
Mitigating Object Hallucination in Large Vision-Language Models via Image-Grounded Guidance
ICML 2025
Mesh4D: A Motion-Aware Multi-View Variational Autoencoder for 3D+t Mesh Reconstruction
MICCAI 2025
Multi-Agent Reasoning for Cardiovascular Imaging Phenotype Analysis
MICCAI 2025
Horizon-free Reinforcement Learning in Adversarial Linear Mixture MDPs
ICLR 2024
Universal Topology Refinement for Medical Image Segmentation with Polynomial Feature Synthesis
MICCAI 2024
Stability and Generalizability in SDE Diffusion Models with Measure-Preserving Dynamics
NIPS 2024
Achieving Constant Regret in Linear Markov Decision Processes
NIPS 2024
Uncertainty-Aware Reward-Free Exploration with General Function Approximation
ICML 2024
Optimal Horizon-Free Reward-Free Exploration for Linear Mixture MDPs
ICML 2023
On the Interplay Between Misspecification and Sub-optimality Gap in Linear Contextual Bandits
ICML 2023
Learning Neural Contextual Bandits through Perturbed Rewards
ICLR 2022
Reward-Free Model-Based Reinforcement Learning with Linear Function Approximation
NIPS 2021
Neural Thompson Sampling
ICLR 2021
A Finite-Time Analysis of Two Time-Scale Actor-Critic Methods
NIPS 2020