Weitong ZHANG

20 papers · 2020–2026 · 7 conferences · across top CS/AI conferences

Achievements


🌍 Conference Polyglot (6) 🏃 Academic Marathon (5) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🐝 Cross-Pollinator (12)

🤝 Dynamic Duo (11) 👑 Triple Crown 🏆 Grand Slam 💎 Century Club (19) 🗃️ Keyword Collector (52) 🚀 Conference Pioneer ⚡ Prolific Year (8) 🔥 Unstoppable (6)

Conferences

ICLR (6) ICML (4) NIPS (4) MICCAI (3) AAAI (1) CVPR (1) EMNLP (1)

Top co-authors

Quanquan Gu (11) Bernhard Kainz (5) Dongruo Zhou (4) Jiafan He (3) Liu Li (3) Chengqi Zang (2) Cheng Ouyang (2) Zhiyuan Fan (2) Wenjia Bai (2) Ying Wei (2)

Keywords

sample complexity (3) reward-free exploration (3) diffusion model (2) linear mixture mdp (2) value-targeted regression (2) function approximation (1) image retrieval (1) imitation learning (1) model safety (1) image reconstruction (1) model-based reinforcement learning (1) magnetic resonance imaging (1) linear function approximation (1) finite-time analysis (1) policy gradient (1) regret bound (1) upper confidence bound (1) generative model (1) adversarial attack (1) epistemic uncertainty (1)

Papers

Reinforcement Learning Without Explicit Rewards: Theory and Practice · AAAI 2026
Energy-Weighted Flow Matching for Offline Reinforcement Learning · ICLR 2025
Image Generation Diversity Issues and How to Tame Them · CVPR 2025
Phi: Preference Hijacking in Multi-modal Large Language Models at Inference Time · EMNLP 2025
Anyprefer: An Agentic Framework for Preference Data Synthesis · ICLR 2025
CREAM: Consistency Regularized Self-Rewarding Language Models · ICLR 2025
Mitigating Object Hallucination in Large Vision-Language Models via Image-Grounded Guidance · ICML 2025
Mesh4D: A Motion-Aware Multi-View Variational Autoencoder for 3D+t Mesh Reconstruction · MICCAI 2025
Multi-Agent Reasoning for Cardiovascular Imaging Phenotype Analysis · MICCAI 2025
Horizon-free Reinforcement Learning in Adversarial Linear Mixture MDPs · ICLR 2024
Universal Topology Refinement for Medical Image Segmentation with Polynomial Feature Synthesis · MICCAI 2024
Stability and Generalizability in SDE Diffusion Models with Measure-Preserving Dynamics · NIPS 2024
Achieving Constant Regret in Linear Markov Decision Processes · NIPS 2024
Uncertainty-Aware Reward-Free Exploration with General Function Approximation · ICML 2024
Optimal Horizon-Free Reward-Free Exploration for Linear Mixture MDPs · ICML 2023
On the Interplay Between Misspecification and Sub-optimality Gap in Linear Contextual Bandits · ICML 2023
Learning Neural Contextual Bandits through Perturbed Rewards · ICLR 2022
Reward-Free Model-Based Reinforcement Learning with Linear Function Approximation · NIPS 2021
Neural Thompson Sampling · ICLR 2021
A Finite-Time Analysis of Two Time-Scale Actor-Critic Methods · NIPS 2020