Yuan Zhou

56 papers · 2014–2026 · 14 top CS/AI conferences

Achievements

🧭 Keyword Pioneer 🌍 Conference Polyglot (14) 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (11) 🏃 Academic Marathon (11)

🐝 Cross-Pollinator (12) 🌈 Renaissance Researcher (9) 🧭 Keyword Pioneer 🏆 Grand Slam 🧬 Topic Evolution 🏆 Keyword Champion (3) 🤝 Dynamic Duo (11) 👑 Triple Crown 🔬 Deep Specialist (10) 👥 Mega-Team (32) 🗃️ Keyword Collector (210) ⚡ Prolific Year (6) 💎 Century Club (45) 🔥 Unstoppable (9) 📈 Trend Setter

Conferences

AAAI (13) ICML (13) NIPS (8) ICLR (5) ACL (3) AISTATS (3) CVPR (3) COLT (2) CORL (1) IJCAI (1) INTERSPEECH (1) JMLR (1) MICCAI (1) NAACL (1)

Top co-authors

Jian Peng (11) Hanwang Zhang (6) Qingshan Xu (6) Xi Chen (6) Yining Wang (5) Zhizhou Ren (4) Junbao Zhou (4) Zihan Zhang (4) Yuxuan Wang (4) Xiangyang Ji (4)

Research topics

Applications (1) Privacy (1)

Keywords

regret bound (9) multi-armed bandit (7) sample complexity (6) online learning (4) bayesian inference (3) thresholding bandit (3) large language model (3) markov chain monte carlo (3) probabilistic programming (3) markov decision process (3) video generation (3) online algorithm (2) stochastic optimization (2) upper confidence bound (2) diffusion model (2) assortment optimization (2) minimax regret (2) model compression (2) video understanding (2) reinforcement learning (2)

Papers

Phased One-Step Adversarial Equilibrium for Video Diffusion Models · AAAI 2026
DragNeXt: Rethinking Drag-Based Image Editing · AAAI 2026
Backdoors in RLVR: Jailbreak Backdoors in LLMs From Verifiable Reward · ACL 2026
Debiasing Diffusion Priors via 3D Attention for Consistent Gaussian Splatting · AAAI 2026
Beyond Counting: Evaluating Abstract and Emotional Reasoning in Vision-Language Models · AAAI 2026
LookFlow: Training-Free and Efficient High-Resolution Image Synthesis via Dynamic Lookahead Guidance Flow · AAAI 2026
ReaSon: Reinforced Causal Search with Information Bottleneck for Video Understanding · AAAI 2026
Towards Intrinsic Interpretability of Large Language Models: A Survey of Design Principles and Architectures · ACL 2026
NeuSpring: Neural Spring Fields for Reconstruction and Simulation of Deformable Objects from Videos · AAAI 2026
Pushing Rendering Boundaries: Hard Gaussian Splatting · AAAI 2026
Personalize Your Gaussian: Consistent 3D Scene Personalization from a Single Image · AAAI 2026
Almost Optimal Batch-Regret Tradeoff for Batch Linear Contextual Bandits · ICLR 2025
Safety-Polarized and Prioritized Reinforcement Learning · ICML 2025
An LLM-Empowered Adaptive Evolutionary Algorithm for Multi-Component Deep Learning Systems · AAAI 2025
Exploiting the Shadows: Unveiling Privacy Leaks through Lower-Ranked Tokens in Large Language Models · ACL 2025
CPRM: A LLM-based Continual Pre-training Framework for Relevance Modeling in Commercial Search · NAACL 2025
Graph Disentanglement Learning for fMRI Analysis: Decoupling Disease, Covariates, and Individual Variability · MICCAI 2025
CARE Transformer: Mobile-Friendly Linear Visual Transformer via Decoupled Dual Interaction · CVPR 2025
Long Video Diffusion Generation with Segmented Cross-Attention and Content-Rich Video Data Curation · CVPR 2025
On Path to Multimodal Generalist: General-Level and General-Bench · ICML 2025
Advancing DRL Agents in Commercial Fighting Games: Training, Integration, and Agent-Human Alignment · ICML 2024
Robust Situational Reinforcement Learning in Face of Context Disturbances · ICML 2023
Doodle to Object: Practical Zero-Shot Sketch-Based 3D Shape Retrieval · AAAI 2023
Learning Sparse Group Models Through Boolean Relaxation · ICLR 2023
ABC-KD: Attention-Based-Compression Knowledge Distillation for Deep Learning-Based Noise Suppression · INTERSPEECH 2023
High-Fidelity and Freely Controllable Talking Head Video Generation · CVPR 2023
Proximal Exploration for Model-guided Protein Sequence Design · ICML 2022
Near-Optimal Regret Bounds for Multi-batch Reinforcement Learning · NIPS 2022
Imitation Learning from Observations under Transition Model Disparity · ICLR 2022
Learning Long-Term Reward Redistribution via Randomized Return Decomposition · ICLR 2022
Off-Policy Reinforcement Learning with Delayed Rewards · ICML 2022
Dynamic Car Dispatching and Pricing: Revenue and Fairness for Ridesharing Platforms · IJCAI 2022
Probabilistic Programs with Stochastic Conditioning · ICML 2021
Model-Free Reinforcement Learning: from Clipped Pseudo-Regret to Sample Complexity · ICML 2021
Tight Regret Bounds for Infinite-armed Linear Contextual Bandits · AISTATS 2021
Near-Optimal MNL Bandits Under Risk Criteria · AAAI 2021
Learning Guidance Rewards with Trajectory-space Smoothing · NIPS 2020
Multinomial Logit Bandit with Low Switching Cost · ICML 2020
A PTAS for the Bayesian Thresholding Bandit Problem · AISTATS 2020
Divide, Conquer, and Combine: a New Inference Strategy for Probabilistic Programs with Stochastic Support · ICML 2020
Adaptive Double-Exploration Tradeoff for Outlier Detection · AAAI 2020
Harnessing Distribution Ratio Estimators for Learning Agents with Quality and Diversity · CORL 2020
Almost Optimal Model-Free Reinforcement Learning via Reference-Advantage Decomposition · NIPS 2020
Root-n-Regret for Learning in Markov Decision Processes with Function Approximation and Low Bellman Rank · COLT 2020
Dynamic Assortment Optimization with Changing Contextual Information · JMLR 2020
Thresholding Bandit with Optimal Aggregate Regret · NIPS 2019
Channel Gating Neural Networks · NIPS 2019
Off-Policy Evaluation and Learning from Logged Bandit Feedback: Error Reduction via Surrogate Policy · ICLR 2019
Exploration via Hindsight Goal Generation · NIPS 2019
Nearly Minimax-Optimal Regret for Linearly Parameterized Bandits · COLT 2019
LF-PPL: A Low-Level First Order Probabilistic Programming Language for Non-Differentiable Models · AISTATS 2019
Near-Optimal Policies for Dynamic Multinomial Logit Assortment Selection Models · NIPS 2018
Tight Bounds for Collaborative PAC Learning via Multiplicative Weights · NIPS 2018
Best Arm Identification in Linear Bandits with Linear Dimension Dependency · ICML 2018
Adaptive Multiple-Arm Identification · ICML 2017
Optimal PAC Multiple Arm Identification with Applications to Crowdsourcing · ICML 2014