Yuan Zhou
56 papers · 2014–2026 · 14 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+15 more ↓ Show less ↑
π§ Keyword Pioneer π Conference Polyglot (14) π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (11) π Academic Marathon (11)
π
Cross-Pollinator
(12)
π
Renaissance Researcher
(9)
π§
Keyword Pioneer
π
Grand Slam
π§¬
Topic Evolution
π
Keyword Champion
(3)
π€
Dynamic Duo
(11)
π
Triple Crown
π¬
Deep Specialist
(10)
π₯
Mega-Team
(32)
ποΈ
Keyword Collector
(210)
β‘
Prolific Year
(6)
π
Century Club
(45)
π₯
Unstoppable
(9)
π
Trend Setter
Conferences
AAAI (13)
ICML (13)
NIPS (8)
ICLR (5)
ACL (3)
AISTATS (3)
CVPR (3)
COLT (2)
CORL (1)
IJCAI (1)
INTERSPEECH (1)
JMLR (1)
MICCAI (1)
NAACL (1)
Top co-authors
Research topics
Keywords
regret bound
(9)
multi-armed bandit
(7)
sample complexity
(6)
online learning
(4)
bayesian inference
(3)
thresholding bandit
(3)
large language model
(3)
markov chain monte carlo
(3)
probabilistic programming
(3)
markov decision process
(3)
video generation
(3)
online algorithm
(2)
stochastic optimization
(2)
upper confidence bound
(2)
diffusion model
(2)
assortment optimization
(2)
minimax regret
(2)
model compression
(2)
video understanding
(2)
reinforcement learning
(2)
Papers
Phased One-Step Adversarial Equilibrium for Video Diffusion Models
AAAI 2026
DragNeXt: Rethinking Drag-Based Image Editing
AAAI 2026
Backdoors in RLVR: Jailbreak Backdoors in LLMs From Verifiable Reward
ACL 2026
Debiasing Diffusion Priors via 3D Attention for Consistent Gaussian Splatting
AAAI 2026
Beyond Counting: Evaluating Abstract and Emotional Reasoning in Vision-Language Models
AAAI 2026
LookFlow: Training-Free and Efficient High-Resolution Image Synthesis via Dynamic Lookahead Guidance Flow
AAAI 2026
ReaSon: Reinforced Causal Search with Information Bottleneck for Video Understanding
AAAI 2026
Towards Intrinsic Interpretability of Large Language Models: A Survey of Design Principles and Architectures
ACL 2026
NeuSpring: Neural Spring Fields for Reconstruction and Simulation of Deformable Objects from Videos
AAAI 2026
Pushing Rendering Boundaries: Hard Gaussian Splatting
AAAI 2026
Personalize Your Gaussian: Consistent 3D Scene Personalization from a Single Image
AAAI 2026
Almost Optimal Batch-Regret Tradeoff for Batch Linear Contextual Bandits
ICLR 2025
Safety-Polarized and Prioritized Reinforcement Learning
ICML 2025
An LLM-Empowered Adaptive Evolutionary Algorithm for Multi-Component Deep Learning Systems
AAAI 2025
Exploiting the Shadows: Unveiling Privacy Leaks through Lower-Ranked Tokens in Large Language Models
ACL 2025
CPRM: A LLM-based Continual Pre-training Framework for Relevance Modeling in Commercial Search
NAACL 2025
Graph Disentanglement Learning for fMRI Analysis: Decoupling Disease, Covariates, and Individual Variability
MICCAI 2025
CARE Transformer: Mobile-Friendly Linear Visual Transformer via Decoupled Dual Interaction
CVPR 2025
Long Video Diffusion Generation with Segmented Cross-Attention and Content-Rich Video Data Curation
CVPR 2025
On Path to Multimodal Generalist: General-Level and General-Bench
ICML 2025
Advancing DRL Agents in Commercial Fighting Games: Training, Integration, and Agent-Human Alignment
ICML 2024
Robust Situational Reinforcement Learning in Face of Context Disturbances
ICML 2023
Doodle to Object: Practical Zero-Shot Sketch-Based 3D Shape Retrieval
AAAI 2023
Learning Sparse Group Models Through Boolean Relaxation
ICLR 2023
ABC-KD: Attention-Based-Compression Knowledge Distillation for Deep Learning-Based Noise Suppression
INTERSPEECH 2023
High-Fidelity and Freely Controllable Talking Head Video Generation
CVPR 2023
Proximal Exploration for Model-guided Protein Sequence Design
ICML 2022
Near-Optimal Regret Bounds for Multi-batch Reinforcement Learning
NIPS 2022
Imitation Learning from Observations under Transition Model Disparity
ICLR 2022
Learning Long-Term Reward Redistribution via Randomized Return Decomposition
ICLR 2022
Off-Policy Reinforcement Learning with Delayed Rewards
ICML 2022
Dynamic Car Dispatching and Pricing: Revenue and Fairness for Ridesharing Platforms
IJCAI 2022
Probabilistic Programs with Stochastic Conditioning
ICML 2021
Model-Free Reinforcement Learning: from Clipped Pseudo-Regret to Sample Complexity
ICML 2021
Tight Regret Bounds for Infinite-armed Linear Contextual Bandits
AISTATS 2021
Near-Optimal MNL Bandits Under Risk Criteria
AAAI 2021
Learning Guidance Rewards with Trajectory-space Smoothing
NIPS 2020
Multinomial Logit Bandit with Low Switching Cost
ICML 2020
A PTAS for the Bayesian Thresholding Bandit Problem
AISTATS 2020
Divide, Conquer, and Combine: a New Inference Strategy for Probabilistic Programs with Stochastic Support
ICML 2020
Adaptive Double-Exploration Tradeoff for Outlier Detection
AAAI 2020
Harnessing Distribution Ratio Estimators for Learning Agents with Quality and Diversity
CORL 2020
Almost Optimal Model-Free Reinforcement Learningvia Reference-Advantage Decomposition
NIPS 2020
Root-n-Regret for Learning in Markov Decision Processes with Function Approximation and Low Bellman Rank
COLT 2020
Dynamic Assortment Optimization with Changing Contextual Information
JMLR 2020
Thresholding Bandit with Optimal Aggregate Regret
NIPS 2019
Channel Gating Neural Networks
NIPS 2019
Off-Policy Evaluation and Learning from Logged Bandit Feedback: Error Reduction via Surrogate Policy
ICLR 2019
Exploration via Hindsight Goal Generation
NIPS 2019
Nearly Minimax-Optimal Regret for Linearly Parameterized Bandits
COLT 2019
LF-PPL: A Low-Level First Order Probabilistic Programming Language for Non-Differentiable Models
AISTATS 2019
Near-Optimal Policies for Dynamic Multinomial Logit Assortment Selection Models
NIPS 2018
Tight Bounds for Collaborative PAC Learning via Multiplicative Weights
NIPS 2018
Best Arm Identification in Linear Bandits with Linear Dimension Dependency
ICML 2018
Adaptive Multiple-Arm Identification
ICML 2017
Optimal PAC Multiple Arm Identification with Applications to Crowdsourcing
ICML 2014