Qingpeng Cai
14 papers · 2016–2025 · 6 conferences · across top CS/AI conferences
Achievements
Renaissance Researcher (6)
Interdisciplinary Bridge
Conference Polyglot (6)
Academic Marathon (9)
Taxonomy Completionist (27)
Grand Slam
Century Club (14)
Keyword Collector (64)
Conference Pioneer
Conferences
AAAI (6)
NIPS (3)
IJCAI (2)
CVPR (1)
ICLR (1)
ICML (1)
Keywords
deep deterministic policy gradient (3)
reinforcement learning (3)
model-based reinforcement learning (2)
user simulation (2)
recommender system (2)
deep reinforcement learning (2)
continuous control (2)
policy optimization (1)
pseudo labeling (1)
semi-supervised learning (1)
image captioning (1)
medical image classification (1)
hierarchical reinforcement learning (1)
markov decision process (1)
sentiment analysis (1)
adversarial learning (1)
exploration exploitation (1)
ensemble learning (1)
value iteration (1)
value function (1)
Papers
Flow Factorization for Efficient Generative Flow Networks
AAAI 2025
Random Policy Evaluation Uncovers Policies of Generative Flow Networks
ICML 2025
LLM-Powered User Simulator for Recommender System
AAAI 2025
ResAct: Reinforcing Long-term Engagement in Sequential Recommendation with Residual Actor
ICLR 2023
State Regularized Policy Optimization on Data with Dynamics Shift
NIPS 2023
KuaiSim: A Comprehensive Simulator for Recommender Systems
NIPS 2023
MAGIC: Multimodal relAtional Graph adversarIal inferenCe for Diverse and Unpaired Text-Based Image Captioning
AAAI 2022
BoostMIS: Boosting Medical Image Semi-Supervised Learning With Adaptive Pseudo Labeling and Informative Active Annotation
CVPR 2022
Reinforcement Learning with Dynamic Boltzmann Softmax Updates
IJCAI 2020
Deterministic Value-Policy Gradients
AAAI 2020
Softmax Deep Double Deterministic Policy Gradients
NIPS 2020
A Deep Reinforcement Learning Framework for Rebalancing Dockless Bike Sharing Systems
AAAI 2019
Policy Optimization with Model-Based Explorations
AAAI 2019
Facility Location with Minimax Envy
IJCAI 2016