Jincheng Mei
21 papers · 2016–2025 · 7 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+10 more ↓ Show less ↑
π Cross-Pollinator (10) π§ Keyword Pioneer π Academic Marathon (9) π Conference Polyglot (7) π Renaissance Researcher (6)
π
Renaissance Researcher
(6)
π
Interdisciplinary Bridge
πΊοΈ
Taxonomy Completionist
(30)
π€
Dynamic Duo
(16)
π
Triple Crown
π§¬
Topic Evolution
π₯
Unstoppable
(7)
ποΈ
Keyword Collector
(78)
π
Century Club
(21)
π
Trend Setter
Conferences
NIPS (7)
ICML (6)
ICLR (3)
AISTATS (2)
EMNLP (1)
IJCAI (1)
UAI (1)
Top co-authors
Keywords
convergence rate
(4)
global convergence
(3)
sample efficiency
(2)
convergence analysis
(2)
stochastic gradient
(2)
policy gradient
(2)
regret bound
(2)
online learning
(2)
reinforcement learning
(2)
softmax policy
(2)
natural policy gradient
(2)
multi-armed bandit
(2)
mirror descent
(2)
sequential decision making
(1)
sentiment analysis
(1)
maximum entropy
(1)
stochastic gradient descent
(1)
policy optimization
(1)
probabilistic modeling
(1)
topic modeling
(1)
Papers
Faster WIND: Accelerating Iterative Best-of-$N$ Distillation for LLM Alignment
AISTATS 2025
Value-Incentivized Preference Optimization: A Unified Approach to Online and Offline RLHF
ICLR 2025
Small steps no more: Global convergence of stochastic gradient bandits for arbitrary learning rates
NIPS 2024
Target Networks and Over-parameterization Stabilize Off-policy Bootstrapping with Function Approximation
ICML 2024
Stochastic Gradient Succeeds for Bandits
ICML 2023
Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice
ICML 2023
Ordering-based Conditions for Global Convergence of Policy Gradient Methods
NIPS 2023
Understanding and mitigating the limitations of prioritized experience replay
UAI 2022
The Role of Baselines in Policy Gradient Optimization
NIPS 2022
On the Global Convergence Rates of Decentralized Softmax Gradient Play in Markov Potential Games
NIPS 2022
Understanding and Leveraging Overparameterization in Recursive Value Estimation
ICLR 2022
Understanding the Effect of Stochasticity in Policy Optimization
NIPS 2021
On the Optimality of Batch Policy Optimization Algorithms
ICML 2021
Leveraging Non-uniformity in First-order Non-convex Optimization
ICML 2021
Frequency-based Search-control in Dyna
ICLR 2020
Escaping the Gravitational Pull of Softmax
NIPS 2020
On the Global Convergence Rates of Softmax Policy Gradient Methods
ICML 2020
Maximum Entropy Monte-Carlo Planning
NIPS 2019
On Principled Entropy Exploration in Policy Optimization
IJCAI 2019
Identifying and Tracking Sentiments and Topics from Social Media Texts during Natural Disasters
EMNLP 2017
On the Reducibility of Submodular Functions
AISTATS 2016