Jincheng Mei

21 papers · 2016–2025 · 7 conferences · across top CS/AI conferences

Achievements

+10 more ↓

🐝 Cross-Pollinator (10) 🧭 Keyword Pioneer 🏃 Academic Marathon (9) 🌍 Conference Polyglot (7) 🌈 Renaissance Researcher (6)

🌈 Renaissance Researcher (6) 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (30) 🤝 Dynamic Duo (16) 👑 Triple Crown 🧬 Topic Evolution 🔥 Unstoppable (7) 🗃️ Keyword Collector (78) 💎 Century Club (21) 📈 Trend Setter

Conferences

NIPS (7) ICML (6) ICLR (3) AISTATS (2) EMNLP (1) IJCAI (1) UAI (1)

Top co-authors

Dale Schuurmans (16) Bo Dai (13) Csaba Szepesvári (10) Chenjun Xiao (8) Alekh Agarwal (3) Martin Müller (2) Tong Yang (2) Amir-massoud Farahmand (2) Lihong Li (2) Yuejie Chi (2)

Keywords

convergence rate (4) global convergence (3) sample efficiency (2) convergence analysis (2) stochastic gradient (2) policy gradient (2) regret bound (2) online learning (2) reinforcement learning (2) softmax policy (2) natural policy gradient (2) multi-armed bandit (2) mirror descent (2) sequential decision making (1) sentiment analysis (1) maximum entropy (1) stochastic gradient descent (1) policy optimization (1) probabilistic modeling (1) topic modeling (1)

Papers

Faster WIND: Accelerating Iterative Best-of-$N$ Distillation for LLM Alignment AISTATS 2025 Value-Incentivized Preference Optimization: A Unified Approach to Online and Offline RLHF ICLR 2025 Small steps no more: Global convergence of stochastic gradient bandits for arbitrary learning rates NIPS 2024 Target Networks and Over-parameterization Stabilize Off-policy Bootstrapping with Function Approximation ICML 2024 Stochastic Gradient Succeeds for Bandits ICML 2023 Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice ICML 2023 Ordering-based Conditions for Global Convergence of Policy Gradient Methods NIPS 2023 Understanding and mitigating the limitations of prioritized experience replay UAI 2022 The Role of Baselines in Policy Gradient Optimization NIPS 2022 On the Global Convergence Rates of Decentralized Softmax Gradient Play in Markov Potential Games NIPS 2022 Understanding and Leveraging Overparameterization in Recursive Value Estimation ICLR 2022 Understanding the Effect of Stochasticity in Policy Optimization NIPS 2021 On the Optimality of Batch Policy Optimization Algorithms ICML 2021 Leveraging Non-uniformity in First-order Non-convex Optimization ICML 2021 Frequency-based Search-control in Dyna ICLR 2020 Escaping the Gravitational Pull of Softmax NIPS 2020 On the Global Convergence Rates of Softmax Policy Gradient Methods ICML 2020 Maximum Entropy Monte-Carlo Planning NIPS 2019 On Principled Entropy Exploration in Policy Optimization IJCAI 2019 Identifying and Tracking Sentiments and Topics from Social Media Texts during Natural Disasters EMNLP 2017 On the Reducibility of Submodular Functions AISTATS 2016