Papers
Provably Convergent Two-Timescale Off-Policy Actor-Critic with Function Approximation
Shangtong Zhang, Bo Liu, Hengshuai Yao et al.
Provably Efficient Exploration in Policy Optimization
Qi Cai, Zhuoran Yang, Chi Jin et al.
Provably Efficient Model-based Policy Adaptation
Yuda Song, Aditi Mavalankar, Wen Sun et al.
Proving the Lottery Ticket Hypothesis: Pruning is All You Need
Eran Malach, Gilad Yehudai, Shai Shalev-Schwartz et al.
Puzzle Mix: Exploiting Saliency and Local Statistics for Optimal Mixup
Jang-Hyun Kim, Wonho Choo, Hyun Oh Song
Quadratically Regularized Subgradient Methods for Weakly Convex Optimization with Weakly Convex Constraints
Runchao Ma, Qihang Lin, Tianbao Yang
Quantized Decentralized Stochastic Learning over Directed Graphs
Hossein Taheri, Aryan Mokhtari, Hamed Hassani et al.
Quantum Boosting
Srinivasan Arunachalam, Reevu Maity
Quantum Expectation-Maximization for Gaussian mixture models
Iordanis Kerenidis, Alessandro Luongo, Anupam Prakash
Q-value Path Decomposition for Deep Multiagent Reinforcement Learning
Yaodong Yang, Jianye Hao, Guangyong Chen et al.
R2-B2: Recursive Reasoning-Based Bayesian Optimization for No-Regret Learning in Games
Zhongxiang Dai, Yizhou Chen, Bryan Kian Hsiang Low et al.
Radioactive data: tracing through training
Alexandre Sablayrolles, Matthijs Douze, Cordelia Schmid et al.
Random extrapolation for primal-dual coordinate descent
Ahmet Alacaoglu, Olivier Fercoq, Volkan Cevher
Random Hypervolume Scalarizations for Provable Multi-Objective Black Box Optimization
Richard Zhang, Daniel Golovin
Randomization matters How to defend against strong adversarial attacks
Rafael Pinot, Raphael Ettedgui, Geovani Rizk et al.
Randomized Block-Diagonal Preconditioning for Parallel Learning
Celestine Mendler-Dünner, Aurelien Lucchi
Randomized Smoothing of All Shapes and Sizes
Greg Yang, Tony Duan, J. Edward Hu et al.
Randomly Projected Additive Gaussian Processes for Regression
Ian Delbridge, David Bindel, Andrew Gordon Wilson
Random Matrix Theory Proves that Deep Learning Representations of GAN-data Behave as Gaussian Mixtures
Mohamed El Amine Seddik, Cosme Louart, Mohamed Tamaazousti et al.
Rank Aggregation from Pairwise Comparisons in the Presence of Adversarial Corruptions
Arpit Agarwal, Shivani Agarwal, Sanjeev Khanna et al.
Rate-distortion optimization guided autoencoder for isometric embedding in Euclidean latent space
Keizo Kato, Jing Zhou, Tomotake Sasaki et al.
Ready Policy One: World Building Through Active Learning
Philip Ball, Jack Parker-Holder, Aldo Pacchiano et al.
Real-Time Optimisation for Online Learning in Auctions
Lorenzo Croissant, Marc Abeille, Clement Calauzenes
Recht-Re Noncommutative Arithmetic-Geometric Mean Conjecture is False
Zehua Lai, Lek-Heng Lim