Papers
An Empirical Study of Stochastic Gradient Descent with Structured Covariance Noise
Yeming Wen, Kevin Luk, Maxime Gazeau et al.
A nonasymptotic law of iterated logarithm for general M-estimators
Nicolas Schreuder, Victor-Emmanuel Brunel, Arnak Dalalyan
A Nonparametric Off-Policy Policy Gradient
Samuele Tosatto, Joao Carvalho, Hany Abdulsamad et al.
An Optimal Algorithm for Adversarial Bandits with Arbitrary Delays
Julian Zimmert, Yevgeny Seldin
A Novel Confidence-Based Algorithm for Structured Bandits
Andrea Tirinzoni, Alessandro Lazaric, Marcello Restelli
AP-Perf: Incorporating Generic Performance Metrics in Differentiable Learning
Rizal Fathony, Zico Kolter
Approximate Cross-validation: Guarantees for Model Assessment and Selection
Ashia Wilson, Maximilian Kasy, Lester Mackey
Approximate Cross-Validation in High Dimensions with Guarantees
William Stephenson, Tamara Broderick
Approximate Inference in Discrete Distributions with Monte Carlo Tree Search and Value Functions
Lars Buesing, Nicolas Heess, Theophane Weber
Approximate Inference with Wasserstein Gradient Flows
Charlie Frogner, Tomaso Poggio
A Practical Algorithm for Multiplayer Bandits when Arm Means Vary Among Players
Abbas Mehrabian, Etienne Boursier, Emilie Kaufmann et al.
A Primal-Dual Solver for Large-Scale Tracking-by-Assignment
Stefan Haller, Mangal Prakash, Lisa Hutschenreiter et al.
A principled approach for generating adversarial images under non-smooth dissimilarity metrics
Aram-Alexandre Pooladian, Chris Finlay, Tim Hoheisel et al.
A PTAS for the Bayesian Thresholding Bandit Problem
Jian Peng, Yue Qin, Yadi Wei et al.
A Reduction from Reinforcement Learning to No-Regret Online Learning
Ching-An Cheng, Remi Tachet Combes, Byron Boots et al.
A Robust Univariate Mean Estimator is All You Need
Adarsh Prasad, Sivaraman Balakrishnan, Pradeep Ravikumar
A Rule for Gradient Estimator Selection, with an Application to Variational Inference
Tomas Geffner, Justin Domke
ASAP: Architecture Search, Anneal and Prune
Asaf Noy, Niv Nayman, Tal Ridnik et al.
A Simple Approach for Non-stationary Linear Bandits
Peng Zhao, Lijun Zhang, Yuan Jiang et al.
A single algorithm for both restless and rested rotting bandits
Julien Seznec, Pierre Menard, Alessandro Lazaric et al.
Assessing Local Generalization Capability in Deep Models
Huan Wang, Nitish Shirish Keskar, Caiming Xiong et al.
A Stein Goodness-of-fit Test for Directional Distributions
Wenkai Xu, Takeru Matsuda
Asymptotically Efficient Off-Policy Evaluation for Tabular Reinforcement Learning
Ming Yin, Yu-Xiang Wang