Papers
A general class of surrogate functions for stable and efficient reinforcement learning
Sharan Vaswani, Olivier Bachem, Simone Totaro et al.
A general sample complexity analysis of vanilla policy gradient
Rui Yuan, Robert M. Gower, Alessandro Lazaric
A Globally Convergent Evolutionary Strategy for Stochastic Constrained Optimization with Applications to Reinforcement Learning
Youssef Diouane, Aurelien Lucchi, Vihang Prakash Patil
A Last Switch Dependent Analysis of Satiation and Seasonality in Bandits
Pierre Laforgue, Giulia Clerici, Nicolò Cesa-Bianchi et al.
Aligned Multi-Task Gaussian Process
Olga Mikheeva, Ieva Kazlauskaite, Adam Hartshorne et al.
Almost Optimal Universal Lower Bound for Learning Causal DAGs with Atomic Interventions
Vibhor Porwal, Piyush Srivastava, Gaurav Sinha
A Manifold View of Adversarial Risk
Wenjia Zhang, Yikai Zhang, Xiaoling Hu et al.
Amortised Likelihood-free Inference for Expensive Time-series Simulators with Signatured Ratio Estimation
Joel Dyer, Patrick W. Cannon, Sebastian M. Schmon
Amortized Rejection Sampling in Universal Probabilistic Programming
Saeid Naderiparizi, Adam Scibior, Andreas Munk et al.
An Alternate Policy Gradient Estimator for Softmax Policies
Shivam Garg, Samuele Tosatto, Yangchen Pan et al.
Analysis of a Target-Based Actor-Critic Algorithm with Linear Function Approximation
Anas Barakat, Pascal Bianchi, Julien Lehmann
A New Notion of Individually Fair Clustering: $α$-Equitable $k$-Center
Darshan Chakrabarti, John P. Dickerson, Seyed A. Esmaeili et al.
An Information-theoretical Approach to Semi-supervised Learning under Covariate-shift
Gholamali Aminian, Mahed Abroshan, Mohammad Mahdi Khalili et al.
An Information-Theoretic Justification for Model Pruning
Berivan Isik, Tsachy Weissman, Albert No
A Non-asymptotic Approach to Best-Arm Identification for Gaussian Bandits
Antoine Barrier, Aurélien Garivier, Tomáš Kocák
An Online Learning Approach to Interpolation and Extrapolation in Domain Generalization
Elan Rosenfeld, Pradeep Ravikumar, Andrej Risteski
An Optimal Algorithm for Strongly Convex Minimization under Affine Constraints
Adil Salim, Laurent Condat, Dmitry Kovalev et al.
An Unsupervised Hunt for Gravitational Lenses
Stephen Sheng, Keerthi Vasan G C, Chi Po P Choi et al.
Approximate Function Evaluation via Multi-Armed Bandits
Tavor Z. Baharav, Gary Cheng, Mert Pilanci et al.
Approximate Top-$m$ Arm Identification with Heterogeneous Reward Variances
Ruida Zhou, Chao Tian
A Predictive Approach to Bayesian Nonparametric Survival Analysis
Edwin Fong, Brieuc Lehmann
A prior-based approximate latent Riemannian metric
Georgios Arvanitidis, Bogdan M. Georgiev, Bernhard Schölkopf
A Random Matrix Perspective on Mixtures of Nonlinearities in High Dimensions
Ben Adlam, Jake A. Levinson, Jeffrey Pennington
Are All Linear Regions Created Equal?
Matteo Gamba, Adrian Chmielewski-Anders, Josephine Sullivan et al.
A Single-Timescale Method for Stochastic Bilevel Optimization
Tianyi Chen, Yuejiao Sun, Quan Xiao et al.