Papers
4,122 papers found
On Instrumental Variable Regression for Deep Offline Policy Evaluation
Yutian Chen, Liyuan Xu, Caglar Gulcehre et al.
Online Mirror Descent and Dual Averaging: Keeping Pace in the Dynamic Case
Huang Fang, Nicholas J. A. Harvey, Victor S. Portella et al.
Online Nonnegative CP-dictionary Learning for Markovian Data
Hanbaek Lyu, Christopher Strohmeier, Deanna Needell
On Low-rank Trace Regression under General Sampling Distribution
Nima Hamidi, Mohsen Bayati
On Mixup Regularization
Luigi Carratino, Moustapha Cissé, Rodolphe Jenatton et al.
On Regularized Square-root Regression Problems: Distributionally Robust Interpretation and Fast Computations
Hong T.M. Chu, Kim-Chuan Toh, Yangjing Zhang
On the Approximation of Cooperative Heterogeneous Multi-Agent Reinforcement Learning (MARL) using Mean Field Control (MFC)
Washim Uddin Mondal, Mridul Agarwal, Vaneet Aggarwal et al.
On the Complexity of Approximating Multimarginal Optimal Transport
Tianyi Lin, Nhat Ho, Marco Cuturi et al.
On the Efficiency of Entropic Regularized Algorithms for Optimal Transport
Tianyi Lin, Nhat Ho, Michael I. Jordan
On the Robustness to Misspecification of α-posteriors and Their Variational Approximations
Marco Avella Medina, José Luis Montiel Olea, Cynthia Rush et al.
Optimality and Stability in Non-Convex Smooth Games
Guojun Zhang, Pascal Poupart, Yaoliang Yu
Optimal Transport for Stationary Markov Chains via Policy Iteration
Kevin O'Connor, Kevin McGoff, Andrew B. Nobel
Oracle Complexity in Nonsmooth Nonconvex Optimization
Guy Kornowski, Ohad Shamir
Overparameterization of Deep ResNet: Zero Loss and Mean-field Analysis
Zhiyan Ding, Shi Chen, Qin Li et al.
OVERT: An Algorithm for Safety Verification of Neural Network Control Policies for Nonlinear Systems
Chelsea Sidrane, Amir Maleki, Ahmed Irfan et al.
PAC Guarantees and Effective Algorithms for Detecting Novel Categories
Si Liu, Risheek Garrepalli, Dan Hendrycks et al.
Pathfinder: Parallel quasi-Newton variational inference
Lu Zhang, Bob Carpenter, Andrew Gelman et al.
PECOS: Prediction for Enormous and Correlated Output Spaces
Hsiang-Fu Yu, Kai Zhong, Jiong Zhang et al.
Policy Gradient and Actor-Critic Learning in Continuous Time and Space: Theory and Algorithms
Yanwei Jia, Xun Yu Zhou
Posterior Asymptotics for Boosted Hierarchical Dirichlet Process Mixtures
Marta Catalano, Pierpaolo De Blasi, Antonio Lijoi et al.
Power Iteration for Tensor PCA
Jiaoyang Huang, Daniel Z. Huang, Qing Yang et al.
Principal Components Bias in Over-parameterized Linear Models, and its Manifestation in Deep Neural Networks
Guy Hacohen, Daphna Weinshall
Prior Adaptive Semi-supervised Learning with Application to EHR Phenotyping
Yichi Zhang, Molei Liu, Matey Neykov et al.