Papers
On Relativistic f-Divergences
Alexia Jolicoeur-Martineau
On Second-Order Group Influence Functions for Black-Box Predictions
Samyadeep Basu, Xuchen You, Soheil Feizi
On Semi-parametric Inference for BART
Veronika Rockova
On the consistency of top-k surrogate losses
Forest Yang, Sanmi Koyejo
On the Convergence of Nesterov’s Accelerated Gradient Method in Stochastic Settings
Mahmoud Assran, Mike Rabbat
On the Expressivity of Neural Networks for Deep Reinforcement Learning
Kefan Dong, Yuping Luo, Tianhe Yu et al.
On the Generalization Benefit of Noise in Stochastic Gradient Descent
Samuel Smith, Erich Elsen, Soham De
On the Generalization Effects of Linear Transformations in Data Augmentation
Sen Wu, Hongyang Zhang, Gregory Valiant et al.
On the Global Convergence Rates of Softmax Policy Gradient Methods
Jincheng Mei, Chenjun Xiao, Csaba Szepesvari et al.
On the Global Optimality of Model-Agnostic Meta-Learning
Lingxiao Wang, Qi Cai, Zhuoran Yang et al.
On the (In)tractability of Computing Normalizing Constants for the Product of Determinantal Point Processes
Naoto Ohsaka, Tatsuya Matsuoka
On the Iteration Complexity of Hypergradient Computation
Riccardo Grazzi, Luca Franceschi, Massimiliano Pontil et al.
On the Noisy Gradient Descent that Generalizes as SGD
Jingfeng Wu, Wenqing Hu, Haoyi Xiong et al.
On the Number of Linear Regions of Convolutional Neural Networks
Huan Xiong, Lei Huang, Mengyang Yu et al.
On the Power of Compressed Sensing with Generative Models
Akshay Kamath, Eric Price, Sushrut Karmalkar
On the Relation between Quality-Diversity Evaluation and Distribution-Fitting Goal in Text Generation
Jianing Li, Yanyan Lan, Jiafeng Guo et al.
On the Sample Complexity of Adversarial Multi-Source PAC Learning
Nikola Konstantinov, Elias Frantar, Dan Alistarh et al.
On the Theoretical Properties of the Network Jackknife
Qiaohui Lin, Robert Lunde, Purnamrita Sarkar
On the Unreasonable Effectiveness of the Greedy Algorithm: Greedy Adapts to Sharpness
Sebastian Pokutta, Mohit Singh, Alfredo Torrico
On Unbalanced Optimal Transport: An Analysis of Sinkhorn Algorithm
Khiem Pham, Khang Le, Nhat Ho et al.
On Validation and Planning of An Optimal Decision Rule with Application in Healthcare Studies
Hengrui Cai, Wenbin Lu, Rui Song
On Variational Learning of Controllable Representations for Text without Supervision
Peng Xu, Jackie Chi Kit Cheung, Yanshuai Cao
Operation-Aware Soft Channel Pruning using Differentiable Masks
Minsoo Kang, Bohyung Han
Optimal approximation for unconstrained non-submodular minimization
Marwa El Halabi, Stefanie Jegelka
Optimal Bounds between f-Divergences and Integral Probability Metrics
Rohit Agrawal, Thibaut Horel