Papers
Neural Text Generation With Unlikelihood Training
Sean Welleck, Ilia Kulikov, Stephen Roller et al.
NeurQuRI: Neural Question Requirement Inspector for Answerability Prediction in Machine Reading Comprehension
Seohyun Back, Sai Chetan Chinthakindi, Akhil Kedia et al.
Never Give Up: Learning Directed Exploration Strategies
Adrià Puigdomènech Badia, Pablo Sprechmann, Alex Vitvitskyi et al.
Non-Autoregressive Dialog State Tracking
Hung Le, Richard Socher, Steven C.H. Hoi
Novelty Detection Via Blurring
Sungik Choi, Sae-Young Chung
Oblique Decision Trees from Derivatives of ReLU Networks
Guang-He Lee, Tommi S. Jaakkola
Observational Overfitting in Reinforcement Learning
Xingyou Song, Yiding Jiang, Stephen Tu et al.
On Bonus Based Exploration Methods In The Arcade Learning Environment
Adrien Ali Taiga, William Fedus, Marlos C. Machado et al.
Once-for-All: Train One Network and Specialize it for Efficient Deployment
Han Cai, Chuang Gan, Tianzhe Wang et al.
On Computation and Generalization of Generative Adversarial Imitation Learning
Minshuo Chen, Yizhou Wang, Tianyi Liu et al.
One-Shot Pruning of Recurrent Neural Networks by Jacobian Spectrum Evaluation
Shunshi Zhang, Bradly C. Stadie
On Generalization Error Bounds of Noisy Gradient Methods for Non-Convex Learning
Jian Li, Xuanyuan Luo, Mingda Qiao
On Identifiability in Transformers
Gino Brunner, Yang Liu, Damian Pascual et al.
Online and stochastic optimization beyond Lipschitz continuity: A Riemannian approach
Kimon Antonakopoulos, E. Veronica Belmega, Panayotis Mertikopoulos
On Mutual Information Maximization for Representation Learning
Michael Tschannen, Josip Djolonga, Paul K. Rubenstein et al.
On Robustness of Neural Ordinary Differential Equations
Hanshu YAN, Jiawei DU, Vincent TAN et al.
On Solving Minimax Optimization Locally: A Follow-the-Ridge Approach
Yuanhao Wang*, Guodong Zhang*, Jimmy Ba
On the Convergence of FedAvg on Non-IID Data
Xiang Li, Kaixuan Huang, Wenhao Yang et al.
On the Equivalence between Positional Node Embeddings and Structural Graph Representations
Balasubramaniam Srinivasan, Bruno Ribeiro
On the Global Convergence of Training Deep Linear ResNets
Difan Zou, Philip M. Long, Quanquan Gu
On the interaction between supervision and self-play in emergent communication
Ryan Lowe*, Abhinav Gupta*, Jakob Foerster et al.
On the Need for Topology-Aware Generative Models for Manifold-Based Defenses
Uyeong Jang, Susmit Jha, Somesh Jha
On the Relationship between Self-Attention and Convolutional Layers
Jean-Baptiste Cordonnier, Andreas Loukas, Martin Jaggi
On the "steerability" of generative adversarial networks
Ali Jahanian*, Lucy Chai*, Phillip Isola
On the Variance of the Adaptive Learning Rate and Beyond
Liyuan Liu, Haoming Jiang, Pengcheng He et al.