Papers
11,951 papers found
Structured Neural Summarization
Patrick Fernandes, Miltiadis Allamanis, Marc Brockschmidt
Subgradient Descent Learns Orthogonal Dictionaries
Yu Bai, Qijia Jiang, Ju Sun
Supervised Community Detection with Line Graph Neural Networks
Zhengdao Chen, Lisha Li, Joan Bruna
Supervised Policy Update for Deep Reinforcement Learning
Quan Vuong, Yiming Zhang, Keith W. Ross
Synthetic Datasets for Neural Program Synthesis
Richard Shin, Neel Kant, Kavi Gupta et al.
Systematic Generalization: What Is Required and Can It Be Learned?
Dzmitry Bahdanau*, Shikhar Murty*, Michael Noukhovitch et al.
Taming the Noisy Gradient: Train Deep Neural Networks with Small Batch Sizes
Yikai Zhang, Hui Qu, Chao Chen et al.
Temporal Difference Variational Auto-Encoder
Karol Gregor, George Papamakarios, Frederic Besse et al.
textTOvec: DEEP CONTEXTUALIZED NEURAL AUTOREGRESSIVE TOPIC MODELS OF LANGUAGE WITH DISTRIBUTED COMPOSITIONAL PRIOR
Pankaj Gupta, Yatin Chaudhary, Florian Buettner et al.
The Comparative Power of ReLU Networks and Polynomial Kernels in the Presence of Sparse Latent Structure
Frederic Koehler, Andrej Risteski
The Deep Weight Prior
Andrei Atanov, Arsenii Ashukha, Kirill Struminsky et al.
The Laplacian in RL: Learning Representations with Efficient Approximations
Yifan Wu, George Tucker, Ofir Nachum
The Limitations of Adversarial Training and the Blind-Spot Attack
Huan Zhang*, Hongge Chen*, Zhao Song et al.
The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks
Jonathan Frankle, Michael Carbin
The Monophthongs of Formal Nigerian English: An Acoustic Analysis
Nisad Jamakovic, Robert Fuchs
The Neuro-Symbolic Concept Learner: Interpreting Scenes, Words, and Sentences From Natural Supervision
Jiayuan Mao, Chuang Gan, Pushmeet Kohli et al.
Theoretical Analysis of Auto Rate-Tuning by Batch Normalization
Sanjeev Arora, Zhiyuan Li, Kaifeng Lyu
There Are Many Consistent Explanations of Unlabeled Data: Why You Should Average
Ben Athiwaratkun, Marc Finzi, Pavel Izmailov et al.
The relativistic discriminator: a key element missing from standard GAN
Alexia Jolicoeur-Martineau
The role of over-parametrization in generalization of neural networks
Behnam Neyshabur, Zhiyuan Li, Srinadh Bhojanapalli et al.
The Singular Values of Convolutional Layers
Hanie Sedghi, Vineet Gupta, Philip M. Long
The Unusual Effectiveness of Averaging in GAN Training
Yasin Yaz{\i}c{\i}, Chuan-Sheng Foo, Stefan Winkler et al.
Three Mechanisms of Weight Decay Regularization
Guodong Zhang, Chaoqi Wang, Bowen Xu et al.
TimbreTron: A WaveNet(CycleGAN(CQT(Audio))) Pipeline for Musical Timbre Transfer
Sicong Huang, Qiyang Li, Cem Anil et al.
Time-Agnostic Prediction: Predicting Predictable Video Frames
Dinesh Jayaraman, Frederik Ebert, Alexei Efros et al.