Papers
Training Deep Convolutional Neural Networks to Play Go
Christopher Clark, Amos Storkey
Trust Region Policy Optimization
John Schulman, Sergey Levine, Pieter Abbeel et al.
Universal Value Function Approximators
Tom Schaul, Daniel Horgan, Karol Gregor et al.
Un-regularizing: approximate proximal point and faster stochastic algorithms for empirical risk minimization
Roy Frostig, Rong Ge, Sham Kakade et al.
Unsupervised Domain Adaptation by Backpropagation
Yaroslav Ganin, Victor Lempitsky
Unsupervised Learning of Video Representations using LSTMs
Nitish Srivastava, Elman Mansimov, Ruslan Salakhudinov
Variational Generative Stochastic Networks with Collaborative Shaping
Philip Bachman, Doina Precup
Variational Inference for Gaussian Process Modulated Poisson Processes
Chris Lloyd, Tom Gunter, Michael Osborne et al.
Variational Inference with Normalizing Flows
Danilo Rezende, Shakir Mohamed
Vector-Space Markov Random Fields via Exponential Families
Wesley Tansey, Oscar Hernan Madrid Padilla, Arun Sai Suggala et al.
Weight Uncertainty in Neural Network
Charles Blundell, Julien Cornebise, Koray Kavukcuoglu et al.
Yinyang K-Means: A Drop-In Replacement of the Classic K-Means with Consistent Speedup
Yufei Ding, Yue Zhao, Xipeng Shen et al.
A Bayesian Framework for Online Classifier Ensemble
Qinxun Bai, Henry Lam, Stan Sclaroff
A Bayesian Wilcoxon signed-rank test based on the Dirichlet process
Alessio Benavoli, Giorgio Corani, Francesca Mangili et al.
Accelerated Proximal Stochastic Dual Coordinate Ascent for Regularized Loss Minimization
Shai Shalev-Shwartz, Tong Zhang
A Clockwork RNN
Jan Koutnik, Klaus Greff, Faustino Gomez et al.
A Compilation Target for Probabilistic Programming Languages
Brooks Paige, Frank Wood
A Consistent Histogram Estimator for Exchangeable Graph Models
Stanley Chan, Edoardo Airoldi
A Convergence Rate Analysis for LogitBoost, MART and Their Variant
Peng Sun, Tong Zhang, Jie Zhou
Active Detection via Adaptive Submodularity
Yuxin Chen, Hiroaki Shioi, Cesar Fuentes Montesinos et al.
Active Learning of Parameterized Skills
Bruno Da Silva, George Konidaris, Andrew Barto
Active Transfer Learning under Model Shift
Xuezhi Wang, Tzu-Kuo Huang, Jeff Schneider
Adaptive Monte Carlo via Bandit Allocation
James Neufeld, Andras Gyorgy, Csaba Szepesvari et al.
Adaptivity and Optimism: An Improved Exponentiated Gradient Algorithm
Jacob Steinhardt, Percy Liang