Papers
21,849 papers found
TD_gamma: Re-evaluating Complex Backups in Temporal Difference Learning
George Konidaris, Scott Niekum, Philip S. Thomas
t-divergence Based Approximate Inference
Nan Ding, Yuan Qi, S.v.n. Vishwanathan
Testing a Bayesian Measure of Representativeness Using a Large Image Database
Joshua T. Abbott, Katherine A. Heller, Zoubin Ghahramani et al.
The Doubly Correlated Nonparametric Topic Model
Dae I. Kim, Erik B. Sudderth
The Fast Convergence of Boosting
Matus J. Telgarsky
The Fixed Points of Off-Policy TD
J. Z. Kolter
The Impact of Unlabeled Patterns in Rademacher Complexity Theory for Kernel Classifiers
Luca Oneto, Davide Anguita, Alessandro Ghio et al.
The Kernel Beta Process
Lu Ren, Yingjian Wang, Lawrence Carin et al.
The Local Rademacher Complexity of Lp-Norm Multiple Kernel Learning
Marius Kloft, Gilles Blanchard
The Manifold Tangent Classifier
Salah Rifai, Yann N. Dauphin, Pascal Vincent et al.
Thinning Measurement Models and Questionnaire Design
Ricardo Silva
Trace Lasso: a trace norm regularization for correlated designs
Edouard Grave, Guillaume R. Obozinski, Francis R. Bach
Transfer from Multiple MDPs
Alessandro Lazaric, Marcello Restelli
Transfer Learning by Borrowing Examples for Multiclass Object Detection
Joseph J. Lim, Ruslan Salakhutdinov, Antonio Torralba
Two is better than one: distinct roles for familiarity and recollection in retrieving palimpsest memories
Cristina Savin, Peter Dayan, Máté Lengyel
Understanding the Intrinsic Memorability of Images
Phillip Isola, Devi Parikh, Antonio Torralba et al.
Uniqueness of Belief Propagation on Signed Graphs
Yusuke Watanabe
Unsupervised learning models of primary cortical receptive fields and receptive field plasticity
Maneesh Bhand, Ritvik Mudur, Bipin Suresh et al.
Variance Penalizing AdaBoost
Pannagadatta K. Shivaswamy, Tony Jebara
Variance Reduction in Monte-Carlo Tree Search
Joel Veness, Marc Lanctot, Michael Bowling
Variational Gaussian Process Dynamical Systems
Andreas Damianou, Michalis K. Titsias, Neil D. Lawrence
Variational Learning for Recurrent Spiking Networks
Danilo J. Rezende, Daan Wierstra, Wulfram Gerstner