Papers
4,025 papers found
Thompson Sampling for Linear-Quadratic Control Problems
Marc Abeille, Alessandro Lazaric
Tracking Objects with Higher Order Interactions via Delayed Column Generation
Shaofei Wang, Steffen Wolf, Charless Fowlkes et al.
Trading off Rewards and Errors in Multi-Armed Bandits
Akram Erraqabi, Alessandro Lazaric, Michal Valko et al.
Unsupervised Sequential Sensor Acquisition
Manjesh Hanawal, Csaba Szepesvari, Venkatesh Saligrama
Value-Aware Loss Function for Model-based Reinforcement Learning
Amir-Massoud Farahmand, Andre Barreto, Daniel Nikovski
Accelerating Online Convex Optimization via Adaptive Prediction
Mehryar Mohri, Scott Yang
A Column Generation Bound Minimization Approach with PAC-Bayesian Generalization Guarantees
Jean-Francis Roy, Mario Marchand, François Laviolette
A Convex Surrogate Operator for General Non-Modular Loss Functions
Jiaqian Yu, Matthew Blaschko
Active Learning Algorithms for Graphical Model Selection
Gautamd Dasarathy, Aarti Singh, Maria-Florina Balcan et al.
AdaDelay: Delay Adaptive Distributed Stochastic Optimization
Suvrit Sra, Adams Wei Yu, Mu Li et al.
A Deep Generative Deconvolutional Image Model
Yunchen Pu, Win Yuan, Andrew Stevens et al.
A Fast and Reliable Policy Improvement Algorithm
Yasin Abbasi-Yadkori, Peter L. Bartlett, Stephen J. Wright
A Fixed-Point Operator for Inference in Variational Bayesian Latent Gaussian Models
Rishit Sheth, Roni Khardon
A Lasso-based Sparse Knowledge Gradient Policy for Sequential Optimal Learning
Yan Li, Han Liu, Warren Powell
A Linearly-Convergent Stochastic L-BFGS Algorithm
Philipp Moritz, Robert Nishihara, Michael Jordan
An Improved Convergence Analysis of Cyclic Block Coordinate Descent-type Methods for Strongly Convex Minimization
Xingguo Li, Tuo Zhao, Raman Arora et al.
A PAC RL Algorithm for Episodic POMDPs
Zhaohan Daniel Guo, Shayan Doroudi, Emma Brunskill
Approximate Inference Using DC Programming For Collective Graphical Models
Thien Nguyen, Akshat Kumar, Hoong Chuin Lau et al.
A Robust-Equitable Copula Dependence Measure for Feature Selection
Yale Chang, Yi Li, Adam Ding et al.
Back to the Future: Radial Basis Function Networks Revisited
Qichao Que, Mikhail Belkin
(Bandit) Convex Optimization with Biased Noisy Gradient Oracles
Xiaowei Hu, Prashanth L.A., András György et al.
Batch Bayesian Optimization via Local Penalization
Javier Gonzalez, Zhenwen Dai, Philipp Hennig et al.
Bayesian Generalised Ensemble Markov Chain Monte Carlo
Jes Frellsen, Ole Winther, Zoubin Ghahramani et al.
Bayesian Markov Blanket Estimation
Dinu Kaufmann, Sonali Parbhoo, Aleksander Wieczorek et al.