Papers
Minimax Weight and Q-Function Learning for Off-Policy Evaluation
Masatoshi Uehara, Jiawei Huang, Nan Jiang
Min-Max Optimization without Gradients: Convergence and Applications to Black-Box Evasion and Poisoning Attacks
Sijia Liu, Songtao Lu, Xiangyi Chen et al.
Missing Data Imputation using Optimal Transport
Boris Muzellec, Julie Josse, Claire Boyer et al.
Mix-n-Match: Ensemble and Compositional Methods for Uncertainty Calibration in Deep Learning
Jize Zhang, Bhavya Kailkhura, T. Yong-Jin Han
Model-Based Reinforcement Learning with Value-Targeted Regression
Alex Ayoub, Zeyu Jia, Csaba Szepesvari et al.
Model-free Reinforcement Learning in Infinite-horizon Average-reward Markov Decision Processes
Chen-Yu Wei, Mehdi Jafarnia Jahromi, Haipeng Luo et al.
Model Fusion with Kullback-Leibler Divergence
Sebastian Claici, Mikhail Yurochkin, Soumya Ghosh et al.
Modulating Surrogates for Bayesian Optimization
Erik Bodin, Markus Kaiser, Ieva Kazlauskaite et al.
Momentum-Based Policy Gradient Methods
Feihu Huang, Shangqian Gao, Jian Pei et al.
Momentum Improves Normalized SGD
Ashok Cutkosky, Harsh Mehta
MoNet3D: Towards Accurate Monocular 3D Object Localization in Real Time
Xichuan Zhou, Yicong Peng, Chunqiao Long et al.
Moniqua: Modulo Quantized Communication in Decentralized SGD
Yucheng Lu, Christopher De Sa
Monte-Carlo Tree Search as Regularized Policy Optimization
Jean-Bastien Grill, Florent Altché, Yunhao Tang et al.
More Data Can Expand The Generalization Gap Between Adversarially Robust and Standard Models
Lin Chen, Yifei Min, Mingrui Zhang et al.
More Information Supervised Probabilistic Deep Face Embedding Learning
Ying Huang, Shangfeng Qiu, Wenwei Zhang et al.
Multi-Agent Determinantal Q-Learning
Yaodong Yang, Ying Wen, Jun Wang et al.
Multi-Agent Routing Value Iteration Network
Quinlan Sykora, Mengye Ren, Raquel Urtasun
Multiclass Neural Network Minimization via Tropical Newton Polytope Approximation
Georgios Smyrnis, Petros Maragos
Multidimensional Shape Constraints
Maya Gupta, Erez Louidor, Oleksandr Mangylov et al.
Multi-fidelity Bayesian Optimization with Max-value Entropy Search and its Parallelization
Shion Takeno, Hitoshi Fukuoka, Yuhki Tsukada et al.
Multigrid Neural Memory
Tri Huynh, Michael Maire, Matthew Walter
Multilinear Latent Conditioning for Generating Unseen Attribute Combinations
Markos Georgopoulos, Grigorios Chrysos, Maja Pantic et al.
Multinomial Logit Bandit with Low Switching Cost
Kefan Dong, Yingkai Li, Qin Zhang et al.
Multi-objective Bayesian Optimization using Pareto-frontier Entropy
Shinya Suzuki, Shion Takeno, Tomoyuki Tamura et al.
Multi-Objective Molecule Generation using Interpretable Substructures
Wengong Jin, Regina Barzilay, Tommi Jaakkola