Papers
938 papers found
Solving structured hierarchical games using differential backward induction
Zun Li, Feiran Jia, Aditya Mate et al.
Stability of SGD: Tightness analysis and improved bounds
Yikai Zhang, Wenjia Zhang, Sammy Bald et al.
Stackmix: a complementary mix algorithm
John Chen, Samarth Sinha, Anastasios Kyrillidis
ST-MAML : A stochastic-task based method for task-heterogeneous meta-learning
Zhe Wang, Jake Grigsby, Arshdeep Sekhon et al.
Sublinear time algorithms for greedy selection in high dimensions
Qi Chen, Kai Liu, Ruilong Yao et al.
Superposing many tickets into one: A performance booster for sparse neural network training
Lu Yin, Vlado Menkovski, Meng Fang et al.
SymNet 2.0: Effectively handling Non-Fluents and Actions in Generalized Neural Policies for RDDL Relational MDPs
Vishal Sharma, Daman Arora, Florian Geißer et al.
Systematized event-aware learning for multi-object tracking
Hyemin Lee, Daijin Kim
Temporal abstractions-augmented temporally contrastive learning: An alternative to the Laplacian in RL
Akram Erraqabi, Marlos C. Machado, Mingde Zhao et al.
Test for non-negligible adverse shifts
Vathy M Kamulete
The optimal noise in noise-contrastive learning is not what you think
Omar Chehab, Alexandre Gramfort, Aapo Hyvärinen
Toward learning human-aligned cross-domain robust models by countering misaligned features
Haohan Wang, Zeyi Huang, Hanlin Zhang et al.
Towards painless policy optimization for constrained MDPs
Arushi Jain, Sharan Vaswani, Reza Babanezhad et al.
Towards unsupervised open world semantic segmentation
Svenja Uhlemeyer, Matthias Rottmann, Hanno Gottschalk
Uncertainty-aware pseudo-labeling for quantum calculations
Kexin Huang, Vishnu Sresht, Brajesh Rai et al.
Understanding and mitigating the limitations of prioritized experience replay
Yangchen Pan, Jincheng Mei, Amir-massoud Farahmand et al.
Using hierarchies to efficiently combine evidence with Dempster’s rule of combination
Daira Pinto Prieto, Ronald de Haan
Variational- and metric-based deep latent space for out-of-distribution detection
Or Dinari, Oren Freifeld
Variational message passing neural network for Maximum-A-Posteriori (MAP) inference
Zijun Cui, Hanjing Wang, Tian Gao et al.
Variational multiple shooting for Bayesian ODEs with Gaussian processes
Pashupati Hegde, Çağatay Yıldız, Harri Lähdesmäki et al.
Voronoi density estimator for high-dimensional data: Computation, compactification and convergence
Vladislav Polianskii, Giovanni Luca Marchetti, Alexander Kravberg et al.
VQ-Flows: Vector quantized local normalizing flows
Sahil Sidheekh, Chris B. Dock, Tushar Jain et al.
X-MEN: guaranteed XOR-maximum entropy constrained inverse reinforcement learning
Fan Ding, Yexiang Xue
A Bayesian nonparametric conditional two-sample test with an application to Local Causal Discovery
Philip A. Boeken, Joris M. Mooij
Action redundancy in reinforcement learning
Nir Baram, Guy Tennenholtz, Shie Mannor