Papers
Partially Observed Exchangeable Modeling
Yang Li, Junier Oliva
Path Planning using Neural A* Search
Ryo Yonetani, Tatsunori Taniai, Mohammadamin Barekatain et al.
PEBBLE: Feedback-Efficient Interactive Reinforcement Learning via Relabeling Experience and Unsupervised Pre-training
Kimin Lee, Laura M Smith, Pieter Abbeel
Perceiver: General Perception with Iterative Attention
Andrew Jaegle, Felix Gimeno, Andy Brock et al.
Permutation Weighting
David Arbour, Drew Dimmery, Arjun Sondhi
Personalized Federated Learning using Hypernetworks
Aviv Shamsian, Aviv Navon, Ethan Fetaya et al.
Phasic Policy Gradient
Karl W Cobbe, Jacob Hilton, Oleg Klimov et al.
PHEW : Constructing Sparse Networks that Learn Fast and Generalize Well without Training Data
Shreyas Malakarjun Patil, Constantine Dovrolis
PID Accelerated Value Iteration Algorithm
Amir-Massoud Farahmand, Mohammad Ghavamzadeh
PipeTransformer: Automated Elastic Pipelining for Distributed Training of Large-scale Models
Chaoyang He, Shen Li, Mahdi Soltanolkotabi et al.
PixelTransformer: Sample Conditioned Signal Generation
Shubham Tulsiani, Abhinav Gupta
PODS: Policy Optimization via Differentiable Simulation
Miguel Angel Zamora Mora, Momchil Peychev, Sehoon Ha et al.
Pointwise Binary Classification with Pairwise Confidence Comparisons
Lei Feng, Senlin Shu, Nan Lu et al.
Poisson-Randomised DirBN: Large Mutation is Needed in Dirichlet Belief Networks
Xuhui Fan, Bin Li, Yaqiong Li et al.
Policy Analysis using Synthetic Controls in Continuous-Time
Alexis Bellot, Mihaela van der Schaar
Policy Caches with Successor Features
Mark Nemecek, Ronald Parr
Policy Gradient Bayesian Robust Optimization for Imitation Learning
Zaynah Javed, Daniel S Brown, Satvik Sharma et al.
Policy Information Capacity: Information-Theoretic Measure for Task Complexity in Deep Reinforcement Learning
Hiroki Furuta, Tatsuya Matsushima, Tadashi Kozuno et al.
Poolingformer: Long Document Modeling with Pooling Attention
Hang Zhang, Yeyun Gong, Yelong Shen et al.
PopSkipJump: Decision-Based Attack for Probabilistic Classifiers
Carl-Johann Simon-Gabriel, Noman Ahmed Sheikh, Andreas Krause
Positive-Negative Momentum: Manipulating Stochastic Gradient Noise to Improve Generalization
Zeke Xie, Li Yuan, Zhanxing Zhu et al.
Posterior Value Functions: Hindsight Baselines for Policy Gradient Methods
Chris Nota, Philip Thomas, Bruno C. Da Silva
Post-selection inference with HSIC-Lasso
Tobias Freidling, Benjamin Poignard, Héctor Climente-González et al.