Papers
11,951 papers found
Playing the lottery with rewards and multiple languages: lottery tickets in RL and NLP
Haonan Yu, Sergey Edunov, Yuandong Tian et al.
Plug and Play Language Models: A Simple Approach to Controlled Text Generation
Sumanth Dathathri, Andrea Madotto, Janice Lan et al.
Policy Learning of MDPs with Mixed Continuous/Discrete Variables: A Case Study on Model-Free Control of Markovian Jump Systems
Joao Paulo Jansch-Porto, Bin Hu, Geir Dullerud
Poly-encoders: Architectures and Pre-training Strategies for Fast and Accurate Multi-sentence Scoring
Samuel Humeau, Kurt Shuster, Marie-Anne Lachaux et al.
Population-Guided Parallel Policy Search for Reinforcement Learning
Whiyoung Jung, Giseung Park, Youngchul Sung
Posterior sampling for multi-agent reinforcement learning: solving extensive games with imperfect information
Yichi Zhou, Jialian Li, Jun Zhu
Precision Gating: Improving Neural Network Efficiency with Dynamic Dual-Precision Activations
Yichi Zhang, Ritchie Zhao, Weizhe Hua et al.
Prediction, Consistency, Curvature: Representation Learning for Locally-Linear Control
Nir Levine, Yinlam Chow, Rui Shu et al.
Prediction Poisoning: Towards Defenses Against DNN Model Stealing Attacks
Tribhuvanesh Orekondy, Bernt Schiele, Mario Fritz
Pretrained Encyclopedia: Weakly Supervised Knowledge-Pretrained Language Model
Wenhan Xiong, Jingfei Du, William Yang Wang et al.
Pre-training Tasks for Embedding-based Large-scale Retrieval
Wei-Cheng Chang, Felix X. Yu, Yin-Wen Chang et al.
Principled Weight Initialization for Hypernetworks
Oscar Chang, Lampros Flokas, Hod Lipson
Probabilistic Connection Importance Inference and Lossless Compression of Deep Neural Networks
Xin Xing, Long Sha, Pengyu Hong et al.
Probability Calibration for Knowledge Graph Embedding Models
Pedro Tabacof, Luca Costabello
Program Guided Agent
Shao-Hua Sun, Te-Lin Wu, Joseph J. Lim
PROGRESSIVE LEARNING AND DISENTANGLEMENT OF HIERARCHICAL REPRESENTATIONS
Zhiyuan Li, Jaideep Vitthal Murkute, Prashnna Kumar Gyawali et al.
Progressive Memory Banks for Incremental Domain Adaptation
Nabiha Asghar, Lili Mou, Kira A. Selby et al.
Projection-Based Constrained Policy Optimization
Tsung-Yen Yang, Justinian Rosca, Karthik Narasimhan et al.
Prosodic Characteristics of Genuine and Mock (Im)polite Mandarin Utterances
Chengwei Xu, Wentao Gu
Provable Benefit of Orthogonal Initialization in Optimizing Deep Linear Networks
Wei Hu, Lechao Xiao, Jeffrey Pennington
Provable Filter Pruning for Efficient Neural Networks
Lucas Liebenwein, Cenk Baykal, Harry Lang et al.
Provable robustness against all adversarial $l_p$-perturbations for $p\geq 1$
Francesco Croce, Matthias Hein
ProxSGD: Training Structured Neural Networks under Regularization and Constraints
Yang Yang, Yaxiong Yuan, Avraam Chatzimichailidis et al.