Co-occurring keywords
Papers
Tutoring Helps Students Learn Better: Improving Knowledge Distillation for BERT with Tutor Network
EMNLP 2022
On the convergence of policy gradient methods to Nash equilibria in general stochastic games
NIPS 2022
TaSIL: Taylor Series Imitation Learning
NIPS 2022
Efficient Unsupervised Sentence Compression by Fine-tuning Transformers with Reinforcement Learning
ACL 2022
Scalable and Robust Self-Learning for Skill Routing in Large-Scale Conversational AI Systems
NAACL 2022
Beyond the Policy Gradient Theorem for Efficient Policy Updates in Actor-Critic Algorithms
AISTATS 2022