Papers
8,340 papers found
Automatic Intrinsic Reward Shaping for Exploration in Deep Reinforcement Learning
Mingqi Yuan, Bo Li, Xin Jin et al.
Autoregressive Diffusion Model for Graph Generation
Lingkai Kong, Jiaming Cui, Haotian Sun et al.
Auxiliary Learning as an Asymmetric Bargaining Game
Aviv Shamsian, Aviv Navon, Neta Glazer et al.
Auxiliary Modality Learning with Generalized Curriculum Distillation
Yu Shen, Xijun Wang, Peng Gao et al.
Averaged Method of Multipliers for Bi-Level Optimization without Lower-Level Strong Convexity
Risheng Liu, Yaohua Liu, Wei Yao et al.
A Watermark for Large Language Models
John Kirchenbauer, Jonas Geiping, Yuxin Wen et al.
Bag of Tricks for Training Data Extraction from Language Models
Weichen Yu, Tianyu Pang, Qian Liu et al.
Bandit Multi-linear DR-Submodular Maximization and Its Applications on Adversarial Submodular Bandits
Zongqi Wan, Jialin Zhang, Wei Chen et al.
Bandit Online Linear Optimization with Hints and Queries
Aditya Bhaskara, Ashok Cutkosky, Ravi Kumar et al.
Bandits with Knapsacks: Advice on Time-Varying Demands
Lixing Lyu, Wang Chi Cheung
Banker Online Mirror Descent: A Universal Approach for Delayed Online Bandit Learning
Jiatai Huang, Yan Dai, Longbo Huang
Bayesian Design Principles for Frequentist Sequential Learning
Yunbei Xu, Assaf Zeevi
Bayesian Estimation of Differential Privacy
Santiago Zanella-Beguelin, Lukas Wutschitz, Shruti Tople et al.
Bayesian Neural Networks Avoid Encoding Complex and Perturbation-Sensitive Concepts
Qihan Ren, Huiqi Deng, Yunuo Chen et al.
Bayesian online change point detection with Hilbert space approximate Student-t process
Jeremy Sellier, Petros Dellaportas
Bayesian Progressive Deep Topic Model with Knowledge Informed Textual Data Coarsening Process
Zhibin Duan, Xinyang Liu, Yudi Su et al.
Bayesian Reparameterization of Reward-Conditioned Reinforcement Learning with Energy-based Models
Wenhao Ding, Tong Che, Ding Zhao et al.
Bayes-optimal Learning of Deep Random Networks of Extensive-width
Hugo Cui, Florent Krzakala, Lenka Zdeborova
Beam Tree Recursive Cells
Jishnu Ray Chowdhury, Cornelia Caragea
BEATs: Audio Pre-Training with Acoustic Tokenizers
Sanyuan Chen, Yu Wu, Chengyi Wang et al.
Behavior Contrastive Learning for Unsupervised Skill Discovery
Rushuai Yang, Chenjia Bai, Hongyi Guo et al.
Benign Overfitting in Deep Neural Networks under Lazy Training
Zhenyu Zhu, Fanghui Liu, Grigorios Chrysos et al.
Benign Overfitting in Two-layer ReLU Convolutional Neural Networks
Yiwen Kou, Zixiang Chen, Yuanzhou Chen et al.
Best Arm Identification in Multi-Agent Multi-Armed Bandits
Filippo Vannella, Alexandre Proutiere, Jaeseong Jeong
Best of Both Worlds Policy Optimization
Christoph Dann, Chen-Yu Wei, Julian Zimmert