Papers
Stabilizing Transformer Training by Preventing Attention Entropy Collapse
Shuangfei Zhai, Tatiana Likhomanenko, Etai Littwin et al.
Stable and Consistent Prediction of 3D Characteristic Orientation via Invariant Residual Learning
Seungwook Kim, Chunghyun Park, Yoonwoo Jeong et al.
Stable Estimation of Heterogeneous Treatment Effects
Anpeng Wu, Kun Kuang, Ruoxuan Xiong et al.
State and parameter learning with PARIS particle Gibbs
Gabriel Cardoso, Yazid Janati El Idrissi, Sylvain Le Corff et al.
Statistical Foundations of Prior-Data Fitted Networks
Thomas Nagler
Statistical Indistinguishability of Learning Algorithms
Alkis Kalavasis, Amin Karbasi, Shay Moran et al.
Statistical Inference and A/B Testing for First-Price Pacing Equilibria
Luofeng Liao, Christian Kroer
Statistical Inference on Multi-armed Bandits with Delayed Feedback
Lei Shi, Jingshen Wang, Tianhao Wu
Statistical Learning under Heterogeneous Distribution Shift
Max Simchowitz, Anurag Ajay, Pulkit Agrawal et al.
STEERING : Stein Information Directed Exploration for Model-Based Reinforcement Learning
Souradip Chakraborty, Amrit Bedi, Alec Koppel et al.
Stein Variational Goal Generation for adaptive Exploration in Multi-Goal Reinforcement Learning
Nicolas Castanet, Olivier Sigaud, Sylvain Lamprier
STEP: Learning N:M Structured Sparsity Masks from Scratch with Precondition
Yucheng Lu, Shivani Agrawal, Suvinay Subramanian et al.
Stochastic Gradient Descent-Induced Drift of Representation in a Two-Layer Neural Network
Farhad Pashakhanloo, Alexei Koulakov
Stochastic Gradient Succeeds for Bandits
Jincheng Mei, Zixin Zhong, Bo Dai et al.
Stochastic Marginal Likelihood Gradients using Neural Tangent Kernels
Alexander Immer, Tycho F. A. Van Der Ouderaa, Mark Van Der Wilk et al.
Stochastic Policy Gradient Methods: Improved Sample Complexity for Fisher-non-degenerate Policies
Ilyas Fatkhullin, Anas Barakat, Anastasia Kireeva et al.
Straightening Out the Straight-Through Estimator: Overcoming Optimization Challenges in Vector Quantized Networks
Minyoung Huh, Brian Cheung, Pulkit Agrawal et al.
Strategic Classification with Unknown User Manipulations
Tosca Lechner, Ruth Urner, Shai Ben-David
Stratified Adversarial Robustness with Rejection
Jiefeng Chen, Jayaram Raghuram, Jihye Choi et al.
Streaming Active Learning with Deep Neural Networks
Akanksha Saran, Safoora Yousefi, Akshay Krishnamurthy et al.
Streaming Submodular Maximization with Differential Privacy
Anamay Chaturvedi, Huy Nguyen, Thy Dinh Nguyen
StriderNet: A Graph Reinforcement Learning Approach to Optimize Atomic Structures on Rough Energy Landscapes
Vaibhav Bihani, Sahil Manchanda, Srikanth Sastry et al.
Structural Re-weighting Improves Graph Domain Adaptation
Shikun Liu, Tianchun Li, Yongbin Feng et al.
Structured Cooperative Learning with Graphical Model Priors
Shuangtong Li, Tianyi Zhou, Xinmei Tian et al.