Papers
Smooth Min-Max Monotonic Networks
Christian Igel
Smoothness Adaptive Hypothesis Transfer Learning
Haotian Lin, Matthew Reimherr
Smooth Tchebycheff Scalarization for Multi-Objective Optimization
Xi Lin, Xiaoyuan Zhang, Zhiyuan Yang et al.
Sobolev Space Regularised Pre Density Models
Mark Kozdoba, Binyamin Perets, Shie Mannor
Socialized Learning: Making Each Other Better Through Multi-Agent Collaboration
Xinjie Yao, Yu Wang, Pengfei Zhu et al.
Soft Prompt Recovers Compressed LLMs, Transferably
Zhaozhuo Xu, Zirui Liu, Beidi Chen et al.
Solving Hierarchical Information-Sharing Dec-POMDPs: An Extensive-Form Game Approach
Johan Peralez, Aurélien Delage, Olivier Buffet et al.
Solving Poisson Equations using Neural Walk-on-Spheres
Hong Chul Nam, Julius Berner, Anima Anandkumar
SPABA: A Single-Loop and Probabilistic Stochastic Bilevel Algorithm Achieving Optimal Sample Complexity
Tianshu Chu, Dachuan Xu, Wei Yao et al.
SPADE: Sparsity-Guided Debugging for Deep Neural Networks
Arshia Soltani Moakhar, Eugenia Iofinova, Elias Frantar et al.
SparQ Attention: Bandwidth-Efficient LLM Inference
Luka Ribar, Ivan Chelombiev, Luke Hudlass-Galley et al.
Sparse and Structured Hopfield Networks
Saul José Rodrigues Dos Santos, Vlad Niculae, Daniel C Mcnamee et al.
Sparse Cocktail: Every Sparse Pattern Every Sparse Ratio All At Once
Zhangheng Li, Shiwei Liu, Tianlong Chen et al.
Sparse Dimensionality Reduction Revisited
Mikael Møller Høgsgaard, Lior Kamma, Kasper Green Larsen et al.
Sparse-IFT: Sparse Iso-FLOP Transformations for Maximizing Training Efficiency
Vithursan Thangarasa, Shreyas Saxena, Abhay Gupta et al.
Sparse Inducing Points in Deep Gaussian Processes: Enhancing Modeling with Denoising Diffusion Variational Inference
Jian Xu, Delu Zeng, John Paisley
Sparse is Enough in Fine-tuning Pre-trained Large Language Models
Weixi Song, Zuchao Li, Lefei Zhang et al.
Sparse Model Inversion: Efficient Inversion of Vision Transformers for Data-Free Applications
Zixuan Hu, Yongxian Wei, Li Shen et al.
Sparser, Better, Deeper, Stronger: Improving Static Sparse Training with Exact Orthogonal Initialization
Aleksandra Nowak, Łukasz Gniecki, Filip Szatkowski et al.
Sparsest Models Elude Pruning: An Exposé of Pruning’s Current Capabilities
Stephen Zhang, Vardan Papyan
Sparse-to-dense Multimodal Image Registration via Multi-Task Learning
Kaining Zhang, Jiayi Ma
SparseTSF: Modeling Long-term Time Series Forecasting with *1k* Parameters
Shengsheng Lin, Weiwei Lin, Wentai Wu et al.
Spectral Phase Transition and Optimal PCA in Block-Structured Spiked Models
Pierre Mergny, Justin Ko, Florent Krzakala
Spectral Preconditioning for Gradient Methods on Graded Non-convex Functions
Nikita Doikov, Sebastian U Stich, Martin Jaggi
Speech Self-Supervised Learning Using Diffusion Model Synthetic Data
Heting Gao, Kaizhi Qian, Junrui Ni et al.