Papers
11,015 papers found
Sparse Distributed Memory is a Continual Learner
Trenton Bricken, Xander Davies, Deepak Singh et al.
Sparse Mixture-of-Experts are Domain Generalizable Learners
Bo Li, Yifei Shen, Jingkang Yang et al.
Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable Transformers
Tianlong Chen, Zhenyu Zhang, Ajay Kumar Jaiswal et al.
Sparse Random Networks for Communication-Efficient Federated Learning
Berivan Isik, Francesco Pase, Deniz Gunduz et al.
Sparse Token Transformer with Attention Back Tracking
Heejun Lee, Minki Kang, Youngwan Lee et al.
Sparse tree-based Initialization for Neural Networks
Patrick Lutz, Ludovic Arnould, Claire Boyer et al.
Sparse Upcycling: Training Mixture-of-Experts from Dense Checkpoints
Aran Komatsuzaki, Joan Puigcerver, James Lee-Thorp et al.
Sparsity-Constrained Optimal Transport
Tianlin Liu, Joan Puigcerver, Mathieu Blondel
Sparsity May Cry: Let Us Fail (Current) Sparse Neural Networks Together!
Shiwei Liu, Tianlong Chen, Zhenyu Zhang et al.
Spatial Attention Kinetic Networks with E(n)-Equivariance
Yuanqing Wang, John Chodera
Spatio-temporal point processes with deep non-stationary kernels
Zheng Dong, Xiuyuan Cheng, Yao Xie
Specformer: Spectral Graph Neural Networks Meet Transformers
Deyu Bo, Chuan Shi, Lele Wang et al.
Spectral Augmentation for Self-Supervised Learning on Graphs
Lu Lin, Jinghui Chen, Hongning Wang
Spectral Decomposition Representation for Reinforcement Learning
Tongzheng Ren, Tianjun Zhang, Lisa Lee et al.
SpeedyZero: Mastering Atari with Limited Data and Time
Yixuan Mei, Jiaxuan Gao, Weirui Ye et al.
Spherical Sliced-Wasserstein
Clément Bonet, Paul Berg, Nicolas Courty et al.
Spikformer: When Spiking Neural Network Meets Transformer
Zhaokun Zhou, Yuesheng Zhu, Chao He et al.
Spiking Convolutional Neural Networks for Text Classification
Changze Lv, Jianhan Xu, Xiaoqing Zheng
SQA3D: Situated Question Answering in 3D Scenes
Xiaojian Ma, Silong Yong, Zilong Zheng et al.
Squeeze Training for Adversarial Robustness
Qizhang Li, Yiwen Guo, Wangmeng Zuo et al.
StableDR: Stabilized Doubly Robust Learning for Recommendation on Data Missing Not at Random
Haoxuan Li, Chunyuan Zheng, Peng Wu
Stable Target Field for Reduced Variance Score Estimation in Diffusion Models
Yilun Xu, Shangyuan Tong, Tommi S. Jaakkola
STaSy: Score-based Tabular data Synthesis
Jayoung Kim, Chaejeong Lee, Noseong Park
Stateful Active Facilitator: Coordination and Environmental Heterogeneity in Cooperative Multi-Agent Reinforcement Learning
Dianbo Liu, Vedant Shah, Oussama Boussif et al.