Papers
11,015 papers found
Skip-Attention: Improving Vision Transformers by Paying Less Attention
Shashanka Venkataramanan, Amir Ghodrati, Yuki M Asano et al.
Sliced Denoising: A Physics-Informed Molecular Pre-Training Method
Yuyan Ni, Shikun Feng, Wei-Ying Ma et al.
Sliced Wasserstein Estimation with Control Variates
Khai Nguyen, Nhat Ho
SliceGPT: Compress Large Language Models by Deleting Rows and Columns
Saleh Ashkboos, Maximilian L. Croci, Marcelo Gennari do Nascimento et al.
SLiMe: Segment Like Me
Aliasghar Khani, Saeid Asgari, Aditya Sanghi et al.
Small-scale proxies for large-scale Transformer training instabilities
Mitchell Wortsman, Peter J Liu, Lechao Xiao et al.
SmartPlay : A Benchmark for LLMs as Intelligent Agents
Yue Wu, Xuan Tang, Tom Mitchell et al.
Smooth ECE: Principled Reliability Diagrams via Kernel Smoothing
Jaroslaw Blasiok, Preetum Nakkiran
SNIP: Bridging Mathematical Symbolic and Numeric Realms with Unified Pre-training
Kazem Meidani, Parshin Shojaee, Chandan K. Reddy et al.
Social Reward: Evaluating and Enhancing Generative AI through Million-User Feedback from an Online Creative Community
Arman Isajanyan, Artur Shatveryan, David Kocharian et al.
Social-Transmotion: Promptable Human Trajectory Prediction
Saeed Saadatnejad, Yang Gao, Kaouther Messaoud et al.
SocioDojo: Building Lifelong Analytical Agents with Real-world Text and Time Series
Junyan Cheng, Peter Chin
Soft Contrastive Learning for Time Series
Seunghan Lee, Taeyoung Park, Kibok Lee
Soft Mixture Denoising: Beyond the Expressive Bottleneck of Diffusion Models
Yangming Li, Boris van Breugel, Mihaela van der Schaar
Soft Robust MDPs and Risk-Sensitive MDPs: Equivalence, Policy Gradient, and Sample Complexity
Runyu Zhang, Yang Hu, Na Li
SOHES: Self-supervised Open-world Hierarchical Entity Segmentation
Shengcao Cao, Jiuxiang Gu, Jason Kuen et al.
SOInter: A Novel Deep Energy-Based Interpretation Method for Explaining Structured Output Models
S. Fatemeh Seyyedsalehi, Mahdieh Soleymani Baghshah, Hamid R. Rabiee
Solving Challenging Math Word Problems Using GPT-4 Code Interpreter with Code-based Self-Verification
Aojun Zhou, Ke Wang, Zimu Lu et al.
Solving Diffusion ODEs with Optimal Boundary Conditions for Better Image Super-Resolution
Yiyang Ma, Huan Yang, Wenhan Yang et al.
Solving High Frequency and Multi-Scale PDEs with Gaussian Processes
Shikai Fang, Madison Cooley, Da Long et al.
Solving Homogeneous and Heterogeneous Cooperative Tasks with Greedy Sequential Execution
Shanqi Liu, Dong Xing, Pengjie Gu et al.
Solving Inverse Problems with Latent Diffusion Models via Hard Data Consistency
Bowen Song, Soo Min Kwon, Zecheng Zhang et al.
Some Fundamental Aspects about Lipschitz Continuity of Neural Networks
Grigory Khromov, Sidak Pal Singh
Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training
Hong Liu, Zhiyuan Li, David Leo Wright Hall et al.
SOTOPIA: Interactive Evaluation for Social Intelligence in Language Agents
Xuhui Zhou, Hao Zhu, Leena Mathur et al.