Papers
SteerVLM: Robust Model Control through Lightweight Activation Steering for Vision Language Models
Anushka Sivakumar, Andrew Zhang, Zaber Ibn Abdul Hakim et al.
StepER: Step-wise Knowledge Distillation for Enhancing Reasoning Ability in Multi-Step Retrieval-Augmented Language Models
Kyumin Lee, Minjin Jeon, Sanghwan Jang et al.
Step Guided Reasoning: Improving Mathematical Reasoning using Guidance Generation and Step Reasoning
Lang Cao, Yingtian Zou, Chao Peng et al.
StepKE: Stepwise Knowledge Editing for Multi-Hop Question Answering
Jaewook Lee, Dahyun Jung, Heuiseok Lim
Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback
Yen-Ting Lin, Di Jin, Tengyu Xu et al.
Step-level Verifier-guided Hybrid Test-Time Scaling for Large Language Models
Kaiyan Chang, Yonghao Shi, Chenglong Wang et al.
StepSearch: Igniting LLMs Search Ability via Step-Wise Proximal Policy Optimization
Xuhui Zheng, Kang An, Ziliang Wang et al.
Stepwise Informativeness Search for Improving LLM Reasoning
Siyuan Wang, Enda Zhao, Xiang Ren
Stepwise Reasoning Checkpoint Analysis: A Test Time Scaling Method to Enhance LLMs’ Reasoning
Zezhong Wang, Xingshan Zeng, Weiwen Liu et al.
StereoDetect: Detecting Stereotypes and Anti-stereotypes the Correct Way Using Social Psychological Underpinnings
Kaustubh Shivshankar Shejole, Pushpak Bhattacharyya
S*: Test Time Scaling for Code Generation
Dacheng Li, Shiyi Cao, Chengkun Cao et al.
Sticker-TTS: Learn to Utilize Historical Experience with a Sticker-driven Test-Time Scaling Framework
Jie Chen, Jinhao Jiang, Yingqian Min et al.
Stimulate the Critical Thinking of LLMs via Debiasing Discussion
Ruiyu Xiao, Lei Wu, Yuanxing Liu et al.
Stop Looking for “Important Tokens” in Multimodal Language Models: Duplication Matters More
Zichen Wen, Yifeng Gao, Shaobo Wang et al.
Stop Playing the Guessing Game! Evaluating Conversational Recommender Systems via Target-free User Simulation
SungHwan Kim, Kwangwook Seo, Tongyoung Kim et al.
Stratified Selective Sampling for Instruction Tuning with Dedicated Scoring Strategy
Paramita Mirza, Lucas Weber, Fabian Küch
STREAQ: Selective Tiered Routing for Effective and Affordable Contact Center Quality Assurance
Prajwal Sood, Rajdeep Agrawal, Mayank Sati et al.
Stress-Testing the Reasoning Competence of Language Models With Formal Proofs
Konstantine Arkoudas, Serafim Batzoglou
STRICT: Stress-Test of Rendering Image Containing Text
Tianyu Zhang, Xinyu Wang, Lu Li et al.
Stronger Baselines for Retrieval-Augmented Generation with Long-Context Language Models
Alex Laitenberger, Christopher D Manning, Nelson F. Liu
Structural Patent Classification Using Label Hierarchy Optimization
Mengting Gui, Shufeng Hao, Chongyang Shi et al.
Structural Reward Model: Enhancing Interpretability, Efficiency, and Scalability in Reward Modeling
Xiaoyu Liu, Di Liang, Hongyu Shan et al.
Structure-aware Propagation Generation with Large Language Models for Fake News Detection
Mengyang Chen, Lingwei Wei, Wei Zhou et al.
Structure-Conditional Minimum Bayes Risk Decoding
Bryan Eikema, Anna Rutkiewicz, Mario Giulianelli