Papers
STARQA: A Question Answering Dataset for Complex Analytical Reasoning over Structured Databases
Mounica Maddela, Lingjue Xie, Daniel Preotiuc-Pietro et al.
START: Self-taught Reasoner with Tools
Chengpeng Li, Mingfeng Xue, Zhenru Zhang et al.
Static or Dynamic: Towards Query-Adaptive Token Selection for Video Question Answering
Yumeng Shi, Quanyu Long, Wenya Wang
Static Word Embeddings for Sentence Semantic Representation
Takashi Wada, Yuki Hirakawa, Ryotaro Shimizu et al.
Statistical and Neural Methods for Hawaiian Orthography Modernization
Jaden Kapali, Keaton Williamson, Winston Wu
StatsChartMWP: A Dataset for Evaluating Multimodal Mathematical Reasoning Abilities on Math Word Problems with Statistical Charts
Dan Zhu, Tianqiao Liu, Zitao Liu
STEAM: A Semantic-Level Knowledge Editing Framework for Large Language Models
Geunyeong Jeong, Juoh Sun, Seonghee Lee et al.
STEER-BENCH: A Benchmark for Evaluating the Steerability of Large Language Models
Kai Chen, Zihao He, Taiwei Shi et al.
Steering Language Models in Multi-Token Generation: A Case Study on Tense and Aspect
Alina Klerings, Jannik Brinkmann, Daniel Ruffinelli et al.
Steering LLM Reasoning Through Bias-Only Adaptation
Viacheslav Sinii, Alexey Gorbatovski, Artem Cherepanov et al.
Steering LVLMs via Sparse Autoencoder for Hallucination Mitigation
Zhenglin Hua, Jinghan He, Zijun Yao et al.
SteerVLM: Robust Model Control through Lightweight Activation Steering for Vision Language Models
Anushka Sivakumar, Andrew Zhang, Zaber Ibn Abdul Hakim et al.
StepER: Step-wise Knowledge Distillation for Enhancing Reasoning Ability in Multi-Step Retrieval-Augmented Language Models
Kyumin Lee, Minjin Jeon, Sanghwan Jang et al.
Step Guided Reasoning: Improving Mathematical Reasoning using Guidance Generation and Step Reasoning
Lang Cao, Yingtian Zou, Chao Peng et al.
StepKE: Stepwise Knowledge Editing for Multi-Hop Question Answering
Jaewook Lee, Dahyun Jung, Heuiseok Lim
Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback
Yen-Ting Lin, Di Jin, Tengyu Xu et al.
Step-level Verifier-guided Hybrid Test-Time Scaling for Large Language Models
Kaiyan Chang, Yonghao Shi, Chenglong Wang et al.
StepSearch: Igniting LLMs Search Ability via Step-Wise Proximal Policy Optimization
Xuhui Zheng, Kang An, Ziliang Wang et al.
Stepwise Informativeness Search for Improving LLM Reasoning
Siyuan Wang, Enda Zhao, Xiang Ren
Stepwise Reasoning Checkpoint Analysis: A Test Time Scaling Method to Enhance LLMs’ Reasoning
Zezhong Wang, Xingshan Zeng, Weiwen Liu et al.
StereoDetect: Detecting Stereotypes and Anti-stereotypes the Correct Way Using Social Psychological Underpinnings
Kaustubh Shivshankar Shejole, Pushpak Bhattacharyya
S*: Test Time Scaling for Code Generation
Dacheng Li, Shiyi Cao, Chengkun Cao et al.
Sticker-TTS: Learn to Utilize Historical Experience with a Sticker-driven Test-Time Scaling Framework
Jie Chen, Jinhao Jiang, Yingqian Min et al.