Papers
Step-GRPO: Enhancing Reasoning Quality and Efficiency via Structured PRM-Based Reinforcement Learning
Weijie Li, Jin Wang, Liang-Chih Yu et al.
Step-GRPO: Internalizing Dynamic Early Exit for Efficient Reasoning
Benteng Chen, Weida Wang, Shufei Zhang et al.
StepHint: Multi-level Stepwise Hints Enhance Reinforcement Learning to Reason
Kaiyi Zhang, Ang Lv, Jinpeng Li et al.
STEP-Nav: Spatial-Temporal Efficient Visual Token Pruning for Vision-and-Language Navigation with Large Language Models
Yantao Lu, Shiqi Sun, Ning Liu et al.
Stepwise Contrastive Reasoning for Retrieval-Augmented Generation over Knowledge Graphs
Chenxiao Lin, Ye Luo, KunHong Liu et al.
Stereotype Bias in a Bilingual Setting: A Culturally Grounded Evaluation in Kazakhstan
Nurkhan Laiyk, Daniil Orel, Ayana Mussabayeva et al.
Steve: Your Personal AI Career Coach
Balaji Rao, Naveen Mathews Renji, Elena Korshakova et al.
Still Between Us? Evaluating and Improving Voice Assistant Robustness to Third-Party Interruptions
Dongwook Lee, Eunwoo Song, Che Hyun Lee et al.
STK-Adapter: Incorporating Evolving Graph and Event Chain for Temporal Knowledge Graph Extrapolation
Shuyuan Zhao, Wei Chen, Weijie Zhang et al.
ST-LLM: Spatial Transcriptomics Embedding with Large Language Models
Zhetao Xu, Xiaohua Wan, Le Li et al.
STMI: Segmentation-Guided Token Modulation with Cross-Modal Hypergraph Interaction for Multi-Modal Object Re-Identification
Xingguo Xu, Zhanyu Liu, Weixiang Zhou et al.
Stochastic Decentralized Optimization of Non-Smooth Convex and Convex-Concave Problems over Time-Varying Networks
Maxim Divilkovskiy, Alexander Gasnikov
Stochastic Parrots or True Virtuosos? Digging Deeper Into the Audio-Video Understanding of AVQA Models
Sara Pernille Jensen, Hallvard Innset Hurum, Anna-Maria Christodoulou
Stochastic Universal Adversarial Perturbations with Fixed Optimization Constraint and Ensured High-probability Transferability
Yulin Jin, Xiaoyu Zhang, Haoyu Tong et al.
STOLA: Self-Adaptive Touch-Language Framework for Tactile Commonsense Reasoning in Open-Ended Scenarios
Ning Cheng, Jinan Xu, Jialing Chen et al.
Stop Fixating on Prompts: Reasoning Hijacking and Constraint Tightening for Red-Teaming LLM Agents
Yanxu Mao, Peipei Liu, Tiehan Cui et al.
Stop Hardening Everything: A Training-Free Neuron-Level Defense for Neural Ranking Models
Yu-An Liu, Ruqing Zhang, Hongru Song et al.
Stop Mixing Things Up! BISCUIT Teaches Vision-Language Models to Learn New Concepts from Images on the Spot
Jiahua Bao, Siyao Cheng, Jiaxing Du et al.
Stop Taking Tokenizers for Granted: They Are Core Design Decisions in Large Language Models
Sawsan Alqahtani, Mir Tafseer Nayeem, Md Tahmid Rahman Laskar et al.
Stop When Enough: Adaptive Early-Stopping for Chain-of-Thought Reasoning
Renliang Sun, Wei Cheng, Dawei Li et al.
StoryBox: Collaborative Multi-Agent Simulation for Hybrid Bottom-Up Long-Form Story Generation Using Large Language Models
Zehao Chen, Rong Pan, Haoran Li
StoryCoder: Narrative Reformulation for Structured Reasoning in LLM Code Generation
Geonhui Jang, Dongyoon Han, YoungJoon Yoo
Stratagem: Learning Transferable Reasoning via Trajectory-Modulated Game Self-Play
Xiachong Feng, Deyi Yin, Xiaocheng Feng et al.
Strategic Reasoning over Golog Programs in the Nondeterministic Situation Calculus
Giuseppe De Giacomo, Yves Lesperance, Matteo Mancanelli