Papers
Steering Visuomotor Policy in Open Worlds via Cross-View Goal Alignment
Shaofei Cai, Zhancun Mu, Anji Liu et al.
SteerMusic: Enhanced Musical Consistency for Zero-shot Text-Guided and Personalized Music Editing
Xinlei Niu, Kin Wai Cheuk, Jing Zhang et al.
StegaVAR: Privacy-Preserving Video Action Recognition via Steganographic Domain Analysis
Lixin Chen, Chaomeng Chen, Jiale Zhou et al.
STELAR-VISION: Self-Topology-Aware Efficient Learning for Aligned Reasoning in Vision
Chen Li, Han Zhang, Zhantao Yang et al.
Step Back to Leap Forward: Self-Backtracking for Symbolic Reasoning and Planning in Language Models
Xiao-Wen Yang, Xuan-Yi Zhu, Ding-Chu Zhang et al.
Step-by-step Layered Design Generation
Faizan Farooq Khan, Joseph K J, Koustava Goswami et al.
StepFun-Formalizer: Unlocking the Autoformalization Potential of LLMs Through Knowledge-Reasoning Fusion
Yutong Wu, Di Huang, Ruosi Wan et al.
Step-GRPO: Enhancing Reasoning Quality and Efficiency via Structured PRM-Based Reinforcement Learning
Weijie Li, Jin Wang, Liang-Chih Yu et al.
STEP-Nav: Spatial-Temporal Efficient Visual Token Pruning for Vision-and-Language Navigation with Large Language Models
Yantao Lu, Shiqi Sun, Ning Liu et al.
Stepwise Contrastive Reasoning for Retrieval-Augmented Generation over Knowledge Graphs
Chenxiao Lin, Ye Luo, KunHong Liu et al.
Steve: Your Personal AI Career Coach
Balaji Rao, Naveen Mathews Renji, Elena Korshakova et al.
ST-LLM: Spatial Transcriptomics Embedding with Large Language Models
Zhetao Xu, Xiaohua Wan, Le Li et al.
STMI: Segmentation-Guided Token Modulation with Cross-Modal Hypergraph Interaction for Multi-Modal Object Re-Identification
Xingguo Xu, Zhanyu Liu, Weixiang Zhou et al.
Stochastic Decentralized Optimization of Non-Smooth Convex and Convex-Concave Problems over Time-Varying Networks
Maxim Divilkovskiy, Alexander Gasnikov
Stochastic Universal Adversarial Perturbations with Fixed Optimization Constraint and Ensured High-probability Transferability
Yulin Jin, Xiaoyu Zhang, Haoyu Tong et al.
STOLA: Self-Adaptive Touch-Language Framework for Tactile Commonsense Reasoning in Open-Ended Scenarios
Ning Cheng, Jinan Xu, Jialing Chen et al.
Stop Mixing Things Up! BISCUIT Teaches Vision-Language Models to Learn New Concepts from Images on the Spot
Jiahua Bao, Siyao Cheng, Jiaxing Du et al.
StoryBox: Collaborative Multi-Agent Simulation for Hybrid Bottom-Up Long-Form Story Generation Using Large Language Models
Zehao Chen, Rong Pan, Haoran Li
Strategic Reasoning over Golog Programs in the Nondeterministic Situation Calculus
Giuseppe De Giacomo, Yves Lesperance, Matteo Mancanelli
Strategic Tool Enhanced AI Agent for Multi-Issue Negotiation (Student Abstract)
Daiki Kitashima, Ryota Higa, Katsuhide Fujita
Stratified Knowledge-Density Super-Network for Scalable Vision Transformers
Longhua Li, Lei Qi, Xin Geng
Stratos: An End-to-End Distillation Pipeline for Customized LLMs Under Distributed Cloud Environments
Ziming Dai, Tuo Zhang, Fei Gao et al.
Streaming Generated Gaussian Process Experts for Online Learning and Control
Zewen Yang, Dongfa Zhang, Xiaobing Dai et al.