Papers
Streaming Generation of Co-Speech Gestures via Accelerated Rolling Diffusion
Evgeniia Vu, Andrei Boiarov, Dmitry Vetrov
StreamingTalker: Audio-driven 3D Facial Animation with Autoregressive Diffusion Model
Yifan Yang, Zhi Cen, Sida Peng et al.
StreamKV: Streaming Video Question-Answering with Segment-based KV Cache Retrieval and Compression
Yilong Chen, Xiang Bai, Zhibin Wang et al.
StreamSTGS: Streaming Spatial and Temporal Gaussian Grids for Real-Time Free-Viewpoint Video
Zhihui Ke, Yvyang Liu, Xiaobo Zhou et al.
STRIDE-QA: Visual Question Answering Dataset for Spatiotemporal Reasoning in Urban Driving Scenes
Keishi Ishihara, Kento Sasaki, Tsubasa Takahashi et al.
Strip R-CNN: Large Strip Convolution for Remote Sensing Object Detection
Xinbin Yuan, Zhaohui Zheng, Yuxuan Li et al.
StrokeFusion: Vector Sketch Generation via Joint Stroke-UDF Encoding and Latent Sequence Diffusion
Jin Zhou, Yi Zhou, Hongliang Yang et al.
Structural Approach to Guiding a Present-Biased Agent
Tatiana Belova, Yuriy Dementiev, Artur Ignatiev et al.
Structural Entropy Guided Incremental Learning for Open-World Multimodal Social Event Detection
Zhiwei Yang, Haimei Qin, Xiaoyan Yu et al.
Structure-Aware Encodings of Argumentation Properties for Clique-width
Yasir Mahmood, Markus Hecher, Johanna Groven et al.
Structure-based RNA Design by Step-wise Optimization of Latent Diffusion Model
Qi Si, Xuyang Liu, Penglei Wang et al.
Structure Detection for Contextual Reinforcement Learning
Tianyue Zhou, Jung-Hoon Cho, Cathy Wu
Structure-Enhanced Adapter for Self-Supervised Heterogeneous Graph Learning
Fengyu Yan, Di Jin, Xiaobao Wang et al.
Structures Meet Semantics: Multimodal Fusion via Graph Contrastive Learning
Jiangfeng Sun, SiHao He, Zhonghong Ou et al.
ST-SAM: Multimodal Scene Text Segmentation with Dense Visual and Sparse Textual Prompts via SAM
Jin Wei, Yaqiang Wu, Jiayi Yan et al.
ST-TPP: Learning Semi-Transductive Temporal Point Processes with Gromov-Wasserstein Barycentric Regularization
Qingmei Wang, Tianyu Huang, Yujie Long et al.
Studying Classifier(-Free) Guidance from a Classifier-Centric Perspective
Xiaoming Zhao, Alex Schwing
ST-VLM: A Spatial-to-Image Multimodal Spatial-Temporal Prediction Framework with Vision-Language Model
Tong Zhao, Junping Du, Zhe Xue et al.
Style4D-Bench: A Benchmark Suite for 4D Stylization
Beiqi Chen, Shuai Shao, Haitang Feng et al.
StyleBreak: Revealing Alignment Vulnerabilities in Large Audio-Language Models via Style-Aware Audio Jailbreak
Hongyi Li, Chengxuan Zhou, Chu Wang et al.
StyleDrive: Towards Driving-Style Aware Benchmarking of End-To-End Autonomous Driving
Ruiyang Hao, Bowen Jing, Haibao Yu et al.
Style-First Authorship Verification for Academic Integrity in the Generative AI Era (Student Abstract)
Jun Jang, Thai Le, Bo Wang
StyleFM: Frequency Manipulation Empowered by Recursive Attention on Diffusion Models for Arbitrary Style Transfer
Yingnan Ma, Zhenye Liu, Siying Liu et al.
StyleSentinel: Reliable Artistic Copyright Verification via Stylistic Fingerprints
Lingxiao Chen, Liqin Wang, Wei Lu