Papers
SVQNet: Sparse Voxel-Adjacent Query Network for 4D Spatio-Temporal LiDAR Semantic Segmentation
Xuechao Chen, Shuangjie Xu, Xiaoyi Zou et al.
SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applications
Abdelrahman Shaker, Muhammad Maaz, Hanoona Rasheed et al.
SwinLSTM: Improving Spatiotemporal Prediction Accuracy using Swin Transformer and LSTM
Song Tang, Chuang Li, Pu Zhang et al.
SYENet: A Simple Yet Effective Network for Multiple Low-Level Vision Tasks with Real-Time Performance on Mobile Device
Weiran Gou, Ziyao Yi, Yan Xiang et al.
SynBody: Synthetic Dataset with Layered Human Models for 3D Human Perception and Modeling
Zhitao Yang, Zhongang Cai, Haiyi Mei et al.
Synchronize Feature Extracting and Matching: A Single Branch Framework for 3D Object Tracking
Teli Ma, Mengmeng Wang, Jimin Xiao et al.
Synthesizing Diverse Human Motions in 3D Indoor Scenes
Kaifeng Zhao, Yan Zhang, Shaofei Wang et al.
Take-A-Photo: 3D-to-2D Generative Pre-training of Point Cloud Models
Ziyi Wang, Xumin Yu, Yongming Rao et al.
Talking Head Generation with Probabilistic Audio-to-Visual Diffusion Priors
Zhentao Yu, Zixin Yin, Deyu Zhou et al.
TALL: Thumbnail Layout for Deepfake Video Detection
Yuting Xu, Jian Liang, Gengyun Jia et al.
Taming Contrast Maximization for Learning Sequential, Low-latency, Event-based Optical Flow
Federico Paredes-Vallés, Kirk Y. W. Scheper, Christophe De Wagter et al.
Tangent Model Composition for Ensembling and Continual Fine-tuning
Tian Yu Liu, Stefano Soatto
Tangent Sampson Error: Fast Approximate Two-view Reprojection Error for Central Camera Models
Mikhail Terekhov, Viktor Larsson
TAPIR: Tracking Any Point with Per-Frame Initialization and Temporal Refinement
Carl Doersch, Yi Yang, Mel Vecerik et al.
TARGET: Federated Class-Continual Learning via Exemplar-Free Distillation
Jie Zhang, Chen Chen, Weiming Zhuang et al.
Task Agnostic Restoration of Natural Video Dynamics
Muhammad Kashif Ali, Dongjin Kim, Tae Hyun Kim
Task-aware Adaptive Learning for Cross-domain Few-shot Learning
Yurong Guo, Ruoyi Du, Yuan Dong et al.
Task-Oriented Multi-Modal Mutual Leaning for Vision-Language Models
Sifan Long, Zhen Zhao, Junkun Yuan et al.
Taxonomy Adaptive Cross-Domain Adaptation in Medical Imaging via Optimization Trajectory Distillation
Jianan Fan, Dongnan Liu, Hang Chang et al.
TCOVIS: Temporally Consistent Online Video Instance Segmentation
Junlong Li, Bingyao Yu, Yongming Rao et al.
Teaching CLIP to Count to Ten
Roni Paiss, Ariel Ephrat, Omer Tov et al.
TeD-SPAD: Temporal Distinctiveness for Self-Supervised Privacy-Preservation for Video Anomaly Detection
Joseph Fioresi, Ishan Rajendrakumar Dave, Mubarak Shah
Tem-Adapter: Adapting Image-Text Pretraining for Video Question Answer
Guangyi Chen, Xiao Liu, Guangrun Wang et al.
Template-guided Hierarchical Feature Restoration for Anomaly Detection
Hewei Guo, Liping Ren, Jingjing Fu et al.