Papers
Supervised Masked Knowledge Distillation for Few-Shot Transformers
Han Lin, Guangxing Han, Jiawei Ma et al.
SurfelNeRF: Neural Surfel Radiance Fields for Online Photorealistic Reconstruction of Indoor Scenes
Yiming Gao, Yan-Pei Cao, Ying Shan
SVFormer: Semi-Supervised Video Transformer for Action Recognition
Zhen Xing, Qi Dai, Han Hu et al.
SVGformer: Representation Learning for Continuous Vector Graphics Using Transformers
Defu Cao, Zhaowen Wang, Jose Echevarria et al.
SViTT: Temporal Learning of Sparse Video-Text Transformers
Yi Li, Kyle Min, Subarna Tripathi et al.
Swept-Angle Synthetic Wavelength Interferometry
Alankar Kotwal, Anat Levin, Ioannis Gkioulekas
Switchable Representation Learning Framework With Self-Compatibility
Shengsen Wu, Yan Bai, Yihang Lou et al.
Symmetric Shape-Preserving Autoencoder for Unsupervised Real Scene Point Cloud Completion
Changfeng Ma, Yinuo Chen, Pengxiao Guo et al.
Synthesizing Photorealistic Virtual Humans Through Cross-Modal Disentanglement
Siddarth Ravichandran, Ondřej Texler, Dimitar Dinev et al.
SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision
Xubo Liu, Egor Lakomkin, Konstantinos Vougioukas et al.
System-Status-Aware Adaptive Network for Online Streaming Video Understanding
Lin Geng Foo, Jia Gong, Zhipeng Fan et al.
Taming Diffusion Models for Audio-Driven Co-Speech Gesture Generation
Lingting Zhu, Xian Liu, Xuanyu Liu et al.
Tangentially Elongated Gaussian Belief Propagation for Event-Based Incremental Optical Flow Estimation
Jun Nagata, Yusuke Sekikawa
TAPS3D: Text-Guided 3D Textured Shape Generation From Pseudo Supervision
Jiacheng Wei, Hao Wang, Jiashi Feng et al.
Target-Referenced Reactive Grasping for Dynamic Objects
Jirong Liu, Ruo Zhang, Hao-Shu Fang et al.
TarViS: A Unified Approach for Target-Based Video Segmentation
Ali Athar, Alexander Hermans, Jonathon Luiten et al.
Task Difficulty Aware Parameter Allocation & Regularization for Lifelong Learning
Wenjin Wang, Yunqing Hu, Qianglong Chen et al.
Task Residual for Tuning Vision-Language Models
Tao Yu, Zhihe Lu, Xin Jin et al.
Task-Specific Fine-Tuning via Variational Information Bottleneck for Weakly-Supervised Pathology Whole Slide Image Classification
Honglin Li, Chenglu Zhu, Yunlong Zhang et al.
TBP-Former: Learning Temporal Bird's-Eye-View Pyramid for Joint Perception and Prediction in Vision-Centric Autonomous Driving
Shaoheng Fang, Zi Wang, Yiqi Zhong et al.
Teacher-Generated Spatial-Attention Labels Boost Robustness and Accuracy of Contrastive Models
Yushi Yao, Chang Ye, Junfeng He et al.
Teaching Matters: Investigating the Role of Supervision in Vision Transformers
Matthew Walmer, Saksham Suri, Kamal Gupta et al.
Teaching Structured Vision & Language Concepts to Vision & Language Models
Sivan Doveh, Assaf Arbelle, Sivan Harary et al.
Teleidoscopic Imaging System for Microscale 3D Shape Reconstruction
Ryo Kawahara, Meng-Yu Jennifer Kuo, Shohei Nobuhara
Tell Me What Happened: Unifying Text-Guided Video Completion via Multimodal Masked Video Generation
Tsu-Jui Fu, Licheng Yu, Ning Zhang et al.