Papers
Single Image Reflection Separation via Component Synergy
Qiming Hu, Xiaojie Guo
Single-Stage Diffusion NeRF: A Unified Approach to 3D Generation and Reconstruction
Hansheng Chen, Jiatao Gu, Anpei Chen et al.
SIRA-PCR: Sim-to-Real Adaptation for 3D Point Cloud Registration
Suyi Chen, Hao Xu, Ru Li et al.
Size Does Matter: Size-aware Virtual Try-on via Clothing-oriented Transformation Try-on Network
Chieh-Yun Chen, Yi-Chung Chen, Hong-Han Shuai et al.
SKED: Sketch-guided Text-based 3D Editing
Aryan Mikaeili, Or Perel, Mehdi Safaee et al.
SkeletonMAE: Graph-based Masked Autoencoder for Skeleton Sequence Pre-training
Hong Yan, Yang Liu, Yushen Wei et al.
SkeleTR: Towards Skeleton-based Action Recognition in the Wild
Haodong Duan, Mingze Xu, Bing Shuai et al.
Sketch and Text Guided Diffusion Model for Colored Point Cloud Generation
Zijie Wu, Yaonan Wang, Mingtao Feng et al.
Skill Transformer: A Monolithic Policy for Mobile Manipulation
Xiaoyu Huang, Dhruv Batra, Akshara Rai et al.
Skip-Plan: Procedure Planning in Instructional Videos via Condensed Action Space Learning
Zhiheng Li, Wenjia Geng, Muheng Li et al.
SKiT: a Fast Key Information Video Transformer for Online Surgical Phase Recognition
Yang Liu, Jiayu Huo, Jingjing Peng et al.
SlaBins: Fisheye Depth Estimation using Slanted Bins on Road Environments
Jongsung Lee, Gyeongsu Cho, Jeongin Park et al.
SLAN: Self-Locator Aided Network for Vision-Language Understanding
Jiang-Tian Zhai, Qi Zhang, Tong Wu et al.
SLCA: Slow Learner with Classifier Alignment for Continual Learning on a Pre-trained Model
Gengwei Zhang, Liyuan Wang, Guoliang Kang et al.
Small Object Detection via Coarse-to-fine Proposal Generation and Imitation Learning
Xiang Yuan, Gong Cheng, Kebing Yan et al.
SMAUG: Sparse Masked Autoencoder for Efficient Video-Language Pre-Training
Yuanze Lin, Chen Wei, Huiyu Wang et al.
SMMix: Self-Motivated Image Mixing for Vision Transformers
Mengzhao Chen, Mingbao Lin, Zhihang Lin et al.
Smoothness Similarity Regularization for Few-Shot GAN Adaptation
Vadim Sushko, Ruyu Wang, Juergen Gall
Snow Removal in Video: A New Dataset and A Novel Method
Haoyu Chen, Jingjing Ren, Jinjin Gu et al.
SOAR: Scene-debiasing Open-set Action Recognition
Yuanhao Zhai, Ziyi Liu, Zhenyu Wu et al.
Social Diffusion: Long-term Multiple Human Motion Anticipation
Julian Tanke, Linguang Zhang, Amy Zhao et al.
SoDaCam: Software-defined Cameras via Single-Photon Imaging
Varun Sundar, Andrei Ardelean, Tristan Swedish et al.
Sound Localization from Motion: Jointly Learning Sound Direction and Camera Rotation
Ziyang Chen, Shengyi Qian, Andrew Owens
Sound Source Localization is All about Cross-Modal Alignment
Arda Senocak, Hyeonggon Ryu, Junsik Kim et al.