Papers
Sub-Word Level Lip Reading With Visual Attention
K R Prajwal, Triantafyllos Afouras, Andrew Zisserman
Surface-Aligned Neural Radiance Fields for Controllable 3D Human Synthesis
Tianhan Xu, Yasuhiro Fujita, Eiichi Matsumoto
Surface Reconstruction From Point Clouds by Learning Predictive Context Priors
Baorui Ma, Yu-Shen Liu, Matthias Zwicker et al.
Surface Representation for Point Clouds
Haoxi Ran, Jun Liu, Chengjie Wang
SurfEmb: Dense and Continuous Correspondence Distributions for Object Pose Estimation With Learnt Surface Embeddings
Rasmus Laurvig Haugaard, Anders Glent Buch
Surpassing the Human Accuracy: Detecting Gallbladder Cancer From USG Images With Curriculum Learning
Soumen Basu, Mayank Gupta, Pratyaksha Rana et al.
SVIP: Sequence VerIfication for Procedures in Videos
Yicheng Qian, Weixin Luo, Dongze Lian et al.
SwapMix: Diagnosing and Regularizing the Over-Reliance on Visual Context in Visual Question Answering
Vipul Gupta, Zhuowan Li, Adam Kortylewski et al.
SWEM: Towards Real-Time Video Object Segmentation With Sequential Weighted Expectation-Maximization
Zhihui Lin, Tianyu Yang, Maomao Li et al.
SwinBERT: End-to-End Transformers With Sparse Attention for Video Captioning
Kevin Lin, Linjie Li, Chung-Ching Lin et al.
SwinTextSpotter: Scene Text Spotting via Better Synergy Between Text Detection and Text Recognition
Mingxin Huang, Yuliang Liu, Zhenghao Peng et al.
Swin Transformer V2: Scaling Up Capacity and Resolution
Ze Liu, Han Hu, Yutong Lin et al.
Sylph: A Hypernetwork Framework for Incremental Few-Shot Object Detection
Li Yin, Juan M. Perez-Rua, Kevin J. Liang
Symmetry and Uncertainty-Aware Object SLAM for 6DoF Object Pose Estimation
Nathaniel Merrill, Yuliang Guo, Xingxing Zuo et al.
Symmetry-Aware Neural Architecture for Embodied Visual Exploration
Shuang Liu, Takayuki Okatani
Syntax-Aware Network for Handwritten Mathematical Expression Recognition
Ye Yuan, Xiao Liu, Wondimu Dikubab et al.
Synthetic Aperture Imaging With Events and Frames
Wei Liao, Xiang Zhang, Lei Yu et al.
Synthetic Generation of Face Videos With Plethysmograph Physiology
Zhen Wang, Yunhao Ba, Pradyumna Chari et al.
TableFormer: Table Structure Understanding With Transformers
Ahmed Nassar, Nikolaos Livathinos, Maksym Lysak et al.
Talking Face Generation With Multilingual TTS
Hyoung-Kyu Song, Sang Hoon Woo, Junhyeok Lee et al.
Target-Aware Dual Adversarial Learning and a Multi-Scenario Multi-Modality Benchmark To Fuse Infrared and Visible for Object Detection
Jinyuan Liu, Xin Fan, Zhanbo Huang et al.
Targeted Supervised Contrastive Learning for Long-Tailed Recognition
Tianhong Li, Peng Cao, Yuan Yuan et al.
Target-Relevant Knowledge Preservation for Multi-Source Domain Adaptive Object Detection
Jiaxi Wu, Jiaxin Chen, Mengzhe He et al.
Task2Sim: Towards Effective Pre-Training and Transfer From Synthetic Data
Samarth Mishra, Rameswar Panda, Cheng Perng Phoo et al.