Papers
Virtual Sparse Convolution for Multimodal 3D Object Detection
Hai Wu, Chenglu Wen, Shaoshuai Shi et al.
VisFusion: Visibility-Aware Online 3D Scene Reconstruction From Videos
Huiyu Gao, Wei Mao, Miaomiao Liu
Visibility Aware Human-Object Interaction Tracking From Single RGB Camera
Xianghui Xie, Bharat Lal Bhatnagar, Gerard Pons-Moll
Visibility Constrained Wide-Band Illumination Spectrum Design for Seeing-in-the-Dark
Muyao Niu, Zhuoxiao Li, Zhihang Zhong et al.
Vision Transformers Are Good Mask Auto-Labelers
Shiyi Lan, Xitong Yang, Zhiding Yu et al.
Vision Transformers Are Parameter-Efficient Audio-Visual Learners
Yan-Bo Lin, Yi-Lin Sung, Jie Lei et al.
Visual Atoms: Pre-Training Vision Transformers With Sinusoidal Waves
Sora Takashima, Ryo Hayamizu, Nakamasa Inoue et al.
Visual Dependency Transformers: Dependency Tree Emerges From Reversed Attention
Mingyu Ding, Yikang Shen, Lijie Fan et al.
Visual DNA: Representing and Comparing Images Using Distributions of Neuron Activations
Benjamin Ramtoula, Matthew Gadd, Paul Newman et al.
Visual Exemplar Driven Task-Prompting for Unified Perception in Autonomous Driving
Xiwen Liang, Minzhe Niu, Jianhua Han et al.
Visual Language Pretrained Multiple Instance Zero-Shot Transfer for Histopathology Images
Ming Y. Lu, Bowen Chen, Andrew Zhang et al.
Visual-Language Prompt Tuning With Knowledge-Guided Context Optimization
Hantao Yao, Rui Zhang, Changsheng Xu
Visual Localization Using Imperfect 3D Models From the Internet
Vojtech Panek, Zuzana Kukelova, Torsten Sattler
Visual Programming: Compositional Visual Reasoning Without Training
Tanmay Gupta, Aniruddha Kembhavi
Visual Prompt Multi-Modal Tracking
Jiawen Zhu, Simiao Lai, Xin Chen et al.
Visual Prompt Tuning for Generative Transfer Learning
Kihyuk Sohn, Huiwen Chang, José Lezama et al.
Visual Query Tuning: Towards Effective Usage of Intermediate Representations for Parameter and Memory Efficient Transfer Learning
Cheng-Hao Tu, Zheda Mai, Wei-Lun Chao
Visual Recognition by Request
Chufeng Tang, Lingxi Xie, Xiaopeng Zhang et al.
Visual Recognition-Driven Image Restoration for Multiple Degradation With Intrinsic Semantics Recovery
Zizheng Yang, Jie Huang, Jiahao Chang et al.
Visual-Tactile Sensing for In-Hand Object Reconstruction
Wenqiang Xu, Zhenjun Yu, Han Xue et al.
Vita-CLIP: Video and Text Adaptive CLIP via Multimodal Prompting
Syed Talal Wasim, Muzammal Naseer, Salman Khan et al.
ViTs for SITS: Vision Transformers for Satellite Image Time Series
Michail Tarasiou, Erik Chavez, Stefanos Zafeiriou
VIVE3D: Viewpoint-Independent Video Editing Using 3D-Aware GANs
Anna Frühstück, Nikolaos Sarafianos, Yuanlu Xu et al.
VLPD: Context-Aware Pedestrian Detection via Vision-Language Semantic Self-Supervision
Mengyin Liu, Jie Jiang, Chao Zhu et al.
VL-SAT: Visual-Linguistic Semantics Assisted Training for 3D Semantic Scene Graph Prediction in Point Cloud
Ziqin Wang, Bowen Cheng, Lichen Zhao et al.