Papers
Virtual Correspondence: Humans as a Cue for Extreme-View Geometry
Wei-Chiu Ma, Anqi Joyce Yang, Shenlong Wang et al.
Virtual Elastic Objects
Hsiao-yu Chen, Edith Tretschk, Tuur Stuyck et al.
VisCUIT: Visual Auditor for Bias in CNN Image Classifier
Seongmin Lee, Zijie J. Wang, Judy Hoffman et al.
Visible-Thermal UAV Tracking: A Large-Scale Benchmark and New Baseline
Pengyu Zhang, Jie Zhao, Dong Wang et al.
Vision-Language Pre-Training for Boosting Scene Text Detectors
Sibo Song, Jianqiang Wan, Zhibo Yang et al.
Vision-Language Pre-Training With Triple Contrastive Learning
Jinyu Yang, Jiali Duan, Son Tran et al.
Vision Transformer Slimming: Multi-Dimension Searching in Continuous Optimization Space
Arnav Chavan, Zhiqiang Shen, Zhuang Liu et al.
Vision Transformer With Deformable Attention
Zhuofan Xia, Xuran Pan, Shiji Song et al.
VISOLO: Grid-Based Space-Time Aggregation for Efficient Online Video Instance Segmentation
Su Ho Han, Sukjun Hwang, Seoung Wug Oh et al.
VISTA: Boosting 3D Object Detection via Dual Cross-VIew SpaTial Attention
Shengheng Deng, Zhihao Liang, Lin Sun et al.
ViSTA: Vision and Scene Text Aggregation for Cross-Modal Retrieval
Mengjun Cheng, Yipeng Sun, Longchao Wang et al.
Visual Abductive Reasoning
Chen Liang, Wenguan Wang, Tianfei Zhou et al.
Visual Acoustic Matching
Changan Chen, Ruohan Gao, Paul Calamia et al.
VisualGPT: Data-Efficient Adaptation of Pretrained Language Models for Image Captioning
Jun Chen, Han Guo, Kai Yi et al.
VisualHow: Multimodal Problem Solving
Jinhui Yang, Xianyu Chen, Ming Jiang et al.
Visual Vibration Tomography: Estimating Interior Material Properties From Monocular Video
Berthy T. Feng, Alexander C. Ogren, Chiara Daraio et al.
VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks
Yi-Lin Sung, Jaemin Cho, Mohit Bansal
VL-InterpreT: An Interactive Visualization Tool for Interpreting Vision-Language Transformers
Estelle Aflalo, Meng Du, Shao-Yen Tseng et al.
Vox2Cortex: Fast Explicit Reconstruction of Cortical Surfaces From 3D MRI Scans With Geometric Deep Neural Networks
Fabian Bongratz, Anne-Marie Rickmann, Sebastian Pölsterl et al.
Voxel Field Fusion for 3D Object Detection
Yanwei Li, Xiaojuan Qi, Yukang Chen et al.
Voxel Set Transformer: A Set-to-Set Approach to 3D Object Detection From Point Clouds
Chenhang He, Ruihuang Li, Shuai Li et al.
VRDFormer: End-to-End Video Visual Relation Detection With Transformers
Sipeng Zheng, Shizhe Chen, Qin Jin
WALT: Watch and Learn 2D Amodal Representation From Time-Lapse Imagery
N. Dinesh Reddy, Robert Tamburo, Srinivasa G. Narasimhan
WarpingGAN: Warping Multiple Uniform Priors for Adversarial 3D Point Cloud Generation
Yingzhi Tang, Yue Qian, Qijian Zhang et al.