Papers
Visual Graph Memory With Unsupervised Representation for Visual Navigation
Obin Kwon, Nuri Kim, Yunho Choi et al.
Visual Relationship Detection Using Part-and-Sum Transformers With Composite Queries
Qi Dong, Zhuowen Tu, Haofu Liao et al.
Visual Saliency Transformer
Nian Liu, Ni Zhang, Kaiyuan Wan et al.
Visual Scene Graphs for Audio Source Separation
Moitreya Chatterjee, Jonathan Le Roux, Narendra Ahuja et al.
Visual-Textual Attentive Semantic Consistency for Medical Report Generation
Yi Zhou, Lei Huang, Tao Zhou et al.
Visual Transformers: Where Do Transformers Really Belong in Vision Models?
Bichen Wu, Chenfeng Xu, Xiaoliang Dai et al.
ViViT: A Video Vision Transformer
Anurag Arnab, Mostafa Dehghani, Georg Heigold et al.
VLGrammar: Grounded Grammar Induction of Vision and Language
Yining Hong, Qing Li, Song-Chun Zhu et al.
VMNet: Voxel-Mesh Network for Geodesic-Aware 3D Semantic Segmentation
Zeyu Hu, Xuyang Bai, Jiaxiang Shang et al.
VolumeFusion: Deep Depth Fusion for 3D Scene Reconstruction
Jaesung Choe, Sunghoon Im, Francois Rameau et al.
von Mises-Fisher Loss: An Exploration of Embedding Geometries for Supervised Learning
Tyler R. Scott, Andrew C. Gallagher, Michael C. Mozer
Voxel-Based Network for Shape Completion by Leveraging Edge Generation
Xiaogang Wang, Marcelo H Ang, Gim Hee Lee
Voxel Transformer for 3D Object Detection
Jiageng Mao, Yujing Xue, Minzhe Niu et al.
VSAC: Efficient and Accurate Estimator for H and F
Maksym Ivashechkin, Daniel Barath, Jiří Matas
Walk in the Cloud: Learning Curves for Point Clouds Shape Analysis
Tiange Xiang, Chaoyi Zhang, Yang Song et al.
Wanderlust: Online Continual Object Detection in the Real World
Jianren Wang, Xin Wang, Yue Shang-Guan et al.
Warp Consistency for Unsupervised Learning of Dense Correspondences
Prune Truong, Martin Danelljan, Fisher Yu et al.
WarpedGANSpace: Finding Non-Linear RBF Paths in GAN Latent Space
Christos Tzelepis, Georgios Tzimiropoulos, Ioannis Patras
Warp-Refine Propagation: Semi-Supervised Auto-Labeling via Cycle-Consistency
Aditya Ganeshan, Alexis Vallet, Yasunori Kudo et al.
Wasserstein Coupled Graph Learning for Cross-Modal Retrieval
Yun Wang, Tong Zhang, Xueya Zhang et al.
Watch Only Once: An End-to-End Video Action Detection Framework
Shoufa Chen, Peize Sun, Enze Xie et al.
WaveFill: A Wavelet-Based Generation Network for Image Inpainting
Yingchen Yu, Fangneng Zhan, Shijian Lu et al.
Waypoint Models for Instruction-Guided Navigation in Continuous Environments
Jacob Krantz, Aaron Gokaslan, Dhruv Batra et al.
WB-DETR: Transformer-Based Detector Without Backbone
Fanfan Liu, Haoran Wei, Wenzhe Zhao et al.
Weak Adaptation Learning: Addressing Cross-Domain Data Insufficiency With Weak Annotator
Shichao Xu, Lixu Wang, Yixuan Wang et al.