Papers
VLSlice: Interactive Vision-and-Language Slice Discovery
Eric Slyman, Minsuk Kahng, Stefan Lee
VoroMesh: Learning Watertight Surface Meshes with Voronoi Diagrams
Nissim Maruani, Roman Klokov, Maks Ovsjanikov et al.
Vox-E: Text-Guided Voxel Editing of 3D Objects
Etai Sella, Gal Fiebelman, Peter Hedman et al.
VQ3D: Learning a 3D-Aware Generative Model on ImageNet
Kyle Sargent, Jing Yu Koh, Han Zhang et al.
VQA-GNN: Reasoning with Multimodal Knowledge via Graph Neural Networks for Visual Question Answering
Yanan Wang, Michihiro Yasunaga, Hongyu Ren et al.
VQA Therapy: Exploring Answer Differences by Visually Grounding Answers
Chongyan Chen, Samreen Anjum, Danna Gurari
Waffling Around for Performance: Visual Classification with Random Words and Broad Concepts
Karsten Roth, Jae Myung Kim, A. Sophia Koepke et al.
WALDO: Future Video Synthesis Using Object Layer Decomposition and Parametric Flow Prediction
Guillaume Le Moing, Jean Ponce, Cordelia Schmid
Walking Your LiDOG: A Journey Through Multiple Domains for LiDAR Semantic Segmentation
Cristiano Saltori, Aljosa Osep, Elisa Ricci et al.
WaterMask: Instance Segmentation for Underwater Imagery
Shijie Lian, Hua Li, Runmin Cong et al.
WaveIPT: Joint Attention and Flow Alignment in the Wavelet domain for Pose Transfer
Liyuan Ma, Tingwei Gao, Haitian Jiang et al.
WaveNeRF: Wavelet-based Generalizable Neural Radiance Fields
Muyu Xu, Fangneng Zhan, Jiahui Zhang et al.
WDiscOOD: Out-of-Distribution Detection via Whitened Linear Discriminant Analysis
Yiye Chen, Yunzhi Lin, Ruinian Xu et al.
Weakly-supervised 3D Pose Transfer with Keypoints
Jinnan Chen, Chen Li, Gim Hee Lee
Weakly-Supervised Action Localization by Hierarchically-Structured Latent Attention Modeling
Guiqin Wang, Peng Zhao, Cong Zhao et al.
Weakly-Supervised Action Segmentation and Unseen Error Detection in Anomalous Instructional Videos
Reza Ghoddoosian, Isht Dwivedi, Nakul Agarwal et al.
Weakly Supervised Learning of Semantic Correspondence through Cascaded Online Correspondence Refinement
Yiwen Huang, Yixuan Sun, Chenghang Lai et al.
Weakly Supervised Referring Image Segmentation with Intra-Chunk and Inter-Chunk Consistency
Jungbeom Lee, Sungjin Lee, Jinseok Nam et al.
Weakly-Supervised Text-Driven Contrastive Learning for Facial Behavior Understanding
Xiang Zhang, Taoyue Wang, Xiaotian Li et al.
What Can a Cook in Italy Teach a Mechanic in India? Action Recognition Generalisation Over Scenarios and Locations
Chiara Plizzari, Toby Perrett, Barbara Caputo et al.
What can Discriminator do? Towards Box-free Ownership Verification of Generative Adversarial Networks
Ziheng Huang, Boheng Li, Yan Cai et al.
What Can Simple Arithmetic Operations Do for Temporal Modeling?
Wenhao Wu, Yuxin Song, Zhun Sun et al.
What Does a Platypus Look Like? Generating Customized Prompts for Zero-Shot Image Classification
Sarah Pratt, Ian Covert, Rosanne Liu et al.
What does CLIP know about a red circle? Visual prompt engineering for VLMs
Aleksandar Shtedritski, Christian Rupprecht, Andrea Vedaldi