Papers
Unsupervised Visual Representation Learning by Online Constrained K-Means
Qi Qian, Yuanhong Xu, Juhua Hu et al.
UnweaveNet: Unweaving Activity Stories
Will Price, Carl Vondrick, Dima Damen
Upright-Net: Learning Upright Orientation for 3D Point Cloud
Xufang Pang, Feng Li, Ning Ding et al.
Urban Radiance Fields
Konstantinos Rematas, Andrew Liu, Pratul P. Srinivasan et al.
URetinex-Net: Retinex-Based Deep Unfolding Network for Low-Light Image Enhancement
Wenhui Wu, Jian Weng, Pingping Zhang et al.
Use All the Labels: A Hierarchical Multi-Label Contrastive Learning Framework
Shu Zhang, Ran Xu, Caiming Xiong et al.
Using 3D Topological Connectivity for Ghost Particle Reduction in Flow Reconstruction
Christina Tsalicoglou, Thomas Rösgen
UTC: A Unified Transformer With Inter-Task Contrastive Learning for Visual Dialog
Cheng Chen, Zhenshan Tan, Qingrong Cheng et al.
V2C: Visual Voice Cloning
Qi Chen, Mingkui Tan, Yuankai Qi et al.
VALHALLA: Visual Hallucination for Machine Translation
Yi Li, Rameswar Panda, Yoon Kim et al.
vCLIMB: A Novel Video Class Incremental Learning Benchmark
Andrés Villa, Kumail Alhamoud, Victor Escorcia et al.
V-Doc: Visual Questions Answers With Documents
Yihao Ding, Zhe Huang, Runlin Wang et al.
Vector Quantized Diffusion Model for Text-to-Image Synthesis
Shuyang Gu, Dong Chen, Jianmin Bao et al.
Vehicle Trajectory Prediction Works, but Not Everywhere
Mohammadhossein Bahari, Saeed Saadatnejad, Ahmad Rahimi et al.
Versatile Multi-Modal Pre-Training for Human-Centric Perception
Fangzhou Hong, Liang Pan, Zhongang Cai et al.
VGSE: Visually-Grounded Semantic Embeddings for Zero-Shot Learning
Wenjia Xu, Yongqin Xian, Jiuniu Wang et al.
Video Demoireing With Relation-Based Temporal Consistency
Peng Dai, Xin Yu, Lan Ma et al.
Video Frame Interpolation Transformer
Zhihao Shi, Xiangyu Xu, Xiaohong Liu et al.
Video Frame Interpolation With Transformer
Liying Lu, Ruizheng Wu, Huaijia Lin et al.
VideoINR: Learning Video Implicit Neural Representation for Continuous Space-Time Super-Resolution
Zeyuan Chen, Yinbo Chen, Jingwen Liu et al.
Video K-Net: A Simple, Strong, and Unified Baseline for Video Segmentation
Xiangtai Li, Wenwei Zhang, Jiangmiao Pang et al.
Video Shadow Detection via Spatio-Temporal Interpolation Consistency Training
Xiao Lu, Yihong Cao, Sheng Liu et al.
Video Swin Transformer
Ze Liu, Jia Ning, Yue Cao et al.
Video-Text Representation Learning via Differentiable Weak Temporal Alignment
Dohwan Ko, Joonmyung Choi, Juyeon Ko et al.
ViM: Out-of-Distribution With Virtual-Logit Matching
Haoqi Wang, Zhizhong Li, Litong Feng et al.