Papers
Variance Reduced Halpern Iteration for Finite-Sum Monotone Inclusions
Xufeng Cai, Ahmet Alacaoglu, Jelena Diakonikolas
Variational Bayesian Last Layers
James Harrison, John Willes, Jasper Snoek
Variational Inference for SDEs Driven by Fractional Noise
Rembert Daems, Manfred Opper, Guillaume Crevecoeur et al.
VBH-GNN: Variational Bayesian Heterogeneous Graph Neural Networks for Cross-subject Emotion Recognition
Chenyu Liu, Xinliang Zhou, Zhengri Zhu et al.
VCR-Graphormer: A Mini-batch Graph Transformer via Virtual Connections
Dongqi Fu, Zhigang Hua, Yan Xie et al.
VDC: Versatile Data Cleanser based on Visual-Linguistic Inconsistency by Multimodal Large Language Models
Zihao Zhu, Mingda Zhang, Shaokui Wei et al.
V-DETR: DETR with Vertex Relative Position Encoding for 3D Object Detection
Yichao Shen, Zigang Geng, Yuhui Yuan et al.
VDT: General-purpose Video Diffusion Transformers via Mask Modeling
Haoyu Lu, Guoxing Yang, Nanyi Fei et al.
Veil Privacy on Visual Data: Concealing Privacy for Humans, Unveiling for DNNs
Shuchao Pang, Ruhao Ma, Bing Li et al.
VeRA: Vector-based Random Matrix Adaptation
Dawid Jan Kopiczko, Tijmen Blankevoort, Yuki M Asano
Verified Safe Reinforcement Learning for Neural Network Dynamic Models
Junlin Wu, Huan Zhang, Yevgeniy Vorobeychik
VersVideo: Leveraging Enhanced Temporal Diffusion Models for Versatile Video Generation
Jinxi Xiang, Ricong Huang, Jun Zhang et al.
VertiBench: Advancing Feature Distribution Diversity in Vertical Federated Learning Benchmarks
Zhaomin Wu, Junyi Hou, Bingsheng He
VETRA: A Dataset for Vehicle Tracking in Aerial Imagery - New Challenges for Multi-Object Tracking
Jens Hellekes, Manuel Mühlhaus, Reza Bahmanyar et al.
VFLAIR: A Research Library and Benchmark for Vertical Federated Learning
Tianyuan Zou, Zixuan Gu, Yu He et al.
ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation
Jiaming Liu, Senqiao Yang, Peidong Jia et al.
VideoAgent: Long-form Video Understanding with Large Language Model as Agent
Xiaohan Wang, Yuhui Zhang, Orr Zohar et al.
Video Decomposition Prior: Editing Videos Layer by Layer
Gaurav Shrivastava, Ser-Nam Lim, Abhinav Shrivastava
Video Language Planning
Yilun Du, Sherry Yang, Pete Florence et al.
View-Consistent Hierarchical 3D Segmentation Using Ultrametric Feature Fields
Haodi He, Colton Stearns, Adam Harley et al.
Views Can Be Deceiving: Improved SSL Through Feature Space Augmentation
Kimia Hamidieh, Haoran Zhang, Swami Sankaranarayanan et al.
ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models
Ilker Kesen, Andrea Pedrotti, Mustafa Dogan et al.
Vision-by-Language for Training-Free Compositional Image Retrieval
Shyamgopal Karthik, Karsten Roth, Massimiliano Mancini et al.
Vision-Language Dual-Pattern Matching for Out-of-Distribution Detection
Zihan Zhang, Zhuo Xu, Xiang Xiang
Vision-Language Foundation Models as Effective Robot Imitators
Xinghang Li, Minghuan Liu, Hanbo Zhang et al.