Papers
Venom: Liquid Diffusion-Guided Gradient Inversion for Breaking Differential Privacy in Federated Learning
Bin Hu, Jingling Yuan, Jiawei Jiang et al.
Verb Mirage: Unveiling and Assessing Verb Concept Hallucinations in Multimodal Large Language Models
Zehao Wang, Xinpeng Liu, Yudonglin Zhang et al.
VeriFlow: Modeling Distributions for Neural Network Verification
Faried Abu Zaid, Daniel Neider, Mustafa Yalçıner
VerifyBench: A Systematic Benchmark for Evaluating Reasoning Verifiers Across Domains
Xuzhao Li, Xuchen Li, Shiyu Hu et al.
Versatile Vision-Language Model for 3D Computed Tomography
Jiayu Lei, Ziqing Fan, Yanyong Zhang et al.
VFCionX: Bridging Large and Small Models for Robust Vulnerability-Fixing Commit Identification
Xing Cui, Jingzheng Wu, Wenxiang Ou et al.
VGD: Value-Guided Diffusion Toward High-Utility Medical Image Segmentation
Hongyu Zhang, Haipeng Chen, Chengxin Yang et al.
VGGS: VGGT-guided Gaussian Splatting for Efficient and Faithful Sparse-View Surface Reconstruction
Peng Xiang, Liang Han, Hui Zhang et al.
VGGTFace: Topologically Consistent Facial Geometry Reconstruction in the Wild
Xin Ming, Yuxuan Han, Tianyu Huang et al.
ViCToR: Improving Visual Comprehension via Token Reconstruction for Pretraining LMMs
Yin Xie, Kaicheng Yang, Peirou Liang et al.
Video Camera Trajectory Editing with Generative Rendering from Estimated Geometry
Junyoung Seo, Jisang Han, Jaewoo Jung et al.
VideoChat-A1: Thinking with Long Videos by Chain-of-Shot Reasoning
Zikang Wang, Boyu Chen, Zhengrong Yue et al.
Video Echoed in Music: Semantic, Temporal, and Rhythmic Alignment for Video-to-Music Generation
Xinyi Tong, Yiran Zhu, Jishang Chen et al.
Video Mirror Detection with the Motion-in-Depth Cue
Alex Warren, Ke Xu, Xin Tian et al.
VideoSeg-R1: Reasoning Video Object Segmentation via Reinforcement Learning
Zishan Xu, Yifu Guo, Yuquan Lu et al.
Video SimpleQA: Towards Factuality Evaluation in Large Video Language Models
Meng Cao, Pengfei Hu, Yingyao Wang et al.
Video Spatial Reasoning with Object-Centric 3D Rollout
Haoran Tang, Meng Cao, Ruyang Liu et al.
ViDia2Std: A Parallel Corpus and Methods for Low-Resource Vietnamese Dialect-to-Standard Translation
Khoa Anh Ta, Nguyen Van Dinh, Kiet Van Nguyen
VietCheckMed: Explainable Regulatory Compliance Checking for Medical Advertisements on Vietnamese Social Media
Nguyen Thanh Tam, Khanh Quoc Tran, Dat Thanh Pham et al.
View-on-Graph: Zero-Shot 3D Visual Grounding via Vision-Language Reasoning on Scene Graphs
Yuanyuan Liu, Haiyang Mei, Dongyang Zhan et al.
Views Attention Fusion of Granular-ball Fuzzy Representations Split for Improved Multi-view Clustering
Shuaiyu Liu, Song Wu, Jie Xu et al.
ViG-RAG: Video-aware Graph Retrieval-Augmented Generation via Temporal and Semantic Hybrid Reasoning
Zongsheng Cao, Anran Liu, Yangfan He et al.
VIL2C: Value-of-Information Aware Low-Latency Communication for Multi-Agent Reinforcement Learning
Qian Zhang, Zhuo Sun, Yao Zhang et al.
VILTA: A VLM-in-the-Loop Adversary for Enhancing Driving Policy Robustness
Qimao Chen, Fang Li, Shaoqing Xu et al.
VipAct: Visual-Perception Enhancement via Specialized VLM Agent Collaboration and Tool-use
Zhehao Zhang, Ryan A. Rossi, Tong Yu et al.