Papers
vMFCoOp: Towards Equilibrium on a Unified Hyperspherical Manifold for Prompting Biomedical VLMs
Minye Shao, Sihan Guo, Xinrun Li et al.
VoiceCloak: A Multi-Dimensional Defense Framework Against Unauthorized Diffusion-Based Voice Cloning
Qianyue Hu, Junyan Wu, Wei Lu et al.
Voices, Faces, and Feelings: Multi-modal Emotion-Cognition Captioning for Mental Health Understanding
Zhiyuan Zhou, Yanrong Guo, Shijie Hao
VORTEX: Aligning Task Utility and Human Preferences Through LLM-Guided Reward Shaping
Guojun Xiong, Milind Tambe
Voting in Divisible Settings: A Survey
Warut Suksompong, Nicholas Teh
VP-Bench: A Comprehensive Benchmark for Visual Prompting in Multimodal Large Language Models
Mingjie Xu, Jinpeng Chen, Yuzhi Zhao et al.
VPHO: Joint Visual-Physical Cue Learning and Aggregation for Hand-Object Pose Estimation
Jun Zhou, Chi Xu, Kaifeng Tang et al.
VPN: Visual Prompt Navigation
Shuo Feng, Zihan Wang, Yuchen Li et al.
V-Pruner: A Fast and Globally-informed Token Pruning Framework for Vision Transformer
Guangzhen Yao, Jiayun Zheng, Zezhou Wang et al.
VPSentry: Semi-supervised Video Polyp Segmentation via Sentry-guided Long-term Prototype Fusion with Correlation Dynamic Propagation
Guilian Chen, Xiaoling Luo, Huisi Wu et al.
VQAThinker: Exploring Generalizable and Explainable Video Quality Assessment via Reinforcement Learning
Linhan Cao, Wei Sun, Weixia Zhang et al.
VQ-Insight: Teaching VLMs for AI-Generated Video Quality Understanding via Progressive Visual Reinforcement Learning
Xuanyu Zhang, Weiqi Li, Shijie Zhao et al.
VRAgent-R1: Boosting Video Recommendation with MLLM-based Agents via Reinforcement Learning
Siran Chen, Boyu Chen, Yuxiao Luo et al.
VSPO: Validating Semantic Pitfalls in Ontology via LLM-Based CQ Generation
Hyojun Choi, Seokju Hwang, Kyong-Ho Lee
VTD-CLIP: Video-to-Text Discretization via Prompting CLIP
Wencheng Zhu, Yuexin Wang, Hongxuan Li et al.
VTinker: Guided Flow Upsampling and Texture Mapping for High-Resolution Video Frame Interpolation
Chenyang Wu, Jiayi Fu, Chun-Le Guo et al.
VulnBench: A Comprehensive Benchmark for Transformer-Based Vulnerability Detection
Jake Norton, David Eyers, Veronica Liesaputra
Vulnerability-Aware Robust Multimodal Adversarial Training
Junrui Zhang, Xinyu Zhao, Jie Peng et al.
W2S-AlignTree: Weak-to-Strong Inference-Time Alignment for Large Language Models via Monte Carlo Tree Search
Zhenyu Ding, Yuhao Wang, Tengyue Xiao et al.
Walk Before You Dance: High-fidelity and Editable Dance Synthesis via Generative Masked Motion Prior
Foram N Shah, Parshwa N Shah, Muhammad Usama Saleem et al.
Walking Further: Semantic-Aware Multimodal Gait Recognition Under Long-Range Conditions
Zhiyang Lu, Wen Jiang, Tianren Wu et al.
WALKSAFE: Risk-aware Graph Random Walk with Bi-GRPO for LLM Safety
Shilong Pan, Zhiliang Tian, Wanlong Yu et al.
Wasserstein-Aligned Hyperbolic Multi-View Clustering
Rui Wang, Yuting Jiang, Xiaoqing Luo et al.
Wasserstein-Aware Transfer: Class-Level Alignment for Robust Diffusion Model Adaptation
Zixian Huang, Chuan-Xian Ren
WaterMod: Modular Token-Rank Partitioning for Probability-Balanced LLM Watermarking
Shinwoo Park, Hyejin Park, Hyeseon Ahn et al.