Papers
Using Next Sentence Prediction to Test ChatGPT’s Text Comprehension (Student Abstract)
Ojas M Agarwal, Madelein Villegas, Jack Mostow
Utilize the Flow Before Stepping into the Same River Twice: Certainty Represented Knowledge Flow for Refusal-Aware Instruction Tuning
Runchuan Zhu, Zhipeng Ma, Jiang Wu et al.
Utilizing Vision-Language Models for Detection of Leaf-Based Diseases in Tomatoes
James Blossom Eleojo
Utterance-level Emotion Recognition in Conversation with Conversation-level Supervision
Ximing Li, Yuanchao Dai, Zhiyao Yang et al.
V2C-CBM: Building Concept Bottlenecks with Vision-to-Concept Tokenizer
Hangzhou He, Lei Zhu, Xinliang Zhang et al.
V2Xum-LLM: Cross-Modal Video Summarization with Temporal Prompt Instruction Tuning
Hang Hua, Yunlong Tang, Chenliang Xu et al.
VA-AR: Learning Velocity-Aware Action Representations with Mixture of Window Attention
Jiangning Wei, Lixiong Qin, Bo Yu et al.
VarCMP: Adapting Cross-Modal Pre-Training Models for Video Anomaly Retrieval
Peng Wu, Wanshun Su, Xiangteng He et al.
VarDrop: Enhancing Training Efficiency by Reducing Variate Redundancy in Periodic Time Series Forecasting
Junhyeok Kang, Yooju Shin, Jae-Gil Lee
VCR: A “Cone of Experience” Driven Synthetic Data Generation Framework for Mathematical Reasoning
Sannyuya Liu, Jintian Feng, Xiaoxuan Shen et al.
VE-Bench: Subjective-Aligned Benchmark Suite for Text-Driven Video Editing Quality Assessment
Shangkun Sun, Xiaoyu Liang, Songlin Fan et al.
VEGAS: Towards Visually Explainable and Grounded Artificial Social Intelligence
Hao Li, Hao Fei, Zechao Hu et al.
Verification of Neural Networks Against Convolutional Perturbations via Parameterised Kernels
Benedikt Brückner, Alessio Lomuscio
Verifying Proportionality in Temporal Voting
Edith Elkind, Svetlana Obraztsova, Jannik Peters et al.
VerilogCoder: Autonomous Verilog Coding Agents with Graph-based Planning and Abstract Syntax Tree (AST)-based Waveform Tracing Tool
Chia-Tung Ho, Haoxing Ren, Brucek Khailany
VERO: Verification and Zero-Shot Feedback Acquisition for Few-Shot Multimodal Aspect-Level Sentiment Classification
Kai Sun, Hao Wu, Bin Shi et al.
VersaFusion: A Versatile Diffusion-Based Framework for Fine-Grained Image Editing and Enhancement
Haocun Ye, Xinlong Jiang, Chenlong Gao et al.
VersaGen: Unleashing Versatile Visual Control for Text-to-Image Synthesis
Zhipeng Chen, Lan Yang, Yonggang Qi et al.
VERSE: Verification-based Self-Play for Code Instructions
Hao Jiang, Qi Liu, Rui Li et al.
VFM-Adapter: Adapting Visual Foundation Models for Dense Prediction with Dynamic Hybrid Operation Mapping
Zheng Chen, Yu Zeng, Zehui Chen et al.
VG-TVP: Multimodal Procedural Planning via Visually Grounded Text-Video Prompting
Muhammet Furkan Ilaslan, Ali Köksal, Kevin Qinghong Lin et al.
VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis
Chao Pang, Xingxing Weng, Jiang Wu et al.
VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Captioning
Ji Soo Lee, Jongha Kim, Jeehye Na et al.
Video Anomaly Detection with Motion and Appearance Guided Patch Diffusion Model
Hang Zhou, Jiale Cai, Yuteng Ye et al.
Video Diffusion Models Are Strong Video Inpainter
Minhyeok Lee, Suhwan Cho, Chajin Shin et al.