Papers
4,428 papers found
Unified Control for Inference-Time Guidance of Denoising Diffusion Models
Maurya Goyal, Anuj Singh, Hadi Jamali-Rad
Unified Video Anomaly Detection Model for Detecting Different Anomaly Types
Kijung Lee, Youngwan Jo, Sunghyun Ahn et al.
UniGaze: Towards Universal Gaze Estimation via Large-scale Pre-Training
Jiawei Qin, Xucong Zhang, Yusuke Sugano
UniTabBank: A Large Scale Multi-Lingual, Multi-Layout, Multi-Type, Multi-Format Dataset for Table Detection
Ajoy Mondal, Saumya Mundra, Avijit Dasgupta et al.
Universal Neural Architecture Space: Covering ConvNets, Transformers and Everything in Between
Ondrej Tybl, Lukas Neumann
UniVid: Unifying Vision Tasks with Pre-trained Video Generation Models
Lan Chen, Yuchao Gu, Qi Mao
Unlocking Vision-Language Models for Video Anomaly Detection via Fine-Grained Prompting
Shu Zou, Xinyu Tian, Lukas Wesemann et al.
UNO: Unifying One-stage Video Scene Graph Generation via Object-Centric Visual Representation Learning
Huy Le, Nhat Chung, Tung Kieu et al.
Unsupervised Discovery of Long-Term Spatiotemporal Periodic Workflows in Human Activities
Fan Yang, Quanting Xie, Atsunori Moteki et al.
Unsupervised Memorability Modeling from Tip-of-the-Tongue Retrieval Queries
Sree Bhattacharyya, Yaman K. Singla, Sudhir Yarram et al.
Unsupervised Modular Adaptive Region Growing and RegionMix Classification for Wind Turbine Segmentation
Raül Pérez-Gonzalo, Riccardo Magro, Andreas Espersen et al.
Unsupervised Segmentation by Diffusing, Walking and Cutting
Daniela Ivanova, Marco Aversa, Paul Henderson et al.
Uplifting Table Tennis: A Robust, Real-World Application for 3D Trajectory and Spin Estimation
Daniel Kienzle, Katja Ludwig, Julian Lorenz et al.
V2XScene: Multi-View Consistent 3D Scene Simulation for Collaborative Perception
Yanfei Li, Yi Gong, Yuan Zeng
VADER: Towards Causal Video Anomaly Understanding with Relation-Aware Large Language Models
Ying Cheng, Yu-Ho Lin, Min-Hung Chen et al.
VAST-ReID: A Low-Light Benchmark Dataset for Person Re-Identification with Visual and Attribute-Rich Semantic Tracking
Hammad Khan, Rakesh Kumar Giri, Kamalakar Vijay Thakare et al.
VectorSynth: Fine-Grained Satellite Image Synthesis with Structured Semantics
Daniel Cher, Brian Wei, Srikumar Sastry et al.
VFace: A Training-Free Approach for Diffusion-Based Video Face Swapping
Sanoojan Baliah, Yohan Abeysinghe, Rusiru Thushara et al.
Video and Language Alignment in 2D Systems for 3D Multi-object Scenes with Multi-Information Derivative-Free Control
Jason Armitage, Rico Sennrich
VideoSketcher: A Training-Free Approach for Coherent Video Sketch Transfer
Huining Li, Bangzhen Liu, Rui Yang et al.
View-aware Cross-modal Distillation for Multi-view Action Recognition
Trung Thanh Nguyen, Yasutomo Kawanishi, Vijay John et al.
ViGG: Robust RGB-D Point Cloud Registration using Visual-Geometric Mutual Guidance
Congjia Chen, Shen Yan, Yufu Qu
Visibility guided Self-Supervised Occlusion-Resilient Human Pose Estimation
Arindam Dutta, Sarosij Bose, Rohit Kundu et al.
Vision-informed Semantic Text Alignment for Open-set Recognition in Remote Sensing
Siddhant Gole, Akash Pal, Ankit Jha et al.