Papers
UniFormerV2: Unlocking the Potential of Image ViTs for Video Understanding
Kunchang Li, Yali Wang, Yinan He et al.
UniFusion: Unified Multi-View Fusion Transformer for Spatial-Temporal Representation in Bird's-Eye-View
Zequn Qin, Jingyu Chen, Chao Chen et al.
Unify, Align and Refine: Multi-Level Semantic Alignment for Radiology Report Generation
Yaowei Li, Bang Yang, Xuxin Cheng et al.
UniKD: Universal Knowledge Distillation for Mimicking Homogeneous or Heterogeneous Object Detectors
Shanshan Lao, Guanglu Song, Boxiao Liu et al.
Unilaterally Aggregated Contrastive Learning with Hierarchical Augmentation for Anomaly Detection
Guodong Wang, Yunhong Wang, Jie Qin et al.
UniSeg: A Unified Multi-Modal LiDAR Segmentation Network and the OpenPCSeg Codebase
Youquan Liu, Runnan Chen, Xin Li et al.
UniT3D: A Unified Transformer for 3D Dense Captioning and Visual Grounding
Zhenyu Chen, Ronghang Hu, Xinlei Chen et al.
UnitedHuman: Harnessing Multi-Source Data for High-Resolution Human Generation
Jianglin Fu, Shikai Li, Yuming Jiang et al.
UniTR: A Unified and Efficient Multi-Modal Transformer for Bird's-Eye-View Representation
Haiyang Wang, Hao Tang, Shaoshuai Shi et al.
Universal Domain Adaptation via Compressive Attention Matching
Didi Zhu, Yinchuan Li, Junkun Yuan et al.
UniverSeg: Universal Medical Image Segmentation
Victor Ion Butoi, Jose Javier Gonzalez Ortiz, Tianyu Ma et al.
UniVTG: Towards Unified Video-Language Temporal Grounding
Kevin Qinghong Lin, Pengchuan Zhang, Joya Chen et al.
Unleashing Text-to-Image Diffusion Models for Visual Perception
Wenliang Zhao, Yongming Rao, Zuyan Liu et al.
Unleashing the Potential of Spiking Neural Networks with Dynamic Confidence
Chen Li, Edward G Jones, Steve Furber
Unleashing the Power of Gradient Signal-to-Noise Ratio for Zero-Shot NAS
Zihao Sun, Yu Sun, Longxing Yang et al.
Unleashing Vanilla Vision Transformer with Masked Image Modeling for Object Detection
Yuxin Fang, Shusheng Yang, Shijie Wang et al.
UnLoc: A Unified Framework for Video Localization Tasks
Shen Yan, Xuehan Xiong, Arsha Nagrani et al.
Unmasked Teacher: Towards Training-Efficient Video Foundation Models
Kunchang Li, Yali Wang, Yizhuo Li et al.
Unmasking Anomalies in Road-Scene Segmentation
Shyam Nandan Rai, Fabio Cermelli, Dario Fontanel et al.
Unpaired Multi-domain Attribute Translation of 3D Facial Shapes with a Square and Symmetric Geometric Map
Zhenfeng Fan, Zhiheng Zhang, Shuang Yang et al.
Unsupervised 3D Perception with 2D Vision-Language Distillation for Autonomous Driving
Mahyar Najibi, Jingwei Ji, Yin Zhou et al.
Unsupervised Accuracy Estimation of Deep Visual Models using Domain-Adaptive Adversarial Perturbation without Source Samples
JoonHo Lee, Jae Oh Woo, Hankyu Moon et al.
Unsupervised Compositional Concepts Discovery with Text-to-Image Generative Models
Nan Liu, Yilun Du, Shuang Li et al.
Unsupervised Domain Adaptive Detection with Network Stability Analysis
Wenzhang Zhou, Heng Fan, Tiejian Luo et al.