Papers
4,428 papers found
Uncertainty-Aware Interactive LiDAR Sampling for Deep Depth Completion
Kensuke Taguchi, Shogo Morita, Yusuke Hayashi et al.
Uncertainty-Aware Label Distribution Learning for Facial Expression Recognition
Nhat Le, Khanh Nguyen, Quang Tran et al.
Understanding the Role of Mixup in Knowledge Distillation: An Empirical Study
Hongjun Choi, Eun Som Jeon, Ankita Shukla et al.
Unifying Distribution Alignment as a Loss for Imbalanced Semi-Supervised Learning
Justin Lazarow, Kihyuk Sohn, Chen-Yu Lee et al.
Unifying Margin-Based Softmax Losses in Face Recognition
Yang Zhang, Simao Herdade, Kapil Thadani et al.
Universal Deep Image Compression via Content-Adaptive Optimization With Adapters
Koki Tsubota, Hiroaki Akutsu, Kiyoharu Aizawa
Unsupervised 4D LiDAR Moving Object Segmentation in Stationary Settings With Multivariate Occupancy Time Series
Thomas Kreutz, Max Mühlhäuser, Alejandro Sanchez Guinea
Unsupervised Audio-Visual Lecture Segmentation
Darshan Singh S., Anchit Gupta, C. V. Jawahar et al.
Unsupervised Multi-Object Segmentation Using Attention and Soft-Argmax
Bruno Sauvalle, Arnaud de La Fortelle
Unsupervised Video Object Segmentation via Prototype Memory Network
Minhyeok Lee, Suhwan Cho, Seunghoon Lee et al.
UPAR: Unified Pedestrian Attribute Recognition and Person Retrieval
Andreas Specker, Mickael Cormier, Jürgen Beyerer
Uplift and Upsample: Efficient 3D Human Pose Estimation With Uplifting Transformers
Moritz Einfalt, Katja Ludwig, Rainer Lienhart
Urban Scene Semantic Segmentation With Low-Cost Coarse Annotation
Anurag Das, Yongqin Xian, Yang He et al.
UVCGAN: UNet Vision Transformer Cycle-Consistent GAN for Unpaired Image-to-Image Translation
Dmitrii Torbunov, Yi Huang, Haiwang Yu et al.
Video Joint Denoising and Demosaicing With Recurrent CNNs
Valéry Dewil, Adrien Courtois, Mariano Rodríguez et al.
Video Object Matting via Hierarchical Space-Time Semantic Guidance
Yumeng Wang, Bo Xu, Ziwen Li et al.
ViewCLR: Learning Self-Supervised Video Representation for Unseen Viewpoints
Srijan Das, Michael S. Ryoo
VirtualHome Action Genome: A Simulated Spatio-Temporal Scene Graph Dataset With Consistent Relationship Labels
Yue Qiu, Yoshiki Nagasaki, Kensho Hara et al.
Vis2Rec: A Large-Scale Visual Dataset for Visit Recommendation
Michaël Soumm, Adrian Popescu, Bertrand Delezoide
Vision Transformer for NeRF-Based View Synthesis From a Single Input Image
Kai-En Lin, Yen-Chen Lin, Wei-Sheng Lai et al.
Visually Explaining 3D-CNN Predictions for Video Classification With an Adaptive Occlusion Sensitivity Analysis
Tomoki Uchiyama, Naoya Sogi, Koichiro Niinuma et al.
VLC-BERT: Visual Question Answering With Contextualized Commonsense Knowledge
Sahithya Ravi, Aditya Chinchure, Leonid Sigal et al.
VSGD-Net: Virtual Staining Guided Melanocyte Detection on Histopathological Images
Kechun Liu, Beibin Li, Wenjun Wu et al.
Watching the News: Towards VideoQA Models That Can Read
Soumya Jahagirdar, Minesh Mathew, Dimosthenis Karatzas et al.