Papers
4,428 papers found
TinyHD: Efficient Video Saliency Prediction With Heterogeneous Decoders Using Hierarchical Maps Distillation
Feiyan Hu, Simone Palazzo, Federica Proietto Salanitri et al.
Token Pooling in Vision Transformers for Image Classification
Dmitrii Marin, Jen-Hao Rick Chang, Anurag Ranjan et al.
Toward Edge-Efficient Dense Predictions With Synergistic Multi-Task Neural Architecture Search
Thanh Vu, Yanqi Zhou, Chunfeng Wen et al.
Towards a Framework for Privacy-Preserving Pedestrian Analysis
Anil Kunchala, Mélanie Bouroche, Bianca Schoen-Phelan
Towards Discriminative and Transferable One-Stage Few-Shot Object Detectors
Karim Guirguis, Mohamed Abdelsamad, George Eskandar et al.
Towards Disturbance-Free Visual Mobile Manipulation
Tianwei Ni, Kiana Ehsani, Luca Weihs et al.
Towards Equivariant Optical Flow Estimation With Deep Learning
Stefano Savian, Pietro Morerio, Alessio Del Bue et al.
Towards Few-Annotation Learning for Object Detection: Are Transformer-Based Models More Efficient?
Quentin Bouniot, Angélique Loesch, Romaric Audigier et al.
Towards Generating Ultra-High Resolution Talking-Face Videos With Lip Synchronization
Anchit Gupta, Rudrabha Mukhopadhyay, Sindhu Balachandra et al.
Towards Interpretable Video Anomaly Detection
Keval Doshi, Yasin Yilmaz
Towards MOOCs for Lipreading: Using Synthetic Talking Heads To Train Humans in Lipreading at Scale
Aditya Agarwal, Bipasha Sen, Rudrabha Mukhopadhyay et al.
Towards Online Domain Adaptive Object Detection
Vibashan VS, Poojan Oza, Vishal M. Patel
Tracking Growth and Decay of Plant Roots in Minirhizotron Images
Alexander Gillert, Bo Peters, Uwe Freiherr von Lukas et al.
Training Auxiliary Prototypical Classifiers for Explainable Anomaly Detection in Medical Image Segmentation
Wonwoo Cho, Jeonghoon Park, Jaegul Choo
Trans4Map: Revisiting Holistic Bird's-Eye-View Mapping From Egocentric Images to Allocentric Semantics With Vision Transformers
Chang Chen, Jiaming Zhang, Kailun Yang et al.
Transformers for Recognition in Overhead Imagery: A Reality Check
Francesco Luzi, Aneesh Gupta, Leslie Collins et al.
TransMOT: Spatial-Temporal Graph Transformer for Multiple Object Tracking
Peng Chu, Jiang Wang, Quanzeng You et al.
TransPillars: Coarse-To-Fine Aggregation for Multi-Frame 3D Object Detection
Zhipeng Luo, Gongjie Zhang, Changqing Zhou et al.
TransVLAD: Multi-Scale Attention-Based Global Descriptors for Visual Geo-Localization
Yifan Xu, Pourya Shamsolmoali, Eric Granger et al.
Treating Motion as Option To Reduce Motion Dependency in Unsupervised Video Object Segmentation
Suhwan Cho, Minhyeok Lee, Seunghoon Lee et al.
Treatment Learning Causal Transformer for Noisy Image Classification
Chao-Han Huck Yang, I-Te Hung, Yi-Chieh Liu et al.
TTTFlow: Unsupervised Test-Time Training With Normalizing Flow
David Osowiechi, Gustavo A. Vargas Hakim, Mehrdad Noori et al.
TVCalib: Camera Calibration for Sports Field Registration in Soccer
Jonas Theiner, Ralph Ewerth
TVT: Transferable Vision Transformer for Unsupervised Domain Adaptation
Jinyu Yang, Jingjing Liu, Ning Xu et al.
Two-Level Data Augmentation for Calibrated Multi-View Detection
Martin Engilberge, Haixin Shi, Zhiye Wang et al.