Papers
4,428 papers found
Splatting-Based Synthesis for Video Frame Interpolation
Simon Niklaus, Ping Hu, Jiawen Chen
Split To Learn: Gradient Split for Multi-Task Human Image Analysis
Weijian Deng, Yumin Suh, Xiang Yu et al.
SSFE-Net: Self-Supervised Feature Enhancement for Ultra-Fine-Grained Few-Shot Class Incremental Learning
Zicheng Pan, Xiaohan Yu, Miaohua Zhang et al.
SSSD: Self-Supervised Self Distillation
Wei-Chi Chen, Wei-Ta Chu
STAR-Transformer: A Spatio-Temporal Cross Attention Transformer for Human Action Recognition
Dasom Ahn, Sangwon Kim, Hyunsu Hong et al.
Stop or Forward: Dynamic Layer Skipping for Efficient Action Recognition
Jonghyeon Seon, Jaedong Hwang, Jonghwan Mun et al.
Structure-Encoding Auxiliary Tasks for Improved Visual Representation in Vision-and-Language Navigation
Chia-Wen Kuo, Chih-Yao Ma, Judy Hoffman et al.
Style-Guided Inference of Transformer for High-Resolution Image Synthesis
Jonghwa Yim, Minjae Kim
Surface Normal Estimation From Optimized and Distributed Light Sources Using DNN-Based Photometric Stereo
Takafumi Iwaguchi, Hiroshi Kawasaki
SVD-NAS: Coupling Low-Rank Approximation and Neural Architecture Search
Zhewen Yu, Christos-Savvas Bouganis
Switching to Discriminative Image Captioning by Relieving a Bottleneck of Reinforcement Learning
Ukyo Honda, Taro Watanabe, Yuji Matsumoto
Synthetic Latent Fingerprint Generator
André Brasil Vieira Wyzykowski, Anil K. Jain
Task Agnostic and Post-Hoc Unseen Distribution Detection
Radhika Dua, Seongjun Yang, Yixuan Li et al.
TCAM: Temporal Class Activation Maps for Object Localization in Weakly-Labeled Unconstrained Videos
Soufiane Belharbi, Ismail Ben Ayed, Luke McCaffrey et al.
Temporally Consistent Online Depth Estimation in Dynamic Scenes
Zhaoshuo Li, Wei Ye, Dilin Wang et al.
TeST: Test-Time Self-Training Under Distribution Shift
Samarth Sinha, Peter Gehler, Francesco Locatello et al.
Text and Image Guided 3D Avatar Generation and Manipulation
Zehranaz Canfes, M. Furkan Atasoy, Alara Dirik et al.
Text-Guided Object Detector for Multi-Modal Video Question Answering
Ruoyue Shen, Nakamasa Inoue, Koichi Shinoda
The Box Size Confidence Bias Harms Your Object Detector
Johannes Gilg, Torben Teepe, Fabian Herzog et al.
The Change You Want To See
Ragav Sachdeva, Andrew Zisserman
The CropAndWeed Dataset: A Multi-Modal Learning Approach for Efficient Crop and Weed Manipulation
Daniel Steininger, Andreas Trondl, Gerardus Croonen et al.
The Fully Convolutional Transformer for Medical Image Segmentation
Athanasios Tragakis, Chaitanya Kaul, Roderick Murray-Smith et al.
THOR-Net: End-to-End Graformer-Based Realistic Two Hands and Object Reconstruction With Self-Supervision
Ahmed Tawfik Aboukhadra, Jameel Malik, Ahmed Elhayek et al.
TI2Net: Temporal Identity Inconsistency Network for Deepfake Detection
Baoping Liu, Bo Liu, Ming Ding et al.