Papers
XMP-Font: Self-Supervised Cross-Modality Pre-Training for Few-Shot Font Generation
Wei Liu, Fangyue Liu, Fei Ding et al.
X-Pool: Cross-Modal Language-Video Attention for Text-Video Retrieval
Satya Krishna Gorti, Noël Vouitsis, Junwei Ma et al.
X-Trans2Cap: Cross-Modal Knowledge Transfer Using Transformer for 3D Dense Captioning
Zhihao Yuan, Xu Yan, Yinghong Liao et al.
XYDeblur: Divide and Conquer for Single Image Deblurring
Seo-Won Ji, Jeongmin Lee, Seung-Wook Kim et al.
XYLayoutLM: Towards Layout-Aware Multimodal Networks for Visually-Rich Document Understanding
Zhangxuan Gu, Changhua Meng, Ke Wang et al.
YouMVOS: An Actor-Centric Multi-Shot Video Object Segmentation Dataset
Donglai Wei, Siddhant Kharbanda, Sarthak Arora et al.
ZebraPose: Coarse To Fine Surface Encoding for 6DoF Object Pose Estimation
Yongzhi Su, Mahdi Saleh, Torben Fetzer et al.
ZeroCap: Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic
Yoad Tewel, Yoav Shalev, Idan Schwartz et al.
Zero Experience Required: Plug & Play Modular Transfer Learning for Semantic Visual Navigation
Ziad Al-Halah, Santhosh Kumar Ramakrishnan, Kristen Grauman
Zero-Query Transfer Attacks on Context-Aware Object Detectors
Zikui Cai, Shantanu Rane, Alejandro E. Brito et al.
Zero-Shot Text-Guided Object Generation With Dream Fields
Ajay Jain, Ben Mildenhall, Jonathan T. Barron et al.
ZeroWaste Dataset: Towards Deformable Object Segmentation in Cluttered Scenes
Dina Bashkirova, Mohamed Abdelfattah, Ziliang Zhu et al.
Zoom in and Out: A Mixed-Scale Triplet Network for Camouflaged Object Detection
Youwei Pang, Xiaoqi Zhao, Tian-Zhu Xiang et al.
ZZ-Net: A Universal Rotation Equivariant Architecture for 2D Point Clouds
Georg Bökman, Fredrik Kahl, Axel Flinth
2D or not 2D? Adaptive 3D Convolution Selection for Efficient Video Recognition
Hengduo Li, Zuxuan Wu, Abhinav Shrivastava et al.
3D AffordanceNet: A Benchmark for Visual Object Affordance Understanding
Shengheng Deng, Xun Xu, Chaozheng Wu et al.
3DCaricShop: A Dataset and a Baseline Method for Single-View 3D Caricature Face Reconstruction
Yuda Qiu, Xiaojie Xu, Lingteng Qiu et al.
3D CNNs With Adaptive Temporal Feature Resolutions
Mohsen Fayyaz, Emad Bahrami, Ali Diba et al.
3D Graph Anatomy Geometry-Integrated Network for Pancreatic Mass Segmentation, Diagnosis, and Quantitative Patient Management
Tianyi Zhao, Kai Cao, Jiawen Yao et al.
3D Human Action Representation Learning via Cross-View Consistency Pursuit
Linguo Li, Minsi Wang, Bingbing Ni et al.
3DIoUMatch: Leveraging IoU Prediction for Semi-Supervised 3D Object Detection
He Wang, Yezhen Cong, Or Litany et al.
3D-MAN: 3D Multi-Frame Attention Network for Object Detection
Zetong Yang, Yin Zhou, Zhifeng Chen et al.
3D Object Detection With Pointformer
Xuran Pan, Zhuofan Xia, Shiji Song et al.
3D Shape Generation With Grid-Based Implicit Functions
Moritz Ibing, Isaak Lim, Leif Kobbelt
3D Spatial Recognition Without Spatially Labeled 3D
Zhongzheng Ren, Ishan Misra, Alexander G. Schwing et al.