Papers
8,506 papers found
Visual Semantic Reasoning for Image-Text Matching
Kunpeng Li, Yulun Zhang, Kai Li et al.
VrR-VG: Refocusing Visually-Relevant Relationships
Yuanzhi Liang, Yalong Bai, Wei Zhang et al.
VTNFP: An Image-Based Virtual Try-On Network With Body and Clothing Feature Preservation
Ruiyun Yu, Xiaoqi Wang, Xiaohui Xie
VV-Net: Voxel VAE Net With Group Convolutions for Point Cloud Segmentation
Hsien-Yu Meng, Lin Gao, Yu-Kun Lai et al.
Wasserstein GAN With Quadratic Transport Cost
Huidong Liu, Xianfeng Gu, Dimitris Samaras
Watch, Listen and Tell: Multi-Modal Weakly Supervised Dense Event Captioning
Tanzila Rahman, Bicheng Xu, Leonid Sigal
Wavelet Domain Style Transfer for an Effective Perception-Distortion Tradeoff in Single Image Super-Resolution
Xin Deng, Ren Yang, Mai Xu et al.
Weakly Aligned Cross-Modal Learning for Multispectral Pedestrian Detection
Lu Zhang, Xiangyu Zhu, Xiangyu Chen et al.
Weakly-Supervised Action Localization With Background Modeling
Phuc Xuan Nguyen, Deva Ramanan, Charless C. Fowlkes
Weakly Supervised Energy-Based Learning for Action Segmentation
Jun Li, Peng Lei, Sinisa Todorovic
Weakly Supervised Object Detection With Segmentation Collaboration
Xiaoyan Li, Meina Kan, Shiguang Shan et al.
Weakly Supervised Temporal Action Localization Through Contrast Based Evaluation Networks
Ziyi Liu, Le Wang, Qilin Zhang et al.
What Else Can Fool Deep Learning? Addressing Color Constancy Errors on Deep Neural Network Performance
Mahmoud Afifi, Michael S. Brown
What Is Wrong With Scene Text Recognition Model Comparisons? Dataset and Model Analysis
Jeonghun Baek, Geewook Kim, Junyeop Lee et al.
What Synthesis Is Missing: Depth Adaptation Integrated With Weak Supervision for Indoor Scene Parsing
Keng-Chi Liu, Yi-Ting Shen, Jan P. Klopp et al.
What Would You Expect? Anticipating Egocentric Actions With Rolling-Unrolling LSTMs and Modality Attention
Antonino Furnari, Giovanni Maria Farinella
Where Is My Mirror?
Xin Yang, Haiyang Mei, Ke Xu et al.
Why Does a Visual Question Have Different Answers?
Nilavra Bhattacharya, Qing Li, Danna Gurari
WoodScape: A Multi-Task, Multi-Camera Fisheye Dataset for Autonomous Driving
Senthil Yogamani, Ciaran Hughes, Jonathan Horgan et al.
WSOD2: Learning Bottom-Up and Top-Down Objectness Distillation for Weakly-Supervised Object Detection
Zhaoyang Zeng, Bei Liu, Jianlong Fu et al.
XRAI: Better Attributions Through Regions
Andrei Kapishnikov, Tolga Bolukbasi, Fernanda Viegas et al.
xR-EgoPose: Egocentric 3D Human Pose From an HMD Camera
Denis Tome, Patrick Peluse, Lourdes Agapito et al.
X-Section: Cross-Section Prediction for Enhanced RGB-D Fusion
Andrea Nicastro, Ronald Clark, Stefan Leutenegger
YOLACT: Real-Time Instance Segmentation
Daniel Bolya, Chong Zhou, Fanyi Xiao et al.
Zero-Shot Anticipation for Instructional Activities
Fadime Sener, Angela Yao