Papers
ViP: Virtual Pooling for Accelerating CNN-based Image Classification and Object Detection
Zhuo Chen, Jiyuan Zhang, Ruizhou Ding et al.
Visual Question Answering on 360deg Images
Shih-Han Chou, Wei-Lun Chao, Wei-Sheng Lai et al.
VRT-Net: Real-Time Scene Parsing via Variable Resolution Transform
Jogendra Nath Kundu, Gaurav Singh Rajput, Venkatesh Babu RADHAKRISHNAN
Watch to Listen Clearly: Visual Speech Enhancement Driven Multi-modality Speech Recognition
Bo Xu, Jacob Wang, Cheng Lu et al.
Weakly Supervised Gaussian Networks for Action Detection
Basura Fernando, Cheston Tan, Hakan Bilen
Weakly Supervised Graph Convolutional Neural Network for Human Action Localization
Daisuke Miki, Shi Chen, Kazuyuki Demachi
Weakly-Supervised Multi-Person Action Recognition in 360$^{\circ}$ Videos
Junnan Li, Jianquan Liu, Wong Yongkang et al.
Weakly Supervised Temporal Action Localization Using Deep Metric Learning
Ashraful Islam, Richard Radke
Wide Hidden Expansion Layer for Deep Convolutional Neural Networks
Min Wang, Baoyuan Liu, Hassan Foroosh
Word-level Deep Sign Language Recognition from Video: A New Large-scale Dataset and Methods Comparison
DONGXU LI, Cristian Rodriguez, Xin Yu et al.