Papers
Voice Conversion Across Arbitrary Speakers Based on a Single Target-Speaker Utterance
Songxiang Liu, Jinghua Zhong, Lifa Sun et al.
Voice Conversion with Conditional SampleRNN
Cong Zhou, Michael Horgan, Vivek Kumar et al.
VoiceGuard: Secure and Private Speech Processing
Ferdinand Brasser, Tommaso Frassetto, Korbinian Riedhammer et al.
VoiceLoop: Voice Fitting and Synthesis via a Phonological Loop
Yaniv Taigman, Lior Wolf, Adam Polyak et al.
Voice-powered Solutions with Cloud AI
Dan Aharon
Voices Obscured in Complex Environmental Settings (VOiCES) Corpus
Colleen Richey, Maria A. Barrios, Zeb Armstrong et al.
Voice Source Contribution to Prominence Perception: Rd Implementation
Andy Murphy, Irena Yanushevskaya, Ailbhe Ní Chasaide et al.
Volumetric performance capture from minimal camera viewpoints
Andrew Gilbert, Marco Volino, John Collomosse et al.
Vowels and Diphthongs in Hangzhou Wu Chinese Dialect
Yang Yue, Fang Hu
Vowel Space as a Tool to Evaluate Articulation Problems
Rob van Son, Catherine Middag, Kris Demuynck
VoxCeleb2: Deep Speaker Recognition
Joon Son Chung, Arsha Nagrani, Andrew Zisserman
VoxelNet: End-to-End Learning for Point Cloud Based 3D Object Detection
Yin Zhou, Oncel Tuzel
VQA-E: Explaining, Elaborating, and Enhancing Your Answers for Visual Questions
Qing Li, Qingyi Tao, Shafiq Joty et al.
VSO: Visual Semantic Odometry
Konstantinos-Nektarios Lianos, Johannes L. Schönberger, Marc Pollefeys et al.
W2F: A Weakly-Supervised to Fully-Supervised Framework for Object Detection
Yongqiang Zhang, Yancheng Bai, Mingli Ding et al.
WARP-Text: a Web-Based Tool for Annotating Relationships between Pairs of Texts
Venelin Kovatchev, M. Antònia Martí, Maria Salamó
Wasserstein Auto-Encoders
Ilya Tolstikhin, Olivier Bousquet, Sylvain Gelly et al.
Wasserstein Distributionally Robust Kalman Filtering
Soroosh Shafieezadeh-Abadeh, Viet Anh Nguyen, Daniel Kuhn et al.
Wasserstein Divergence for GANs
Jiqing Wu, Zhiwu Huang, Janine Thoma et al.
Wasserstein Introspective Neural Networks
Kwonjoon Lee, Weijian Xu, Fan Fan et al.
Wasserstein Variational Inference
Luca Ambrogioni, Umut Güçlü, Yağmur Güçlütürk et al.
Watching a Small Portion could be as Good as Watching All: Towards Efficient Video Classification
Hehe Fan, Zhongwen Xu, Linchao Zhu et al.
Watch, Listen, and Describe: Globally and Locally Aligned Cross-Modal Attentions for Video Captioning
Xin Wang, Yuan-Fang Wang, William Yang Wang
Watch Your Step: Learning Node Embeddings via Graph Attention
Sami Abu-El-Haija, Bryan Perozzi, Rami Al-Rfou et al.
Waveform-Based Speaker Representations for Speech Synthesis
Moquan Wan, Gilles Degottex, Mark J.F. Gales