Papers
176,624 papers found
Visually Indicated Sounds
Andrew Owens, Phillip Isola, Josh McDermott et al.
Visual Path Prediction in Complex Scenes With Crowded Moving Objects
YoungJoon Yoo, Kimin Yun, Sangdoo Yun et al.
Visual Question Answering with Question Representation Update (QRU)
Ruiyu Li, Jiaya Jia
Visual Speech Synthesis Using Dynamic Visemes, Contextual Features and DNNs
Ausdang Thangthai, Ben Milner, Sarah Taylor
Visual Tracking Using Attention-Modulated Disintegration and Integration
Jongwon Choi, Hyung Jin Chang, Jiyeoup Jeong et al.
Visual Word2Vec (vis-w2v): Learning Visually Grounded Word Embeddings Using Abstract Scenes
Satwik Kottur, Ramakrishna Vedantam, Jose M. F. Moura et al.
VLAD3: Encoding Dynamics of Deep Features for Action Recognition
Yingwei Li, Weixin Li, Vijay Mahadevan et al.
Vocal Effort Modification for Singing Synthesis
Olivier Perrotin, Christophe d’Alessandro
Vocal Tract Length Normalization for Speaker Independent Acoustic-to-Articulatory Speech Inversion
Ganesh Sivaraman, Vikramjit Mitra, Hosung Nam et al.
Voice Conversion Based on Matrix Variate Gaussian Mixture Model Using Multiple Frame Features
Yi Yang, Hidetsugu Uchida, Daisuke Saito et al.
Voice Conversion Based on Trajectory Model Training of Neural Networks Considering Global Variance
Naoki Hosaka, Kei Hashimoto, Keiichiro Oura et al.
Voice Quality Control Using Perceptual Expressions for Statistical Parametric Speech Synthesis Based on Cluster Adaptive Training
Yamato Ohtani, Koichiro Mori, Masahiro Morita
Voice-Quality Difference Between the Vowels in Filled Pauses and Ordinary Lexical Items
Kikuo Maekawa, Hiroki Mori
Volumetric 3D Tracking by Detection
Chun-Hao Huang, Benjamin Allain, Jean-Sebastien Franco et al.
Volumetric and Multi-View CNNs for Object Classification on 3D Data
Charles R. Qi, Hao Su, Matthias Niessner et al.
Volumetric Spanners: An Efficient Exploration Basis for Learning
Elad Hazan, Zohar Karnin
Voting Detector: A Combination of Anomaly Detectors to Reveal Annotation Errors in TTS Corpora
Jindřich Matoušek, Daniel Tihelka
Vowel Characteristics in the Assessment of L2 English Pronunciation
Calbert Graham, Paula Buttery, Francis Nolan
Vowels and Diphthongs in Cangnan Southern Min Chinese Dialect
Fang Hu, Chunyu Ge
Vowels and Diphthongs in the Taiyuan Jin Chinese Dialect
Liping Xia, Fang Hu
VoxSim: A Visual Platform for Modeling Motion Language
Nikhil Krishnaswamy, James Pustejovsky
Walk and Learn: Facial Attribute Representation Learning From Egocentric Video and Contextual Data
Jing Wang, Yu Cheng, Rogerio Schmidt Feris
WarpNet: Weakly Supervised Matching for Single-View Reconstruction
Angjoo Kanazawa, David W. Jacobs, Manmohan Chandraker
Wasserstein Training of Restricted Boltzmann Machines
Grégoire Montavon, Klaus-Robert Müller, Marco Cuturi