Papers
A Voice Conversion Framework with Tandem Feature Sparse Representation and Speaker-Adapted WaveNet Vocoder
Berrak Sisman, Mingyang Zhang, Haizhou Li
Avoiding Speaker Overfitting in End-to-End DNNs Using Raw Waveform for Text-Independent Speaker Verification
Jee-weon Jung, Hee-soo Heo, IL-ho Yang et al.
A Weighted Superposition of Functional Contours Model for Modelling Contextual Prominence of Elementary Prosodic Contours
Branislav Gerazov, Gérard Bailly, Yi Xu
Bags in Bag: Generating Context-Aware Bags for Tracking Emotions from Speech
Jing Han, Zixing Zhang, Maximilian Schmitt et al.
Bidirectional Long-Short Term Memory Network-based Estimation of Reliable Spectral Component Locations
Aaron Nicolson, Kuldip K. Paliwal
Binaural Speech Intelligibility Estimation Using Deep Neural Networks
Kazuhiro Kondo, Kazuya Taira, Yosuke Kobayashi
Biophysically-inspired Features Improve the Generalizability of Neural Network-based Speech Enhancement Systems
Deepak Baby, Sarah Verhulst
BLSTM-CRF Based End-to-End Prosodic Boundary Prediction with Context Sensitive Embeddings in a Text-to-Speech Front-End
Yibin Zheng, Jianhua Tao, Zhengqi Wen et al.
Bone-Conduction Sensor Assisted Noise Estimation for Improved Speech Enhancement
Ching-Hua Lee, Bhaskar D. Rao, Harinath Garudadri
Brain-Computer Interface using Electroencephalogram Signatures of Eye Blinks
Srihari Maruthachalam, Sidharth Aggarwal, Mari Ganesh Kumar et al.
Breathy to Tense Voice Discrimination using Zero-Time Windowing Cepstral Coefficients (ZTWCCs)
Sudarsana Reddy Kadiri, Bayya Yegnanarayana
Bubble Cooperative Networks for Identifying Important Speech Cues
Viet Anh Trinh, Brian McFee, Michael I. Mandel
Building a Unified Code-Switching ASR System for South African Languages
Emre Yılmaz, Astik Biswas, Ewald van der Westhuizen et al.
Building Large-vocabulary Speaker-independent Lipreading Systems
Kwanchiva Thangthai, Richard Harvey
Building State-of-the-art Distant Speech Recognition Using the CHiME-4 Challenge with a Setup of Speech Enhancement Baseline
Szu-Jui Chen, Aswin Shanmugam Subramanian, Hainan Xu et al.
BUT OpenSAT 2017 Speech Recognition System
Martin Karafiát, Murali Karthick Baskar, Igor Szöke et al.
BUT System for DIHARD Speech Diarization Challenge 2018
Mireia Diez, Federico Landini, Lukáš Burget et al.
BUT System for Low Resource Indian Language ASR
Bhargav Pulugundla, Murali Karthick Baskar, Santosh Kesiraju et al.
CACTAS - Collaborative Audio Categorization and Transcription for ASR Systems
Mithul Mathivanan, Kinnera Saranu, Abhishek Pandey et al.
Capsule Networks for Low Resource Spoken Language Understanding
Vincent Renkens, Hugo van Hamme
Captaina: Integrated Pronunciation Practice and Data Collection Portal
Aku Rouhe, Reima Karhila, Aija Elg et al.
Categorical vs Dimensional Perception of Italian Emotional Speech
Emilia Parada-Cabaleiro, Giovanni Costantini, Anton Batliner et al.
Category Similarity in Multilingual Pronunciation Training
Jacques Koreman
Characterizing Rhythm Differences between Strong and Weak Accented L2 Speech
Chris Davis, Jeesun Kim
Character-level Language Modeling with Gated Hierarchical Recurrent Neural Networks
Iksoo Choi, Jinhwan Park, Wonyong Sung