Papers
Towards a Speaker Independent Speech-BCI Using Speaker Adaptation
Debadatta Dash, Alan Wisler, Paul Ferrari et al.
Towards Bilingual Lexicon Discovery From Visually Grounded Speech Audio
Emmanuel Azuh, David Harwath, James Glass
Towards Debugging Deep Neural Networks by Generating Speech Utterances
Bilal Soomro, Anssi Kanervisto, Trung Ngo Trong et al.
Towards Detection of Canonical Babbling by Citizen Scientists: Performance as a Function of Clip Length
Amanda Seidl, Anne S. Warlaumont, Alejandrina Cristia
Towards Discriminative Representations and Unbiased Predictions: Class-Specific Angular Softmax for Speech Emotion Recognition
Zhixuan Li, Liang He, Jingyang Li et al.
Towards Generalized Speech Enhancement with Generative Adversarial Networks
Santiago Pascual, Joan Serrà, Antonio Bonafonte
Towards Joint Sound Scene and Polyphonic Sound Event Recognition
Helen L. Bear, Inês Nolasco, Emmanouil Benetos
Towards Language-Universal Mandarin-English Speech Recognition
Shiliang Zhang, Yuan Liu, Ming Lei et al.
Towards Robust Speech Emotion Recognition Using Deep Residual Networks for Speech Enhancement
Andreas Triantafyllopoulos, Gil Keren, Johannes Wagner et al.
Towards the Prosody of Persuasion in Competitive Negotiation. The Relationship Between f0 and Negotiation Success in Same Sex Sales Tasks
Jan Michalsky, Heike Schoormann, Thomas Schultze
Towards the Speech Features of Early-Stage Dementia: Design and Application of the Mandarin Elderly Cognitive Speech Database
Tianqi Wang, Quanlei Yan, Jingshen Pan et al.
Towards the Speech Features of Mild Cognitive Impairment: Universal Evidence from Structured and Unstructured Connected Speech of Chinese
Tianqi Wang, Chongyuan Lian, Jingshen Pan et al.
Towards Universal Dialogue Act Tagging for Task-Oriented Dialogues
Shachi Paul, Rahul Goel, Dilek Hakkani-Tür
Towards Using Context-Dependent Symbols in CTC Without State-Tying Decision Trees
Jan Chorowski, Adrian Łańcucki, Bartosz Kostka et al.
Towards Variability Resistant Dialectal Speech Evaluation
Ahmed Ali, Salam Khalifa, Nizar Habash
Tracking the New Zealand English NEAR/SQUARE Merger Using Functional Principal Components Analysis
Michele Gubian, Jonathan Harrington, Mary Stevens et al.
Trainable Dynamic Subsampling for End-to-End Speech Recognition
Shucong Zhang, Erfan Loweimi, Yumo Xu et al.
Training Multi-Speaker Neural Text-to-Speech Systems Using Speaker-Imbalanced Speech Corpora
Hieu-Thi Luong, Xin Wang, Junichi Yamagishi et al.
Transfer Learning from Audio-Visual Grounding to Speech Recognition
Wei-Ning Hsu, David Harwath, James Glass
Transfer-Representation Learning for Detecting Spoofing Attacks with Converted and Synthesized Speech in Automatic Speaker Verification System
Su-Yu Chang, Kai-Cheng Wu, Chia-Ping Chen
Transformer Based Grapheme-to-Phoneme Conversion
Sevinj Yolchuyeva, Géza Németh, Bálint Gyires-Tóth
Transparent Pronunciation Scoring Using Articulatorily Weighted Phoneme Edit Distance
Reima Karhila, Anna-Riikka Smolander, Sari Ylinen et al.
Turn-Taking Prediction Based on Detection of Transition Relevance Place
Kohei Hara, Koji Inoue, Katsuya Takanashi et al.
Two-Dimensional Convolutional Recurrent Neural Networks for Speech Activity Detection
Anastasios Vafeiadis, Eleftherios Fanioudakis, Ilyas Potamitis et al.
Two-Pass End-to-End Speech Recognition
Tara N. Sainath, Ruoming Pang, David Rybach et al.