Papers
Synchronising Speech Segments with Musical Beats in Mandarin and English Singing
Cong Zhang, Jian Zhu
SynthASR: Unlocking Synthetic Data for Speech Recognition
Amin Fazel, Wei Yang, Yulan Liu et al.
Synthesis of Expressive Speaking Styles with Limited Training Data in a Multi-Speaker, Prosody-Controllable Sequence-to-Sequence Architecture
Slava Shechtman, Raul Fernandez, Alexander Sorin et al.
Systems for Low-Resource Speech Recognition Tasks in Open Automatic Speech Recognition and Formosa Speech Recognition Challenges
Hung-Pang Lin, Yu-Jia Zhang, Chia-Ping Chen
T5G2P: Using Text-to-Text Transfer Transformer for Grapheme-to-Phoneme Conversion
Markéta Řezáčková, Jan Švec, Daniel Tihelka
Tackling the ADRESSO Challenge 2021: The MUET-RMIT System for Alzheimer’s Dementia Recognition from Spontaneous Speech
Zafi Sherhan Syed, Muhammad Shehram Shah Syed, Margaret Lech et al.
TacoLPCNet: Fast and Stable TTS by Conditioning LPCNet on Mel Spectrogram Predictions
Cheng Gong, Longbiao Wang, Ju Zhang et al.
Taiwan Min Nan (Taiwanese) Checked Tones Sound Change
Ho-hsien Pan, Shao-ren Lyu
Take a Breath: Respiratory Sounds Improve Recollection in Synthetic Speech
Mikey Elmers, Raphael Werner, Beeke Muhlack et al.
Talk, Don’t Write: A Study of Direct Speech-Based Image Retrieval
Ramon Sanabria, Austin Waters, Jason Baldridge
TalkNet: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis
Stanislav Beliaev, Boris Ginsburg
Targeted and Targetless Neutral Tones in Taiwanese Southern Min
Roger Cheng-yen Liu, Feng-fan Hsieh, Yueh-chin Chang
Targeted Keyword Filtering for Accelerated Spoken Topic Identification
Jonathan Wintrode
Target-Speaker Voice Activity Detection with Improved i-Vector Estimation for Unknown Number of Speaker
Maokui He, Desh Raj, Zili Huang et al.
TDCA-Net: Time-Domain Channel Attention Network for Depression Detection
Cong Cai, Mingyue Niu, Bin Liu et al.
Teacher-Student MixIT for Unsupervised and Semi-Supervised Speech Separation
Jisi Zhang, Cătălin Zorilă, Rama Doddipatla et al.
Teaching Keyword Spotters to Spot New Keywords with Limited Examples
Abhijeet Awasthi, Kevin Kilgour, Hassan Rom
Team02 Text-Independent Speaker Verification System for SdSV Challenge 2021
Woo Hyun Kang, Nam Soo Kim
TeCANet: Temporal-Contextual Attention Network for Environment-Aware Speech Dereverberation
Helin Wang, Bo Wu, Lianwu Chen et al.
Temporal Context in Speech Emotion Recognition
Yangyang Xia, Li-Wei Chen, Alexander Rudnicky et al.
Temporal Convolutional Network with Frequency Dimension Adaptive Attention for Speech Enhancement
Qiquan Zhang, Qi Song, Aaron Nicolson et al.
Testing Acoustic Voice Quality Classification Across Languages and Speech Styles
Bettina Braun, Nicole Dehé, Marieke Einfeldt et al.
Text Anchor Based Metric Learning for Small-Footprint Keyword Spotting
Li Wang, Rongzhi Gu, Nuo Chen et al.
Text Augmentation for Language Models in High Error Recognition Scenario
Karel Beneš, Lukáš Burget