Papers
Text-to-speech synthesis using spectral modeling based on non-negative autoencoder
Takeru Gorai, Daisuke Saito, Nobuaki Minematsu
The 1st Clarity Prediction Challenge: A machine learning challenge for hearing aid intelligibility prediction
Jon Barker, Michael Akeroyd, Trevor J. Cox et al.
The CLIPS System for 2022 Spoofing-Aware Speaker Verification Challenge
Jucai Lin, Tingwei Chen, Jingbiao Huang et al.
The discrimination of [zi]-[dʑi] by Japanese listeners and the prospective phonologization of /zi/
Andrea Alicehajic, Silke Hamann
The DKU-OPPO System for the 2022 Spoofing-Aware Speaker Verification Challenge
Xingming Wang, Xiaoyi Qin, Yikang Wang et al.
The Effectiveness of Time Stretching for Enhancing Dysarthric Speech for Improved Dysarthric Speech Recognition
Luke Prananta, Bence Halpern, Siyuan Feng et al.
The effect of backward noise on lexical tone discrimination in Mandarin-speaking amusics
Zixia Fan, Jing Shao, Weigong Pan et al.
The effect of increasing acoustic and linguistic complexity on auditory processing: an EEG study
Fareeha S. Rana, Daniel Pape, Elisabet Service
The Effects of Implicit and Explicit Feedback in an ASR-based Reading Tutor for Dutch First-graders
Yu Bai, Ferdy Hubers, Catia Cucchiarini et al.
The HCCL System for the NIST SRE21
Zhuo Li, Runqiu Xiao, Hangting Chen et al.
The Magnitude and Phase based Speech Representation Learning using Autoencoder for Classifying Speech Emotions using Deep Canonical Correlation Analysis
Ashishkumar Gudmalwar, Biplove Basel, Anirban Dutta et al.
The mapping between syntactic and prosodic phrasing in English and Mandarin
Jianjing Kuang, May Pik Yu Chan, Nari Rhee et al.
The Prosody of Cheering in Sport Events
Marzena Zygis, Sarah Wesolek, Nina Hosseini-Kivanani et al.
The THUEE System Description for the IARPA OpenASR21 Challenge
Jing Zhao, Haoyu Wang, Jinpeng Li et al.
The VoiceMOS Challenge 2022
Wen Chin Huang, Erica Cooper, Yu Tsao et al.
The ZevoMOS entry to VoiceMOS Challenge 2022
Adriana Stan
Three-dimensional finite-difference time-domain acoustic analysis of simplified vocal tract shapes
Debasish Mohapatra, Mario Fleischer, Victor Zappi et al.
Thutmose Tagger: Single-pass neural model for Inverse Text Normalization
Alexandra Antonova, Evelina Bakhturina, Boris Ginsburg
Time-domain Ad-hoc Array Speech Enhancement Using a Triple-path Network
Ashutosh Pandey, Buye Xu, Anurag Kumar et al.
Tiny-Sepformer: A Tiny Time-Domain Transformer Network For Speech Separation
Jian Luo, Jianzong Wang, Ning Cheng et al.
TMGAN-PLC: Audio Packet Loss Concealment using Temporal Memory Generative Adversarial Network
Yuansheng Guan, Guochen Yu, Andong Li et al.
Token-level Speaker Change Detection Using Speaker Difference and Speech Content via Continuous Integrate-and-fire
Zhiyun Fan, Zhenlin Liang, Linhao Dong et al.
Tokenwise Contrastive Pretraining for Finer Speech-to-BERT Alignment in End-to-End Speech-to-Intent Systems
Vishal Sunder, Eric Fosler-Lussier, Samuel Thomas et al.
TopicKS: Topic-driven Knowledge Selection for Knowledge-grounded Dialogue Generation
Shiquan Wang, Yuke Si, Xiao Wei et al.