Papers
Speaker Diarization System Based on DPCA Algorithm for Fearless Steps Challenge Phase-2
Xueshuai Zhang, Wenchao Wang, Pengyuan Zhang
Speaker Discrimination in Humans and Machines: Effects of Speaking Style Variability
Amber Afshan, Jody Kreiman, Abeer Alwan
Speaker Identification for Household Scenarios with Self-Attention and Adversarial Training
Ruirui Li, Jyun-Yu Jiang, Xian Wu et al.
Speaker-Independent Mel-Cepstrum Estimation from Articulator Movements Using D-Vector Input
Kouichi Katsurada, Korin Richmond
Speaker Re-Identification with Speaker Dependent Speech Enhancement
Yanpei Shi, Qiang Huang, Thomas Hain
Speaker Representation Learning Using Global Context Guided Channel and Time-Frequency Transformations
Wei Xia, John H.L. Hansen
Speaker-Utterance Dual Attention for Speaker and Utterance Verification
Tianchi Liu, Rohan Kumar Das, Maulik Madhavi et al.
Speaking Speed Control of End-to-End Speech Synthesis Using Sentence-Level Conditioning
Jae-Sung Bae, Hanbin Bae, Young-Sun Joo et al.
SpecMark: A Spectral Watermarking Framework for IP Protection of Speech Recognition Systems
Huili Chen, Bita Darvish, Farinaz Koushanfar
SpecSwap: A Simple Data Augmentation Method for End-to-End Speech Recognition
Xingchen Song, Zhiyong Wu, Yiheng Huang et al.
Spectral Moment and Duration of Burst of Plosives in Speech of Children with Hearing Impairment and Typically Developing Children — A Comparative Study
Ajish K. Abraham, M. Pushpavathi, N. Sreedevi et al.
SpeechBERT: An Audio-and-Text Jointly Learned Language Model for End-to-End Spoken Question Answering
Yung-Sung Chuang, Chi-Liang Liu, Hung-yi Lee et al.
Speech Clarity Improvement by Vocal Self-Training Using a Hearing Impairment Simulator and its Correlation with an Auditory Modulation Index
Toshio Irino, Soichi Higashiyama, Hanako Yoshigi
Speech Driven Talking Head Generation via Attentional Landmarks Based Representation
Wentao Wang, Yan Wang, Jianqing Sun et al.
Speech Emotion Recognition ‘in the Wild’ Using an Autoencoder
Vipula Dissanayake, Haimo Zhang, Mark Billinghurst et al.
Speech Emotion Recognition with Discriminative Feature Learning
Huan Zhou, Kai Liu
Speech Enhancement Based on Beamforming and Post-Filtering by Combining Phase Information
Rui Cheng, Changchun Bao
Speech Enhancement with Stochastic Temporal Convolutional Networks
Julius Richter, Guillaume Carbajal, Timo Gerkmann
SpeechMix — Augmenting Deep Sound Recognition Using Hidden Space Interpolations
Amit Jindal, Narayanan Elavathur Ranganatha, Aniket Didolkar et al.
Speech Pseudonymisation Assessment Using Voice Similarity Matrices
Paul-Gauthier Noé, Jean-François Bonastre, Driss Matrouf et al.
Speech Rate Task-Specific Representation Learning from Acoustic-Articulatory Data
Renuka Mannem, Hima Jyothi R., Aravind Illa et al.
Speech Recognition and Multi-Speaker Diarization of Long Conversations
Huanru Henry Mao, Shuyang Li, Julian McAuley et al.