Papers
Whisper Activity Detection Using CNN-LSTM Based Attention Pooling Network Trained for a Speaker Identification Task
Abinay Reddy Naini, Malla Satyapriya, Prasanta Kumar Ghosh
Whisper Augmented End-to-End/Hybrid Speech Recognition System — CycleGAN Approach
Prithvi R.R. Gudepu, Gowtham P. Vadisetti, Abhishek Niranjan et al.
Whistled Vowel Identification by French Listeners
Anaïs Tran Ngoc, Julien Meyer, Fanny Meunier
Why Did the x-Vector System Miss a Target Speaker? Impact of Acoustic Mismatch Upon Target Score on VoxCeleb Data
Rosa González Hautamäki, Tomi Kinnunen
WISE: Word-Level Interaction-Based Multimodal Fusion for Speech Emotion Recognition
Guang Shen, Riwei Lai, Rui Chen et al.
Word Error Rate Estimation Without ASR Output: e-WER2
Ahmed Ali, Steve Renals
XiaoiceSing: A High-Quality and Integrated Singing Voice Synthesis System
Peiling Lu, Jie Wu, Jian Luan et al.
X-TaSNet: Robust and Accurate Time-Domain Speaker Extraction Network
Zining Zhang, Bingsheng He, Zhenjie Zhang
X-Vector Singular Value Modification and Statistical-Based Decomposition with Ensemble Regression Modeling for Speaker Anonymization System
Candy Olivia Mawalim, Kasorn Galajit, Jessada Karnjana et al.
x-Vectors Meet Adversarial Attacks: Benchmarking Adversarial Robustness in Speaker Verification
Jesús Villalba, Yuekai Zhang, Najim Dehak
A Chinese Dataset for Identifying Speakers in Novels
Jia-Xiang Chen, Zhen-Hua Ling, Li-Rong Dai
A Combination of Model-Based and Feature-Based Strategy for Speech-to-Singing Alignment
Bidisha Sharma, Haizhou Li
A Comparison of Deep Learning Methods for Language Understanding
Mandy Korpusik, Zoe Liu, James Glass
A Comprehensive Study of Speech Separation: Spectrogram vs Waveform Separation
Fahimeh Bahmaninezhad, Jian Wu, Rongzhi Gu et al.
A Computational Model of Early Language Acquisition from Audiovisual Experiences of Young Infants
Okko Räsänen, Khazar Khorrami
A Convolutional Neural Network with Non-Local Module for Speech Enhancement
Xiaoqi Li, Yaxing Li, Meng Li et al.
Acoustic and Articulatory Feature Based Speech Rate Estimation Using a Convolutional Dense Neural Network
Renuka Mannem, Jhansi Mallela, Aravind Illa et al.
Acoustic and Articulatory Study of Ewe Vowels: A Comparative Study of Male and Female
Kowovi Comivi Alowonou, Jianguo Wei, Wenhuan Lu et al.
Acoustic Characteristics of Lexical Tone Disruption in Mandarin Speakers After Brain Damage
Wenjun Chen, Jeroen van de Weijer, Shuangshuang Zhu et al.
Acoustic Correlates of Phonation Type in Chichimec
Anneliese Kelterer, Barbara Schuppler
Acoustic Cues to Topic and Narrow Focus in Egyptian Arabic
Dina El Zarka, Barbara Schuppler, Francesco Cangemi
Acoustic Indicators of Deception in Mandarin Daily Conversations Recorded from an Interactive Game
Chih-Hsiang Huang, Huang-Cheng Chou, Yi-Tong Wu et al.
Acoustic Model Bootstrapping Using Semi-Supervised Learning
Langzhou Chen, Volker Leutnant
Acoustic Model Ensembling Using Effective Data Augmentation for CHiME-5 Challenge
Feng Ma, Li Chai, Jun Du et al.
Acoustic Modeling for Automatic Lyrics-to-Audio Alignment
Chitralekha Gupta, Emre Yılmaz, Haizhou Li