Papers
Evaluating the Vulnerability of End-to-End Automatic Speech Recognition Models to Membership Inference Attacks
Muhammad A. Shah, Joseph Szurley, Markus Mueller et al.
Evaluation of Audio-Visual Alignments in Visually Grounded Speech Models
Khazar Khorrami, Okko Räsänen
Event Specific Attention for Polyphonic Sound Event Detection
Harshavardhan Sundar, Ming Sun, Chao Wang
Excitation Source Feature Based Dialect Identification in Ao — A Low Resource Language
Moakala Tzudir, Shikha Baghel, Priyankoo Sarmah et al.
Explaining Deep Learning Models for Speech Enhancement
Sunit Sivasankaran, Emmanuel Vincent, Dominique Fohr
Explore wav2vec 2.0 for Mispronunciation Detection
Xiaoshuo Xu, Yueteng Kang, Songjun Cao et al.
Exploring Emotional Prototypes in a High Dimensional TTS Latent Space
Pol van Rijn, Silvan Mertes, Dominik Schiller et al.
Exploring Targeted Universal Adversarial Perturbations to End-to-End ASR Models
Zhiyun Lu, Wei Han, Yu Zhang et al.
Exploring the Potential of Lexical Paraphrases for Mitigating Noise-Induced Comprehension Errors
Anupama Chingacham, Vera Demberg, Dietrich Klakow
Exploring wav2vec 2.0 on Speaker Verification and Language Identification
Zhiyun Fan, Meng Li, Shiyu Zhou et al.
Expressive Latvian Speech Synthesis for Dialog Systems
Dāvis Nicmanis, Askars Salimbajevs
Expressive Robot Performance Based on Facial Motion Capture
Jonas Beskow, Charlie Caper, Johan Ehrenfors et al.
Expressive Text-to-Speech Using Style Tag
Minchan Kim, Sung Jun Cheon, Byoung Jin Choi et al.
Extending the Fullband E-Model Towards Background Noise, Bursty Packet Loss, and Conversational Degradations
Thilo Michael, Gabriel Mittag, Andreas Bütow et al.
Extracting Different Levels of Speech Information from EEG Using an LSTM-Based Model
Mohammad Jalilpour Monesi, Bernd Accou, Tom Francart et al.
Extremely Low Footprint End-to-End ASR System for Smart Device
Zhifu Gao, Yiwu Yao, Shiliang Zhang et al.
F0 Patterns of L2 English Speech by Mandarin Chinese Learners
Hongwei Ding, Binghuai Lin, Liyuan Wang
Factorization-Aware Training of Transformers for Natural Language Understanding on the Edge
Hamidreza Saghir, Samridhi Choudhary, Sepehr Eghbali et al.
Fair Voice Biometrics: Impact of Demographic Imbalance on Group Fairness in Speaker Recognition
Gianni Fenu, Mirko Marras, Giacomo Medda et al.
Fake Audio Detection in Resource-Constrained Settings Using Microfeatures
Hira Dhamyal, Ayesha Ali, Ihsan Ayyub Qazi et al.
FANS: Fusing ASR and NLU for On-Device SLU
Martin Radfar, Athanasios Mouchtaris, Siegfried Kunzmann et al.
Far-Field Speaker Localization and Adaptive GLMB Tracking
Shoufeng Lin, Zhaojie Luo
FastICARL: Fast Incremental Classifier and Representation Learning with Efficient Budget Allocation in Audio Sensing Applications
Young D. Kwon, Jagmohan Chauhan, Cecilia Mascolo