Papers
Exploiting Cross-Domain And Cross-Lingual Ultrasound Tongue Imaging Features For Elderly And Dysarthric Speech Recognition
Shujie Hu, Xurong Xie, Mengzhe Geng et al.
Exploiting Diversity of Automatic Transcripts from Distinct Speech Recognition Techniques for Children’s Speech
Christopher Gebauer, Lars Rumberg, Hanna Ehlert et al.
Exploiting Emotion Information in Speaker Embeddings for Expressive Text-to-Speech
Zein Shaheen, Tasnima Sadekova, Yulia Matveeva et al.
Exploration of Efficient End-to-End ASR using Discretized Input from Self-Supervised Learning
Xuankai Chang, Brian Yan, Yuya Fujita et al.
Exploration on HuBERT with Multiple Resolution
Jiatong Shi, Yun Tang, Hirofumi Inaguma et al.
Exploring a classification approach using quantised articulatory movements for acoustic to articulatory inversion
Jesuraj Bandekar, Sathvik Udupa, Prasanta Kumar Ghosh
Exploring Auditory Attention Decoding using Speaker Features
Zelin Qiu, Jianjun Gu, Dingding Yao et al.
Exploring Downstream Transfer of Self-Supervised Features for Speech Emotion Recognition
Yuanbo Fang, Xiaofen Xing, Xiangmin Xu et al.
Exploring Energy-based Language Models with Different Architectures and Training Methods for Speech Recognition
Hong Liu, Zhaobiao Lv, Zhijian Ou et al.
Exploring Graph Theory Methods For the Analysis of Pronunciation Variation in Spontaneous Speech
Bernhard C. Geiger, Barbara Schuppler
Exploring multi-task learning and data augmentation in dementia detection with self-supervised pretrained models
Minchuan Chen, Chenfeng Miao, Jun Ma et al.
Exploring Sources of Racial Bias in Automatic Speech Recognition through the Lens of Rhythmic Variation
Li-Fang Lai, Nicole Holliday
Exploring the English Accent-independent Features for Speech Emotion Recognition using Filter and Wrapper-based Methods for Feature Selection
Nowshin Tabassum, Tasfia Tabassum, Fardin Saad et al.
Exploring the Impact of Back-End Network on Wav2vec 2.0 for Dialect Identification
Qibao Luo, Ruohua Zhou
Exploring the Impact of Pretrained Models and Web-Scraped Data for the 2022 NIST Language Recognition Evaluation
Tanel Alumäe, Kunnar Kukk, Viet-Bac Le et al.
Exploring the Interactions Between Target Positive and Negative Information for Acoustic Echo Cancellation
Chang Han, Xinmeng Xu, Weiping Tu et al.
Exploring the mutual intelligibility breakdown caused by sculpting speech from a competing speech signal
Martin Cooke, María Luisa García Lecumberri
Expressive Machine Dubbing Through Phrase-level Cross-lingual Prosody Transfer
Jakub Swiatkowski, Duo Wang, Mikolaj Babianski et al.
Expresso: A Benchmark and Analysis of Discrete Expressive Speech Resynthesis
Tu Anh Nguyen, Wei-Ning Hsu, Antony D'Avirro et al.
Extending DNN-based Multiplicative Masking to Deep Subband Filtering for Improved Dereverberation
Jean-Marie Lemercier, Julian Tobergte, Timo Gerkmann
Extremely Low Bit Quantization for Mobile Speaker Verification Systems Under 1MB Memory
Bei Liu, Haoyu Wang, Yanmin Qian
F0inTFS: A lightweight periodicity enhancement strategy for cochlear implants
Huali Zhou, Fanhui Kong, Nengheng Zheng et al.
Factorised Speaker-environment Adaptive Training of Conformer Speech Recognition Systems
Jiajun Deng, Guinan Li, Xurong Xie et al.
FACTSpeech: Speaking a Foreign Language Pronunciation Using Only Your Native Characters
Hong-Sun Yang, Ji-Hoon Kim, Yoon-Cheol Ju et al.
Factual Consistency Oriented Speech Recognition
Naoyuki Kanda, Takuya Yoshioka, Yang Liu