Papers
A Neural State-Space Modeling Approach to Efficient Speech Separation
Chen Chen, Chao-Han Huck Yang, Kai Li et al.
A Neural Time Alignment Module for End-to-End Automatic Speech Recognition
Dongcheng Jiang, Chao Zhang, Philip C. Woodland
A Neural TTS System with Parallel Prosody Transfer from Unseen Speakers
Slava Shechtman, Raul Fernandez
A New Benchmark of Aphasia Speech Recognition and Detection Based on E-Branchformer and Multi-task Learning
Jiyang Tang, William Chen, Xuankai Chang et al.
An extension of disentanglement metrics and its application to voice
Olivier Zhang, Olivier Le Blouch, Nicolas Gengembre et al.
An Improved End-to-End Audio-Visual Speech Recognition Model
Sheng Yang, Zheng Gong, Jia Kang
An Information-Theoretic Analysis of Self-supervised Discrete Representations of Speech
Badr M. Abdullah, Mohammed Maqsood Shaik, Bernd Möbius et al.
An Intra-BRNN and GB-RVQ Based END-TO-END Neural Audio Codec
Linping Xu, Jiawei Jiang, Dejun Zhang et al.
An Investigation of Indian Native Language Phonemic Influences on L2 English Pronunciations
Shelly Jain, Priyanshi Pal, Anil Kumar Vuppala et al.
An Investigation of the Combination of Rehearsal and Knowledge Distillation in Continual Learning for Spoken Language Understanding
Umberto Cappellazzo, Daniele Falavigna, Alessio Brutti
Anomalous Sound Detection Based on Sound Separation
Kanta Shimonishi, Kota Dohi, Yohei Kawaguchi
Anomalous Sound Detection Using Self-Attention-Based Frequency Pattern Analysis of Machine Sounds
Hejing Zhang, Jian Guan, Qiaoxi Zhu et al.
A no-reference speech quality assessment method based on neural network with densely connected convolutional architecture
Wuxuan Gong, Jing Wang, Yitong Liu et al.
Another Point of View on Visual Speech Recognition
Baptiste Pouthier, Laurent Pilati, Giacomo Valenti et al.
An Outlier Analysis of Vowel Formants from a Corpus Phonetics Pipeline
Emily P. Ahn, Gina-Anne Levow, Richard A. Wright et al.
A novel frequency warping scale for speech emotion recognition
Premjeet Singh, Goutam Saha
A Novel Interpretable and Generalizable Re-synchronization Model for Cued Speech based on a Multi-Cuer Corpus
Lufei Gao, Shan Huang, Li Liu
A Novel Self-training Approach for Low-resource Speech Recognition
Satwinder Singh, Feng Hou, Ruili Wang
A Parameter-Efficient Learning Approach to Arabic Dialect Identification with Pre-Trained General-Purpose Speech Model
Srijith Radhakrishnan, Chao-Han Huck Yang, Sumeer Ahmad Khan et al.
A Personalised Speech Communication Application for Dysarthric Speakers
Matthew Gibson, Ievgen Karaulov, Oleksii Zhelo et al.
A Pipeline to Evaluate the Effects of Noise on Machine Learning Detection of Laryngeal Cancer
Mary Paterson, James Moor, Luisa Cutillo
Application for Real-time Audio-Visual Speech Enhancement
Mandar Gogate, Kia Dashtipour, Amir Hussain
Application of Knowledge Distillation to Multi-Task Speech Representation Learning
Mine Kerpicci, Van Nguyen, Shuhua Zhang et al.
Approximate Nearest Neighbour Phrase Mining for Contextual Speech Recognition
Maurits Bleeker, Pawel Swietojanski, Stefan Braun et al.
A Preliminary Study on Augmenting Speech Emotion Recognition using a Diffusion Model
Mohammad Ibrahim Malik, Siddique Latif, Raja Jurdak et al.