Research Explorer

A Neural State-Space Modeling Approach to Efficient Speech Separation

Chen Chen, Chao-Han Huck Yang, Kai Li et al.

2023 INTERSPEECH

A Neural Time Alignment Module for End-to-End Automatic Speech Recognition

Dongcheng Jiang, Chao Zhang, Philip C. Woodland

2023 INTERSPEECH

A Neural TTS System with Parallel Prosody Transfer from Unseen Speakers

Slava Shechtman, Raul Fernandez

2023 INTERSPEECH

A New Benchmark of Aphasia Speech Recognition and Detection Based on E-Branchformer and Multi-task Learning

Jiyang Tang, William Chen, Xuankai Chang et al.

2023 INTERSPEECH

An extension of disentanglement metrics and its application to voice

Olivier Zhang, Olivier Le Blouch, Nicolas Gengembre et al.

2023 INTERSPEECH

An Improved End-to-End Audio-Visual Speech Recognition Model

Sheng Yang, Zheng Gong, Jia Kang

2023 INTERSPEECH

An Information-Theoretic Analysis of Self-supervised Discrete Representations of Speech

Badr M. Abdullah, Mohammed Maqsood Shaik, Bernd Möbius et al.

2023 INTERSPEECH

An Intra-BRNN and GB-RVQ Based END-TO-END Neural Audio Codec

Linping Xu, Jiawei Jiang, Dejun Zhang et al.

2023 INTERSPEECH

An Investigation of Indian Native Language Phonemic Influences on L2 English Pronunciations

Shelly Jain, Priyanshi Pal, Anil Kumar Vuppala et al.

2023 INTERSPEECH

An Investigation of the Combination of Rehearsal and Knowledge Distillation in Continual Learning for Spoken Language Understanding

Umberto Cappellazzo, Daniele Falavigna, Alessio Brutti

2023 INTERSPEECH

Anomalous Sound Detection Based on Sound Separation

Kanta Shimonishi, Kota Dohi, Yohei Kawaguchi

2023 INTERSPEECH

Anomalous Sound Detection Using Self-Attention-Based Frequency Pattern Analysis of Machine Sounds

Hejing Zhang, Jian Guan, Qiaoxi Zhu et al.

2023 INTERSPEECH

A no-reference speech quality assessment method based on neural network with densely connected convolutional architecture

Wuxuan Gong, Jing Wang, Yitong Liu et al.

2023 INTERSPEECH

Another Point of View on Visual Speech Recognition

Baptiste Pouthier, Laurent Pilati, Giacomo Valenti et al.

2023 INTERSPEECH

An Outlier Analysis of Vowel Formants from a Corpus Phonetics Pipeline

Emily P. Ahn, Gina-Anne Levow, Richard A. Wright et al.

2023 INTERSPEECH

A novel frequency warping scale for speech emotion recognition

Premjeet Singh, Goutam Saha

2023 INTERSPEECH

A Novel Interpretable and Generalizable Re-synchronization Model for Cued Speech based on a Multi-Cuer Corpus

Lufei Gao, Shan Huang, Li Liu

2023 INTERSPEECH

A Novel Self-training Approach for Low-resource Speech Recognition

Satwinder Singh, Feng Hou, Ruili Wang

2023 INTERSPEECH

A Parameter-Efficient Learning Approach to Arabic Dialect Identification with Pre-Trained General-Purpose Speech Model

Srijith Radhakrishnan, Chao-Han Huck Yang, Sumeer Ahmad Khan et al.

2023 INTERSPEECH

A Personalised Speech Communication Application for Dysarthric Speakers

Matthew Gibson, Ievgen Karaulov, Oleksii Zhelo et al.

2023 INTERSPEECH

A Pipeline to Evaluate the Effects of Noise on Machine Learning Detection of Laryngeal Cancer

Mary Paterson, James Moor, Luisa Cutillo

2023 INTERSPEECH

Application for Real-time Audio-Visual Speech Enhancement

Mandar Gogate, Kia Dashtipour, Amir Hussain

2023 INTERSPEECH

Application of Knowledge Distillation to Multi-Task Speech Representation Learning

Mine Kerpicci, Van Nguyen, Shuhua Zhang et al.

2023 INTERSPEECH

Approximate Nearest Neighbour Phrase Mining for Contextual Speech Recognition

Maurits Bleeker, Pawel Swietojanski, Stefan Braun et al.

2023 INTERSPEECH

A Preliminary Study on Augmenting Speech Emotion Recognition using a Diffusion Model

Mohammad Ibrahim Malik, Siddique Latif, Raja Jurdak et al.

2023 INTERSPEECH

Papers