Research Explorer

Zambezi Voice: A Multilingual Speech Corpus for Zambian Languages

Claytone Sikasote, Kalinda Siaminwe, Stanly Mwape et al.

2023 INTERSPEECH

ZeroPrompt: Streaming Acoustic Encoders are Zero-Shot Masked LMs

Xingchen Song, Di Wu, Binbin Zhang et al.

2023 INTERSPEECH

Zero-Shot Accent Conversion using Pseudo Siamese Disentanglement Network

Dongya Jia, Qiao Tian, Kainan Peng et al.

2023 INTERSPEECH

Zero-Shot Automatic Pronunciation Assessment

Hongfu Liu, Mingqian Shi, Ye Wang

2023 INTERSPEECH

ZET-Speech: Zero-shot adaptive Emotion-controllable Text-to-Speech Synthesis with Diffusion and Style-based Models

Minki Kang, Wooseok Han, Sung Ju Hwang et al.

2023 INTERSPEECH

Zoneformer: On-device Neural Beamformer For In-car Multi-zone Speech Separation, Enhancement and Echo Cancellation

Yong Xu, Vinay Kothapally, Meng Yu et al.

2023 INTERSPEECH

4-bit Conformer with Native Quantization Aware Training for Speech Recognition

Shaojin Ding, Phoenix Meadowlark‎, Yanzhang He et al.

2022 INTERSPEECH

A BERT-based Language Modeling Framework

Chin-Yueh Chien, Kuan-Yu Chen

2022 INTERSPEECH

A blueprint for using deepfakes in sociolinguistic matched-guise experiments

Nathan Joel Young, David Britain, Adrian Leemann

2022 INTERSPEECH

Accelerating Inference and Language Model Fusion of Recurrent Neural Network Transducers via End-to-End 4-bit Quantization

Andrea Fasoli, Chia-Yu Chen, Mauricio Serrano et al.

2022 INTERSPEECH

Accent Conversion using Pre-trained Model and Synthesized Data from Voice Conversion

Tuan Nam Nguyen, Ngoc-Quan Pham, Alexander Waibel

2022 INTERSPEECH

Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning

Rui Liu, Berrak Sisman, Björn Schuller et al.

2022 INTERSPEECH

ACNN-VC: Utilizing Adaptive Convolution Neural Network for One-Shot Voice Conversion

Ji Sub Um, Yeunju Choi, Hoi Rin Kim

2022 INTERSPEECH

A compact transformer-based GAN vocoder

Chenfeng Miao, Ting Chen, Minchuan Chen et al.

2022 INTERSPEECH

A Comparative Study on Speaker-attributed Automatic Speech Recognition in Multi-party Meetings

Fan Yu, Zhihao Du, ShiLiang Zhang et al.

2022 INTERSPEECH

A comparative study on vowel articulation in Parkinson's disease and multiple system atrophy

Khalid Daoudi, Biswajit Das, Solange Milhé de Saint Victor et al.

2022 INTERSPEECH

A Complementary Joint Training Approach Using Unpaired Speech and Text A Complementary Joint Training Approach Using Unpaired Speech and Text

Yeqian Du, Jie Zhang, Qiu-shi Zhu et al.

2022 INTERSPEECH

A Conformer-based Waveform-domain Neural Acoustic Echo Canceller Optimized for ASR Accuracy

Sankaran Panchapagesan, Arun Narayanan, Turaj Zakizadeh Shabestary et al.

2022 INTERSPEECH

Acoustic Feature Shuffling Network for Text-independent Speaker Verification

Jin Li, Xin Fang, Fan Chu et al.

2022 INTERSPEECH

Acoustic Modeling for End-to-End Empathetic Dialogue Speech Synthesis Using Linguistic and Prosodic Contexts of Dialogue History

Yuto Nishimura, Yuki Saito, Shinnosuke Takamichi et al.

2022 INTERSPEECH

Acoustic Representation Learning on Breathing and Speech Signals for COVID-19 Detection

Debottam Dutta, Debarpan Bhattacharya, Sriram Ganapathy et al.

2022 INTERSPEECH

Acoustic Stress Detection in Isolated English Words for Computer-Assisted Pronunciation Training

Vera Bernhard, Sandra Schwab, Jean-Philippe Goldman

2022 INTERSPEECH

Acoustic To Articulatory Speech Inversion Using Multi-Resolution Spectro-Temporal Representations Of Speech Signals

Rahil Parikh, Nadee Seneviratne, Ganesh Sivaraman et al.

2022 INTERSPEECH

Acoustic-to-articulatory Speech Inversion with Multi-task Learning

Yashish M. Siriwardena, Ganesh Sivaraman, Carol Espy-Wilson

2022 INTERSPEECH

Acquisition of allophonic variation in second language speech: An acoustic and articulatory study of English laterals by Japanese speakers

Takayuki Nagamine

2022 INTERSPEECH

Papers