Research Explorer

Fake the Real: Backdoor Attack on Deep Speech Classification via Voice Conversion

Zhe Ye, Terui Mao, Li Dong et al.

2023 INTERSPEECH

Fast and Efficient Multilingual Self-Supervised Pre-training for Low-Resource Speech Recognition

Zhilong Zhang, Wei Wang, Yanmin Qian

2023 INTERSPEECH

Fast Enrollable Streaming Keyword Spotting System: Training and Inference using a Web Browser

Namhyun Cho, Sunmin Kim, Yoseb Kang et al.

2023 INTERSPEECH

FastFit: Towards Real-Time Iterative Neural Vocoder by Replacing U-Net Encoder With Multiple STFTs

Won Jang, Dan Lim, Heayoung Park

2023 INTERSPEECH

FC-MTLF: A Fine- and Coarse-grained Multi-Task Learning Framework for Cross-Lingual Spoken Language Understanding

Xuxin Cheng, Wanshi Xu, Ziyu Yao et al.

2023 INTERSPEECH

Feature Normalization for Fine-tuning Self-Supervised Models in Speech Enhancement

Hejung Yang, Hong-Goo Kang

2023 INTERSPEECH

Federated Learning for Secure Development of AI Models for Parkinson’s Disease Detection Using Speech from Different Languages

Soroosh Tayebi Arasteh, Cristian David Ríos-Urrego, Elmar Nöth et al.

2023 INTERSPEECH

Federated Learning Toolkit with Voice-based User Verification Demo

Prathamesh Mandke, Rachel Oberst, Matthias Reisser et al.

2023 INTERSPEECH

Few-shot Class-incremental Audio Classification Using Adaptively-refined Prototypes

Wei Xie, Yanxiong Li, Qianhua He et al.

2023 INTERSPEECH

Few-shot Class-incremental Audio Classification Using Stochastic Classifier

Yanxiong Li, Wenchang Cao, Jialong Li et al.

2023 INTERSPEECH

Few-shot Dysarthric Speech Recognition with Text-to-Speech Data Augmentation

Enno Hermann, Mathew Magimai.-Doss

2023 INTERSPEECH

Few-Shot Open-Set Learning for On-Device Customization of KeyWord Spotting Systems

Manuele Rusci, Tinne Tuytelaars

2023 INTERSPEECH

Filling the population statistics gap: Swiss German reference data on F0 and speech tempo for forensic contexts

Hannah Hedegard, Andrea Fröhlich, Fabian Tomaschek et al.

2023 INTERSPEECH

Fine-tuned RoBERTa Model with a CNN-LSTM Network for Conversational Emotion Recognition

Jiachen Luo, Huy Phan, Joshua Reiss

2023 INTERSPEECH

Fine-tuning Audio Spectrogram Transformer with Task-aware Adapters for Sound Event Detection

Kang Li, Yan Song, Ian McLoughlin et al.

2023 INTERSPEECH

First Language Effects on Second Language Perception: Evidence from English Low-vowel Nasal Sequences Perceived by L1 Mandarin Chinese Listeners

Sijia Zhang

2023 INTERSPEECH

FlexiAST: Flexibility is What AST Needs

Jiu Feng, Mehmet Hamza Erol, Joon Son Chung et al.

2023 INTERSPEECH

Flow-VAE VC: End-to-End Flow Framework with Contrastive Loss for Zero-shot Voice Conversion

Le Xu, Rongxiu Zhong, Ying Liu et al.

2023 INTERSPEECH

FN-SSL: Full-Band and Narrow-Band Fusion for Sound Source Localization

Yabo Wang, Bing Yang, Xiaofei Li

2023 INTERSPEECH

Focus-attention-enhanced Crossmodal Transformer with Metric Learning for Multimodal Speech Emotion Recognition

Keulbit Kim, Namhyun Cho

2023 INTERSPEECH

Focus on the Sound around You: Monaural Target Speaker Extraction via Distance and Speaker Information

Jiuxin Lin, Peng Wang, Heinrich Dinkel et al.

2023 INTERSPEECH

FOOCTTS: Generating Arabic Speech with Acoustic Environment for Football Commentator

Massa Baali, Ahmed M. Ali

2023 INTERSPEECH

Fooling Speaker Identification Systems with Adversarial Background Music

Chu-Xiao Zuo, Jia-Yi Leng, Wu-Jun Li

2023 INTERSPEECH

FRA-RIR: Fast Random Approximation of the Image-source Method

Yi Luo, Jianwei Yu

2023 INTERSPEECH

Frequency Patterns of Individual Speaker Characteristics at Higher and Lower Spectral Ranges

Zhao Zhang, Ju Zhang, Ziyu Zhu et al.

2023 INTERSPEECH

Papers