Research Explorer

Adapting Multi-Lingual ASR Models for Handling Multiple Talkers

Chenda Li, Yao Qian, Zhuo Chen et al.

2023 INTERSPEECH

Adaptive Contextual Biasing for Transducer Based Streaming Speech Recognition

Tianyi Xu, Zhanheng Yang, Kaixun Huang et al.

2023 INTERSPEECH

Adaptive Neural Network Quantization For Lightweight Speaker Verification

Haoyu Wang, Bei Liu, Yifei Wu et al.

2023 INTERSPEECH

Addressing Cold Start Problem for End-to-end Automatic Speech Scoring

Jungbae Park, Seungtaek Choi

2023 INTERSPEECH

AD-TUNING: An Adaptive CHILD-TUNING Approach to Efficient Hyperparameter Optimization of Child Networks for Speech Processing Tasks in the SUPERB Benchmark

Gaobin Yang, Jun Du, Maokui He et al.

2023 INTERSPEECH

A Dual Attention-based Modality-Collaborative Fusion Network for Emotion Recognition

Xiaoheng Zhang, Yang Li

2023 INTERSPEECH

Advanced RawNet2 with Attention-based Channel Masking for Synthetic Speech Detection

Jing Li, Yanhua Long, Yijie Li et al.

2023 INTERSPEECH

Advances in Language Recognition in Low Resource African Languages: The JHU-MIT Submission for NIST LRE22

Jesús Villalba, Jonas Borgstrom, Maliha Jahan et al.

2023 INTERSPEECH

Adversarial Diffusion Probability Model For Cross-domain Speaker Verification Integrating Contrastive Loss

Xinmei Su, Xiang Xie, Fengrun Zhang et al.

2023 INTERSPEECH

Adversarial Learning of Intermediate Acoustic Feature for End-to-End Lightweight Text-to-Speech

Hyungchan Yoon, Seyun Um, Changhwan Kim et al.

2023 INTERSPEECH

Affective attributes of French caregivers' professional speech

Jean-Luc Rouas, Yaru Wu, Takaaki Shochi

2023 INTERSPEECH

AfriNames: Most ASR Models "Butcher" African Names

Tobi Olatunji, Tejumade Afonja, Bonaventure F. P. Dossou et al.

2023 INTERSPEECH

A GAN Speech Inpainting Model for Audio Editing Software

Haixin Zhao

2023 INTERSPEECH

A Generative Framework for Conversational Laughter: Its 'Language Model' and Laughter Sound Synthesis

Hiroki Mori, Shunya Kimura

2023 INTERSPEECH

A Hierarchical Context-aware Modeling Approach for Multi-aspect and Multi-granular Pronunciation Assessment

Fu-An Chao, Tien-Hong Lo, Tzu-I Wu et al.

2023 INTERSPEECH

A Joint Model for Pronunciation Assessment and Mispronunciation Detection and Diagnosis with Multi-task Learning

Hyungshin Ryu, Sunhee Kim, Minhwa Chung

2023 INTERSPEECH

A Lexical-aware Non-autoregressive Transformer-based ASR Model

Chong-En Lin, Kuan-Yu Chen

2023 INTERSPEECH

AlignAtt: Using Attention-based Audio-Translation Alignments as a Guide for Simultaneous Speech Translation

Sara Papi, Marco Turchi, Matteo Negri

2023 INTERSPEECH

Aligning Speech Enhancement for Improving Downstream Classification Performance

Yan Xiong, Visar Berisha, Chaitali Chakrabarti

2023 INTERSPEECH

Alignment of Beat Gestures and Prosodic Prominence in German

Sophie Repp, Lara Muhtz, Johannes Heim

2023 INTERSPEECH

Allophant: Cross-lingual Phoneme Recognition with Articulatory Attributes

Kevin Glocker, Aaricia Herygers, Munir Georges

2023 INTERSPEECH

ALO-VC: Any-to-any Low-latency One-shot Voice Conversion

Bohan Wang, Damien Ronssin, Milos Cernak

2023 INTERSPEECH

A Low-Resource Pipeline for Text-to-Speech from Found Data With Application to Scottish Gaelic

Dan Wells, Korin Richmond, William Lamb

2023 INTERSPEECH

Alzheimer Disease Classification through ASR-based Transcriptions: Exploring the Impact of Punctuation and Pauses

Lucía Gómez-Zaragozá, Simone Wills, Cristian Tejedor-Garcia et al.

2023 INTERSPEECH

A Mask Free Neural Network for Monaural Speech Enhancement

Liang Liu, Haixin Guan, Jinlong Ma et al.

2023 INTERSPEECH
