Research Explorer

MFT-CRN:Multi-scale Fourier Transform for Monaural Speech Enhancement

Yulong Wang, Xueliang Zhang

2023 INTERSPEECH

miniStreamer: Enhancing Small Conformer with Chunked-Context Masking for Streaming ASR Applications on the Edge

Haris Gulzar, Monikka Roslianna Busto, Takeharu Eda et al.

2023 INTERSPEECH

Mispronunciation detection and diagnosis model for tonal language, applied to Vietnamese

Tuong Tu Huu, Viet Thanh Pham, Thi Thu Trang Nguyen et al.

2023 INTERSPEECH

Mitigating Catastrophic Forgetting for Few-Shot Spoken Word Classification Through Meta-Learning

Ruan van der Merwe, Herman Kamper

2023 INTERSPEECH

Mitigating the Exposure Bias in Sentence-Level Grapheme-to-Phoneme (G2P) Transduction

Eunseop Yoon, Hee Suk Yoon, Dhananjaya Gowda et al.

2023 INTERSPEECH

Mix before Align: Towards Zero-shot Cross-lingual Sentiment Analysis via Soft-Mix and Multi-View Learning

Zhihong Zhu, Xuxin Cheng, Dongsheng Chen et al.

2023 INTERSPEECH

MixRep: Hidden Representation Mixup for Low-Resource Speech Recognition

Jiamin Xie, John H. L. Hansen

2023 INTERSPEECH

Mixture Encoder for Joint Speech Separation and Recognition

Simon Berger, Peter Vieting, Christoph Boeddeker et al.

2023 INTERSPEECH

Mixture-of-Expert Conformer for Streaming Multilingual ASR

Ke Hu, Bo Li, Tara Sainath et al.

2023 INTERSPEECH

ML-SUPERB: Multilingual Speech Universal PERformance Benchmark

Jiatong Shi, Dan Berrebbi, William Chen et al.

2023 INTERSPEECH

MMER: Multimodal Multi-task Learning for Speech Emotion Recognition

Sreyan Ghosh, Utkarsh Tyagi, S Ramaneswaran et al.

2023 INTERSPEECH

MMLung: Moving Closer to Practical Lung Health Estimation using Smartphones

Mohammed Mosuily, Lindsay Welch, Jagmohan Chauhan

2023 INTERSPEECH

MMSpeech: Multi-modal Multi-task Encoder-Decoder Pre-training for speech recognition

Xiaohuan Zhou, Jiaming Wang, Zeyu Cui et al.

2023 INTERSPEECH

MOCKS 1.0: Multilingual Open Custom Keyword Spotting Testset

Mikołaj Pudo, Mateusz Wosik, Adam Cieślak et al.

2023 INTERSPEECH

Modality Confidence Aware Training for Robust End-to-End Spoken Language Understanding

Suyoun Kim, Akshat Shrivastava, Duc Le et al.

2023 INTERSPEECH

Model-assisted Lexical Tone Evaluation of three-year-old Chinese-speaking Children by also Considering Segment Production

Shu-Chuan Tseng, Yi-Fen Liu, Xiang-Li Lu

2023 INTERSPEECH

Model Compression for DNN-based Speaker Verification Using Weight Quantization

Jingyu Li, Wei Liu, Zhaoyang Zhang et al.

2023 INTERSPEECH

Modeling Dependent Structure for Utterances in ASR Evaluation

Zhe Liu, Fuchun Peng

2023 INTERSPEECH

Model-Internal Slot-triggered Biasing for Domain Expansion in Neural Transducer ASR Models

Yiting Lu, Philip Harding, Kanthashree Mysore Sathyendra et al.

2023 INTERSPEECH

Modular Domain Adaptation for Conformer-Based Streaming ASR

Qiujia Li, Bo Li, Dongseong Hwang et al.

2023 INTERSPEECH

Modular Speech-to-Text Translation for Zero-Shot Cross-Modal Transfer

Paul-Ambroise Duquenne, Holger Schwenk, Benoît Sagot

2023 INTERSPEECH

Monaural Speech Separation Method Based on Recurrent Attention with Parallel Branches

Xue Yang, Changchun Bao, Xu Zhang et al.

2023 INTERSPEECH

MOSLight: A Lightweight Data-Efficient System for Non-Intrusive Speech Quality Assessment

Zitong Li, Wei Li

2023 INTERSPEECH

MOS vs. AB: Evaluating Text-to-Speech Systems Reliably Using Clustered Standard Errors

Joshua Camp, Tom Kenter, Lev Finkelstein et al.

2023 INTERSPEECH

Motor Control Similarity Between Speakers Saying “A Souk” Using Inverse Atlas Tongue Modeling

Ursa Maity, Fangxu Xing, Jerry Prince et al.

2023 INTERSPEECH

Papers