Co-occurring keywords
Papers
Few-Shot Keyword Spotting from Mixed Speech
INTERSPEECH 2024
Self-supervised speech representations display some human-like cross-linguistic perceptual abilities
EMNLP 2024
Lightweight Transducer Based on Frame-Level Criterion
INTERSPEECH 2024
TM-PATHVQA: 90000+ Textless Multilingual Questions for Medical Visual Question Answering
INTERSPEECH 2024
OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer
INTERSPEECH 2024
Backchannel prediction, based on who, when and what
INTERSPEECH 2024
Phonological-Level Mispronunciation Detection and Diagnosis
INTERSPEECH 2024
Developing an End-to-End Framework for Predicting the Social Communication Severity Scores of Children with Autism Spectrum Disorder
INTERSPEECH 2024
Incorporating Class-based Language Model for Named Entity Recognition in Factorized Neural Transducer
INTERSPEECH 2024
Pinyin Regularization in Error Correction for Chinese Speech Recognition with Large Language Models
INTERSPEECH 2024
A Cluster-based Personalized Federated Learning Strategy for End-to-End ASR of Dementia Patients
INTERSPEECH 2024