Papers
LaRA: Large Rank Adaptation for Speech and Text Cross-Modal Learning in Large Language Models
EMNLP 2024
Few-Shot Keyword Spotting from Mixed Speech
INTERSPEECH 2024
TM-PATHVQA: 90000+ Textless Multilingual Questions for Medical Visual Question Answering
INTERSPEECH 2024
Backchannel prediction, based on who, when and what
INTERSPEECH 2024
Lightweight Transducer Based on Frame-Level Criterion
INTERSPEECH 2024
Revisiting Convolution-free Transformer for Speech Recognition
INTERSPEECH 2024
Developing an End-to-End Framework for Predicting the Social Communication Severity Scores of Children with Autism Spectrum Disorder
INTERSPEECH 2024
Dynamic Encoder Size Based on Data-Driven Layer-wise Pruning for Speech Recognition
INTERSPEECH 2024
Quantification of stylistic differences in human- and ASR-produced transcripts of African American English
INTERSPEECH 2024
Pinyin Regularization in Error Correction for Chinese Speech Recognition with Large Language Models
INTERSPEECH 2024
A Dataset and Two-pass System for Reading Miscue Detection
INTERSPEECH 2024
SC-MoE: Switch Conformer Mixture of Experts for Unified Streaming and Non-streaming Code-Switching ASR
INTERSPEECH 2024