Papers
Contextual Biasing with the Knuth-Morris-Pratt Matching Algorithm
INTERSPEECH 2024
Speech Boosting: Low-Latency Live Speech Enhancement for TWS Earbuds
INTERSPEECH 2024
SAML: Speaker Adaptive Mixture of LoRA Experts for End-to-End ASR
INTERSPEECH 2024
Streaming Audio Transformers for Online Audio Tagging
INTERSPEECH 2024
Efficient Audio Captioning with Encoder-Level Knowledge Distillation
INTERSPEECH 2024
tinyCLAP: Distilling Constrastive Language-Audio Pretrained Models
INTERSPEECH 2024
Speech Prefix-Tuning with RNNT Loss for Improving LLM Predictions
INTERSPEECH 2024
Edged based audio-visual speech enhancement demonstrator
INTERSPEECH 2024
RT-LA-VocE: Real-Time Low-SNR Audio-Visual Speech Enhancement
INTERSPEECH 2024
Leveraging Adapter for Parameter-Efficient ASR Encoder
INTERSPEECH 2024