Papers
Fine-tuning Audio Spectrogram Transformer with Task-aware Adapters for Sound Event Detection
INTERSPEECH 2023
Adapting Multi-Lingual ASR Models for Handling Multiple Talkers
INTERSPEECH 2023
Fine-tuned RoBERTa Model with a CNN-LSTM Network for Conversational Emotion Recognition
INTERSPEECH 2023
Learning 3D Representations From 2D Pre-Trained Models via Image-to-Point Masked Autoencoders
CVPR 2023