Papers
Interpretability of Speech Emotion Recognition modelled using Self-Supervised Speech and Text Pre-Trained Embeddings
INTERSPEECH 2022
Transformer-Based Video Front-Ends for Audio-Visual Speech Recognition for Single and Multi-Person Video
INTERSPEECH 2022
CNN-based Audio Event Recognition for Automated Violence Classification and Rating for Prime Video Content
INTERSPEECH 2022
End-to-End Audio-Visual Neural Speaker Diarization
INTERSPEECH 2022
PM2F2N: Patient Multi-view Multi-modal Feature Fusion Networks for Clinical Outcome Prediction
EMNLP 2022
DocFin: Multimodal Financial Prediction and Bias Mitigation using Semi-structured Documents
EMNLP 2022
Contrastive Learning with Expectation-Maximization for Weakly Supervised Phrase Grounding
EMNLP 2022