Papers
ERVQA: A Dataset to Benchmark the Readiness of Large Vision Language Models in Hospital Environments
EMNLP 2024
Three Heads Are Better than One: Improving Cross-Domain NER with Progressive Decomposed Network
AAAI 2024
SaSLaW: Dialogue Speech Corpus with Audio-visual Egocentric Information Toward Environment-adaptive Dialogue Speech Synthesis
INTERSPEECH 2024
Zero-Shot End-To-End Spoken Question Answering In Medical Domain
INTERSPEECH 2024