Papers
Not All Classes Stand on Same Embeddings: Calibrating a Semantic Distance with Metric Tensor
CVPR 2024
MAT-SED: A Masked Audio Transformer with Masked-Reconstruction Based Pre-training for Sound Event Detection
INTERSPEECH 2024
Progressive Exploration-Conformal Learning for Sparsely Annotated Object Detection in Aerial Images
NeurIPS 2024
Robust Laughter Segmentation with Automatic Diverse Data Synthesis
INTERSPEECH 2024