Contextualizing the Limits of Model & Evaluation Dataset Curation on Semantic Similarity Classification Tasks

Daniel Theron

2023 EMNLP EMNLP 2023

Contextualizing the Limits of Model & Evaluation Dataset Curation on Semantic Similarity Classification Tasks

Abstract

AbstractThis paper demonstrates how the limitations of pre-trained models and open evaluation datasets factor into assessing the performance of binary semantic similarity classification tasks. As (1) end-user-facing documentation around the curation of these datasets and pre-trained model training regimes is often not easily accessible and (2) given the lower friction and higher demand to quickly deploy such systems in real-world contexts, our study reinforces prior work showing performance disparities across datasets, embedding techniques and distance metrics, while highlighting the importance of understanding how data is collected, curated and analyzed in semantic similarity classification.

🧭 Keyword Pioneer — performance disparities

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Daniel Theron

Topics

Machine Learning > Optimization & Theory > Statistical Learning Machine Learning > Application Areas > Domain Adaptation

Keywords

binary classification domain adaptation semantic similarity pre-trained model dataset curation performance disparities embedding technique

Download PDF

Related papers

Exploring Linguistic Probes for Morphological Generalization 2023

NameGuess: Column Name Expansion for Tabular Data 2023

Vision-Enhanced Semantic Entity Recognition in Document Images via Visually-Asymmetric Consistency Learning 2023

Improving Conversational Recommendation Systems via Bias Analysis and Language-Model-Enhanced Data Augmentation 2023

On the Calibration of Large Language Models and Alignment 2023