Understanding Large Language Model Based Metrics for Text Summarization

Abhishek Pradhan; Ketan Todi

IJCNLP 2023

Abstract

This paper compares the two most widely used techniques for evaluating generative tasks with large language models (LLMs), prompt-based evaluation and log-likelihood evaluation, as part of the Eval4NLP shared task. We focus on the summarization task and evaluate both small and large LLMs. We also study the impact of LLAMA and LLAMA 2 on summarization, using the same set of prompts and techniques for both. All comparisons use the Eval4NLP dataset. The study provides evidence that prompt-based evaluation has advantages over log-likelihood-based evaluation, especially for large models and models with better reasoning power.
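The two families of metric named in the abstract can be contrasted with a minimal sketch. It is an illustration under stated assumptions, not the authors' implementation: the prompt wording, the 0 to 100 scale, the length normalisation, and the names `log_likelihood_score`, `build_prompt`, `prompt_based_score` and `generate` are all hypothetical, and the language model is replaced by a plain callable so the example runs without any model.

```python
import re
from typing import Callable, Optional, Sequence


def log_likelihood_score(token_logprobs: Sequence[float]) -> float:
    """Log-likelihood style metric (assumed form): the mean per-token
    log-probability the model assigns to the summary given the source.
    The log-probabilities are passed in directly here; in practice they
    would come from a model's forward pass."""
    if not token_logprobs:
        raise ValueError("need at least one token log-probability")
    return sum(token_logprobs) / len(token_logprobs)


def build_prompt(source: str, summary: str) -> str:
    """Hypothetical evaluation prompt; the paper's actual prompts are not
    given in the abstract."""
    return (
        "Rate the summary of the source text on a scale from 0 to 100.\n"
        f"Source: {source}\n"
        f"Summary: {summary}\n"
        "Score:"
    )


def prompt_based_score(
    generate: Callable[[str], str], source: str, summary: str
) -> Optional[float]:
    """Prompt-based metric (assumed form): ask the model for a score in
    text and parse the first number in its reply. Returns None when the
    reply contains no number."""
    reply = generate(build_prompt(source, summary))
    match = re.search(r"-?\d+(?:\.\d+)?", reply)
    return float(match.group()) if match else None
```

With a stub in place of the model, `prompt_based_score(lambda prompt: " 85", source, summary)` parses the reply into a number, while `log_likelihood_score` only needs the summary's token log-probabilities. The contrast the sketch is meant to show is that the first approach depends on the model following instructions and producing a parseable answer, whereas the second reads the score directly from the model's probabilities.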

Topics

Natural Language Processing > Resources & Methods > Large Language Models
Artificial Intelligence > Core AI > Large Language Models
Natural Language Processing > Applications > Summarization

Keywords

text summarization, summarization evaluation, prompt-based evaluation, log-likelihood evaluation, large language model
