Co-occurring keywords
Papers
IndiBias: A Benchmark Dataset to Measure Social Biases in Language Models for Indian Context
NAACL 2024
A Closer Look at Claim Decomposition
NAACL 2024
Evaluating the Rationale Understanding of Critical Reasoning in Logical Reading Comprehension
EMNLP 2023
LeXFiles and LegalLAMA: Facilitating English Multinational Legal Language Model Development
ACL 2023
MedEval: A Multi-Level, Multi-Task, and Multi-Domain Medical Benchmark for Language Model Evaluation
EMNLP 2023
FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation
EMNLP 2023
Is ChatGPT a Financial Expert? Evaluating Language Models on Financial Natural Language Processing
EMNLP 2023