← Learning Types

Machine Learning › Learning Types ›

Evaluation

1654 directly classified papers

Papers per year

Papers

Linguistic Evaluation for the 2021 State-of-the-art Machine Translation Systems for German to English and English to German EMNLP 2021

Common Sense Bias in Semantic Role Labeling EMNLP 2021

Who’s on First?: Probing the Learning and Representation Capabilities of Language Models on Deterministic Closed Domains EMNLP 2021

What Makes a Scientific Paper be Accepted for Publication? EMNLP 2021

CIDEr-R: Robust Consensus-based Image Description Evaluation EMNLP 2021

Probing Language Models for Understanding of Temporal Expressions EMNLP 2021

Differential Evaluation: a Qualitative Analysis of Natural Language Processing System Behavior Based Upon Data Resistance to Processing EMNLP 2021

Characterizing Fairness Over the Set of Good Models Under Selective Labels ICML 2021

Verifiability and Predictability: Interpreting Utilities of Network Architectures for Point Cloud Processing CVPR 2021

Have Fun Storming the Castle(s)! WACV 2021

StoryDB: Broad Multi-language Narrative Dataset EMNLP 2021

When All We Need is a Piece of the Pie: A Generic Framework for Optimizing Two-way Partial AUC ICML 2021

Meta-Cal: Well-controlled Post-hoc Calibration by Ranking ICML 2021

Active Testing: Sample-Efficient Model Evaluation ICML 2021

Marginal Contribution Feature Importance - an Axiomatic Approach for Explaining Data ICML 2021

Are We Summarizing the Right Way? A Survey of Dialogue Summarization Data Sets EMNLP 2021

Alibi Explain: Algorithms for Explaining Machine Learning Models JMLR 2021

Improving Reproducibility in Machine Learning Research(A Report from the NeurIPS 2019 Reproducibility Program) JMLR 2021

The Eval4NLP Shared Task on Explainable Quality Estimation: Overview and Results EMNLP 2021

Testing Cross-Database Semantic Parsers With Canonical Utterances EMNLP 2021

Training Dynamic based data filtering may not work for NLP datasets EMNLP 2021

Enriching ImageNet With Human Similarity Judgments and Psychological Embeddings CVPR 2021

A Targeted Assessment of Incremental Processing in Neural Language Models and Humans ACL 2021

What Ingredients Make for an Effective Crowdsourcing Protocol for Difficult NLU Data Collection Tasks? ACL 2021

Boundary IoU: Improving Object-Centric Image Segmentation Evaluation CVPR 2021