Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Learning Types
Machine Learning
›
Learning Types
›
Evaluation
1654 directly classified papers
Papers per year
2005: 1
2006: 1
2007: 1
2008: 2
2009: 1
2010: 3
2011: 2
2012: 3
2013: 5
2014: 4
2015: 4
2016: 11
2017: 19
2018: 32
2019: 39
2020: 72
2021: 110
2022: 202
2023: 222
2024: 351
2025: 569
Papers
Linguistic Evaluation for the 2021 State-of-the-art Machine Translation Systems for German to English and English to German
EMNLP 2021
Common Sense Bias in Semantic Role Labeling
EMNLP 2021
Who’s on First?: Probing the Learning and Representation Capabilities of Language Models on Deterministic Closed Domains
EMNLP 2021
What Makes a Scientific Paper be Accepted for Publication?
EMNLP 2021
CIDEr-R: Robust Consensus-based Image Description Evaluation
EMNLP 2021
Probing Language Models for Understanding of Temporal Expressions
EMNLP 2021
Differential Evaluation: a Qualitative Analysis of Natural Language Processing System Behavior Based Upon Data Resistance to Processing
EMNLP 2021
Characterizing Fairness Over the Set of Good Models Under Selective Labels
ICML 2021
Verifiability and Predictability: Interpreting Utilities of Network Architectures for Point Cloud Processing
CVPR 2021
Have Fun Storming the Castle(s)!
WACV 2021
StoryDB: Broad Multi-language Narrative Dataset
EMNLP 2021
When All We Need is a Piece of the Pie: A Generic Framework for Optimizing Two-way Partial AUC
ICML 2021
Meta-Cal: Well-controlled Post-hoc Calibration by Ranking
ICML 2021
Active Testing: Sample-Efficient Model Evaluation
ICML 2021
Marginal Contribution Feature Importance - an Axiomatic Approach for Explaining Data
ICML 2021
Are We Summarizing the Right Way? A Survey of Dialogue Summarization Data Sets
EMNLP 2021
Alibi Explain: Algorithms for Explaining Machine Learning Models
JMLR 2021
Improving Reproducibility in Machine Learning Research(A Report from the NeurIPS 2019 Reproducibility Program)
JMLR 2021
The Eval4NLP Shared Task on Explainable Quality Estimation: Overview and Results
EMNLP 2021
Testing Cross-Database Semantic Parsers With Canonical Utterances
EMNLP 2021
Training Dynamic based data filtering may not work for NLP datasets
EMNLP 2021
Enriching ImageNet With Human Similarity Judgments and Psychological Embeddings
CVPR 2021
A Targeted Assessment of Incremental Processing in Neural Language Models and Humans
ACL 2021
What Ingredients Make for an Effective Crowdsourcing Protocol for Difficult NLU Data Collection Tasks?
ACL 2021
Boundary IoU: Improving Object-Centric Image Segmentation Evaluation
CVPR 2021
<
1
…
57
58
59
…
67
>