Interpretability
7318 directly classified papers
Papers per year
Papers
Fairwashing: the risk of rationalization
ICML 2019
Learning Rules-First Classifiers
AISTATS 2019
Towards Debiasing Fact Verification Models
IJCNLP 2019