SoftCAM: Making black box models self-explainable for medical image analysis

Kerol Djoumessi; Philipp Berens

2026 MIDL MIDL 2026

SoftCAM: Making black box models self-explainable for medical image analysis

Abstract

Convolutional neural networks (CNNs) are widely used for high-stakes applications like medicine, often surpassing human performance. However, most explanation methods rely on post-hoc attribution, approximating the decision-making process of already trained black-box models. These methods are often sensitive, unreliable, and fail to reflect true model reasoning, limiting their trustworthiness in critical applications. In this work, we introduce SoftCAM, a straightforward yet effective approach that makes standard CNN architectures inherently interpretable. By removing the global average pooling layer and replacing the fully connected classification layer with a convolution-based class evidence layer, SoftCAM preserves spatial information and produces explicit class activation maps that form the basis of the model’s predictions. Evaluated on three medical datasets spanning three imaging modalities, SoftCAM maintains classification performance while significantly improving both the qualitative and quantitative explanation compared to existing post-hoc methods.

Authors

Kerol Djoumessi , Philipp Berens

Topics

Healthcare & Medicine > Clinical > Medical Imaging Deep Learning > Optimization & Theory > Interpretability Artificial Intelligence > Core AI > Explainability

Keywords

medical image analysis convolutional neural network post-hoc explanation class activation map self-explainable model

Download PDF

Related papers

OxEnsemble: Fair Ensembles for Low-Data Classification 2026

BETA: Resting-state fMRI Biotypes for tDCS Efficacy in Anxiety Among Older Adults At Risk For Alzheimer’s Disease 2026

Guideline-Informed MLLM Reasoning for Pathology-Aware Postoperative Prostate CTV Segmentation 2026

Scalable Detection of Undiagnosed ILD in Population Screening: A Multi-Cohort Study using 3D Foundation Models 2026

DIST-CLIP: Arbitrary Metadata and Image Guided MRI Harmonization via Disentangled Anatomy-Contrast Representations 2026