Explainable AI for computational pathology identifies model limitations and tissue biomarkers

ArXiv [Preprint]. 2024 Sep 4:arXiv:2409.03080v1.

Abstract

Deep learning models have shown promise in histopathology image analysis, but their opaque decision-making processes pose challenges in high-risk medical scenarios. Here we introduce HIPPO, an explainable AI method that interrogates attention-based multiple instance learning (ABMIL) models in computational pathology by generating counterfactual examples through tissue patch modifications in whole slide images. Applying HIPPO to ABMIL models trained to detect breast cancer metastasis reveals that they may overlook small tumors and can be misled by non-tumor tissue, while attention maps, though widely used for interpretation, often highlight regions that do not directly influence predictions. By interpreting ABMIL models trained on a prognostic prediction task, HIPPO identified tissue areas with stronger prognostic effects than high-attention regions, which sometimes showed counterintuitive influences on risk scores. These findings demonstrate HIPPO's capacity for comprehensive model evaluation, bias detection, and quantitative hypothesis testing. HIPPO greatly expands the capabilities of explainable AI tools to support the trustworthy and reliable development, deployment, and regulation of weakly supervised models in computational pathology.
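The abstract gives no implementation details, but the core idea of patch-level counterfactuals for an ABMIL model can be sketched as follows. This is a minimal illustration, not the authors' code: the `ABMIL` classifier, the random embeddings, and the `necessity_effect`/`sufficiency_effect` helpers are hypothetical stand-ins, assuming a standard attention-based MIL classifier over precomputed patch embeddings. The counterfactual is simply "rerun the model on a bag with a region deleted (or added) and measure the change in the predicted probability."

```python
import torch
import torch.nn as nn


class ABMIL(nn.Module):
    """Minimal attention-based MIL classifier over a bag of patch embeddings."""

    def __init__(self, dim=512, hidden=128):
        super().__init__()
        self.attn = nn.Sequential(nn.Linear(dim, hidden), nn.Tanh(), nn.Linear(hidden, 1))
        self.head = nn.Linear(dim, 1)

    def forward(self, bag):                          # bag: (n_patches, dim)
        a = torch.softmax(self.attn(bag), dim=0)     # attention weight per patch
        z = (a * bag).sum(dim=0)                     # attention-weighted slide embedding
        return torch.sigmoid(self.head(z)), a.squeeze(-1)


@torch.no_grad()
def necessity_effect(model, bag, region_idx):
    """Counterfactual 'necessity' test: how much does the prediction drop
    when the patches in region_idx are deleted from the bag?"""
    full_prob, _ = model(bag)
    keep = torch.ones(bag.shape[0], dtype=torch.bool)
    keep[region_idx] = False
    ablated_prob, _ = model(bag[keep])
    return (full_prob - ablated_prob).item()


@torch.no_grad()
def sufficiency_effect(model, context_bag, region_patches):
    """Counterfactual 'sufficiency' test: how much does the prediction rise
    when the region's patches are added to a (e.g., tumor-free) context bag?"""
    base_prob, _ = model(context_bag)
    aug_prob, _ = model(torch.cat([context_bag, region_patches], dim=0))
    return (aug_prob - base_prob).item()


# Toy usage with random embeddings standing in for real patch features:
# test whether the 50 highest-attention patches actually drive the prediction.
model = ABMIL()
bag = torch.randn(1000, 512)                  # one slide = bag of 1000 patch embeddings
high_attn = model(bag)[1].topk(50).indices    # indices of highest-attention patches
print("necessity of high-attention region:", necessity_effect(model, bag, high_attn))
```

Under this framing, a small necessity effect for the high-attention region would mirror the abstract's observation that highly attended tissue does not necessarily influence the model's prediction.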

Publication types

  • Preprint