Clinically Significant Prostate Cancer Detection in bpMRI using models trained with Report Guided Annotations

About

Editor:

joeran.bosma

Contact email:

Joeran.Bosma@radboudumc.nl

Image Version:

e9cff96e-f57d-4de4-80ca-d9c8ae26f0e0 — March 23, 2023

Associated publication:

Bosma JS, Saha A, Hosseinzadeh M, Slootweg I, de Rooij M, Huisman H. Semisupervised Learning with Report-guided Pseudo Labels for Deep Learning–based Prostate Cancer Detection Using Biparametric MRI. Radiology: Artificial Intelligence. 2023;5(5).

Summary

This algorithm predicts a heatmap for the likelihood of clinically significant prostate cancer (csPCa) using biparametric MRI (bpMRI). The algorithm ensembles fifteen independent models that were trained jointly on manually and report-guided automatically annotated MRI examinations. The heatmap is resampled to the same spatial resolution and physical dimensions as the input T2W image for easier visualisation.

Mechanism

This algorithm is a deep learning-based detection/diagnosis model, which ensembles 15 independent nnU-Net models (5-fold cross-validation and 3 restarts). To train these models, a total of 6,578 prostate biparametric MRI (bpMRI) scans paired with a manual or report-guided automatic annotation (PI-RADS v2 ≥ 4) were used. For a description of the report-guided automatic annotation procedure, details on the deep learning model and more, see the associated publication:

J. S. Bosma, A. Saha, M. Hosseinzadeh, I. Slootweg, M. de Rooij, and H. Huisman, "Semi-supervised Learning with Report-guided Pseudo Labels for Deep Learning-based Prostate Cancer Detection Using Biparametric MRI", Radiology: Artificial Intelligence, 230031, 2023. doi:10.1148/ryai.230031

Source Code: https://github.com/DIAGNijmegen/Report-Guided-Annotation

To prevent patient overlap with the PI-CAI Hidden Test Cohort and Hidden Validation and Tuning Cohort, this algorithm was re-trained using the RUMC cases from the PI-CAI Public and Private Training Dataset.
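
As a rough illustration of the ensembling described in this section, the sketch below averages the heatmaps of 15 member models (5 folds × 3 restarts) voxel by voxel. Equal-weight averaging is an assumption made here for illustration; the function name `ensemble_heatmaps` and the toy data are hypothetical, and the actual aggregation is described in the associated publication and source code.

```python
from statistics import fmean


def ensemble_heatmaps(heatmaps):
    """Average equally sized, flattened heatmaps voxel by voxel.

    Assumption: every member model contributes with equal weight.
    """
    if not heatmaps:
        raise ValueError("need at least one heatmap")
    n_voxels = len(heatmaps[0])
    if any(len(h) != n_voxels for h in heatmaps):
        raise ValueError("all heatmaps must have the same number of voxels")
    # zip(*heatmaps) groups the predictions of all models for each voxel
    return [fmean(voxel_values) for voxel_values in zip(*heatmaps)]


# 5 folds x 3 restarts = 15 member models (toy three-voxel predictions)
N_FOLDS, N_RESTARTS = 5, 3
member_predictions = [
    [0.0, 0.2 + 0.01 * i, 1.0] for i in range(N_FOLDS * N_RESTARTS)
]
ensemble = ensemble_heatmaps(member_predictions)
```

Real heatmaps are 3D volumes; they are flattened to lists here only to keep the sketch free of dependencies.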

Interfaces

This algorithm implements all of the following input-output combinations:

Combination 1

Inputs:

Transverse T2 Prostate MRI
Slug: `transverse-t2-prostate-mri`
Description: Transverse T2 MRI of the Prostate
Kind: Image
Read from: `/input/images/transverse-t2-prostate-mri/<uuid>.mha` or `/input/images/transverse-t2-prostate-mri/<uuid>.tif`

Transverse HBV Prostate MRI
Slug: `transverse-hbv-prostate-mri`
Description: Transverse High B-Value Prostate MRI
Kind: Image
Read from: `/input/images/transverse-hbv-prostate-mri/<uuid>.mha` or `/input/images/transverse-hbv-prostate-mri/<uuid>.tif`

Transverse ADC Prostate MRI
Slug: `transverse-adc-prostate-mri`
Description: Transverse Apparent Diffusion Coefficient Prostate MRI
Kind: Image
Read from: `/input/images/transverse-adc-prostate-mri/<uuid>.mha` or `/input/images/transverse-adc-prostate-mri/<uuid>.tif`

Outputs:

Transverse Cancer Heatmap Prostate MRI
Slug: `transverse-cancer-heatmap-prostate-mri`
Description: Single-class, probabilistic segmentation of prostate cancer in 3D, where each voxel represents a floating point in range [0,1].
Kind: Heat Map
Write to: `/output/images/transverse-cancer-heatmap-prostate-mri/<uuid>.mha` or `/output/images/transverse-cancer-heatmap-prostate-mri/<uuid>.tif`
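
The interface paths listed above can be resolved with standard-library Python, as in the minimal sketch below. The helper names `find_input_image` and `heatmap_output_path` are hypothetical, and two assumptions are made: each input folder holds exactly one image, and the output file reuses the file name of the T2W input. Neither is stated by the interface description.

```python
from pathlib import Path

INPUT_SLUGS = (
    "transverse-t2-prostate-mri",
    "transverse-hbv-prostate-mri",
    "transverse-adc-prostate-mri",
)
OUTPUT_SLUG = "transverse-cancer-heatmap-prostate-mri"


def find_input_image(input_root: Path, slug: str) -> Path:
    """Return the single .mha or .tif file stored for one input interface."""
    folder = input_root / "images" / slug
    candidates = sorted(
        p for p in folder.glob("*") if p.suffix in {".mha", ".tif"}
    )
    if len(candidates) != 1:
        raise FileNotFoundError(
            f"expected exactly one image in {folder}, found {len(candidates)}"
        )
    return candidates[0]


def heatmap_output_path(output_root: Path, reference: Path) -> Path:
    """Build the heatmap path; reusing the reference file name is an assumption."""
    return output_root / "images" / OUTPUT_SLUG / reference.with_suffix(".mha").name
```

Inside the container this would be called as `find_input_image(Path("/input"), INPUT_SLUGS[0])`, followed by `heatmap_output_path(Path("/output"), t2_path)`.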

Validation and Performance

This algorithm was evaluated on 300 external visits from Ziekenhuisgroep Twente (ZGT), with histopathological ground truth for all patients. Studies are considered positive if they have at least one Gleason grade group ≥ 2 lesion (csPCa). Each model of the ensemble was also evaluated individually; these individual results correspond to the performance reported in the accompanying paper, Bosma et al. (2021).

Patient-based diagnostic performance was evaluated using the Receiver Operating Characteristic (ROC), and summarised to the area under the ROC curve (AUROC). Lesion-based diagnostic performance was evaluated using Free-Response Receiver Operating Characteristic (FROC), and summarised to the partial area under the FROC curve (pAUC) between 0.01 and 2.50 false positives per case.
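
To make the patient-based metric concrete, the sketch below computes the AUROC as the probability that a randomly chosen positive case scores higher than a randomly chosen negative case, with ties counted as one half. This is the standard rank-based definition, not necessarily the evaluation code used for this algorithm; the function name `auroc` and the toy labels and scores are hypothetical. The lesion-based FROC analysis is not covered here.

```python
def auroc(labels, scores):
    """Area under the ROC curve from case-level labels (0/1) and scores.

    Uses the pairwise (Mann-Whitney) formulation: the fraction of
    positive/negative pairs ranked correctly, counting ties as half.
    """
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative case")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in positives
        for n in negatives
    )
    return wins / (len(positives) * len(negatives))


# Toy example: two negative and two positive studies
toy_labels = [0, 0, 1, 1]
toy_scores = [0.1, 0.4, 0.35, 0.8]
toy_auroc = auroc(toy_labels, toy_scores)  # 3 of 4 pairs ranked correctly
```

The pairwise form is quadratic in the number of cases, which is acceptable for a test set of a few hundred visits.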

Metric	This algorithm	Models individually	Saha et al. (2021)*	Hosseinzadeh et al. (2021)**	Radiologists
AUROC	91.2%	90.4 ± 0.6%	84.0%	84.7%	N/A
pAUC	0.768	0.735 ± 0.017	0.670	0.693	N/A
Specificity at sensitivity of radiologists (PI-RADS ≥ 4)	75.0%	69.2 ± 4.9%	39.2%	45.8%	75.9%
Number of training cases	6,578	5,262 ± 17	1,584	1,586	N/A

*: The CAD❋ algorithm proposed in Saha et al. (2021) was used to evaluate all 300 visits from the external test set (ZGT).
**: The algorithm trained with 1,586 cases from Hosseinzadeh et al. (2021) was used to evaluate all 300 visits from the external test set (ZGT).
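
The "specificity at sensitivity of radiologists" row compares operating points. One way to obtain such a number, sketched below under stated assumptions, is to pick the highest score threshold whose sensitivity reaches the target and report the specificity at that threshold. The exact matching procedure used for the table (for example, whether the ROC curve is interpolated) is not given here, so the function `specificity_at_sensitivity` and the toy data are hypothetical.

```python
def specificity_at_sensitivity(labels, scores, target_sensitivity):
    """Return (threshold, specificity) at the first threshold, scanning from
    the highest score down, whose sensitivity reaches target_sensitivity.

    A case is called positive when its score is >= threshold.
    """
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative case")
    for threshold in sorted(set(scores), reverse=True):
        sensitivity = sum(s >= threshold for s in positives) / len(positives)
        if sensitivity >= target_sensitivity:
            specificity = sum(s < threshold for s in negatives) / len(negatives)
            return threshold, specificity
    raise ValueError("target sensitivity cannot be reached")


# Toy example: four negative and two positive studies
toy_labels = [0, 0, 0, 0, 1, 1]
toy_scores = [0.1, 0.2, 0.6, 0.3, 0.5, 0.9]
toy_threshold, toy_specificity = specificity_at_sensitivity(
    toy_labels, toy_scores, target_sensitivity=1.0
)
```

In the toy data, detecting both positive studies requires a threshold of 0.5, which leaves one of the four negative studies above the threshold.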

Uses and Directions

For research use only. This algorithm is intended to be used only on biparametric prostate MRI examinations of patients with raised PSA levels or clinical suspicion of prostate cancer. This algorithm should not be used in different patient demographics.
Benefits: Risk stratification for clinically significant prostate cancer using prostate MRI is instrumental to reduce over-treatment and unnecessary biopsies.
Target population: This algorithm was trained on patients with raised PSA levels or clinical suspicion of prostate cancer, without prior treatment (e.g. radiotherapy, transurethral resection of the prostate (TURP), transurethral ultrasound ablation (TULSA), cryoablation, etc.), without prior positive biopsies, without artefacts and with reasonably-well aligned sequences.
MRI scanner: This algorithm was trained and evaluated exclusively on prostate bpMRI scans acquired with Siemens Healthineers (Skyra/Prisma/Trio/Avanto) MRI scanners with surface coils. It does not account for vendor-neutral properties or domain adaptation and is therefore not compatible with scans acquired on any other MRI scanner or with endorectal coils.
Sequence alignment and position of the prostate: While the input images (T2W, HBV, ADC) can be of different spatial resolutions, the algorithm assumes that they are co-registered or aligned reasonably well and that the prostate gland is localized within a volume of 460 cm³ from the centre coordinate.
General use: This model is intended to be used by radiologists for predicting clinically significant prostate cancer in biparametric MRI examinations. The model is not a diagnostic for cancer and is not meant to guide or drive clinical care. This model is intended to complement other pieces of patient information in order to determine the appropriate follow-up recommendation.
Appropriate decision support: The model identifies lesion X as at a high risk of being malignant. The referring radiologist reviews the prediction along with other clinical information and decides the appropriate follow-up recommendation for the patient.
Before using this model: Test the model retrospectively and prospectively on a diagnostic cohort that reflects the target population on which the model will be used, to confirm its validity in the local setting.
Safety and efficacy evaluation: To be determined in a clinical validation study.

Warnings

Risks: Even if used appropriately, clinicians using this model can misdiagnose cancer. Delays in cancer diagnosis can lead to metastasis and mortality. Patients who are incorrectly treated for cancer can be exposed to risks associated with unnecessary interventions and treatment costs related to follow-ups.
Inappropriate Settings: This model was not trained on MRI examinations of patients with prior treatment (e.g. radiotherapy, transurethral resection of the prostate (TURP), transurethral ultrasound ablation (TULSA), cryoablation, etc.), prior positive biopsies, artefacts or misalignment between sequences. Hence it is susceptible to faulty predictions and unintended behaviour when presented with such cases. Do not use the model in the clinic without further evaluation.
Clinical rationale: The model is not interpretable and does not provide a rationale for high risk scores. Clinical end users are expected to place the model output in context with other clinical information to make the final determination of diagnosis.
Inappropriate decision support: This model may not be accurate outside of the target population. This model is not designed to guide clinical diagnosis and treatment for prostate cancer.
Generalizability: This model was primarily developed with prostate MRI examinations from Radboud University Medical Centre and the Andros Kliniek. Do not use this model in an external setting without further evaluation.
Discontinue use if: Clinical staff raise concerns about the utility of the model for the intended use case, or large, systematic changes occur at the data level that necessitate re-training of the model.

Common Error Messages

Left empty by the Algorithm Editors

Information on this algorithm has been provided by the Algorithm Editors, following the Model Facts labels guidelines from Sendak, M.P., Gao, M., Brajer, N. et al. Presenting machine learning model information to clinical end users with model facts labels. npj Digit. Med. 3, 41 (2020). doi:10.1038/s41746-020-0253-3