Deep learning to estimate pulmonary nodule malignancy risk using a current and a prior CT image
About
- Nodule Locations (Locations of pulmonary nodules)
- CT Image (Any CT image)
- Prior CT image (A CT image from a prior study)
- Prior Nodule Locations (Locations of pulmonary nodules on a prior CT image)
- Lung nodule malignancy risk (Malignancy risk of lung nodule detected in a chest CT ranging between 0 to 1 with 0 being the lowest malignancy risk and 1 being the highest malignancy risk)
Model Facts
Summary
This is a deep learning (DL) algorithm based on convolutional neural networks (CNNs) that accepts a current and a prior low-dose chest CT, the coordinates of a pulmonary nodule, and the time difference (delta) between the CT examinations as input to estimate the malignancy risk of the nodule. This algorithm was developed with pulmonary nodules from the National Lung Screening Trial (NLST).
Performance icons created by Freepik - Flaticon
Mechanism
Outcome | risk of malignancy in the pulmonary nodule(s) |
Output | 0 - 1 with 3 decimal values indicating the malignancy risk in the pulmonary nodule(s) |
Target population | high-risk population eligible for lung cancer screening |
Input data source | a current and a prior low-dose chest CT examination, the coordinates of pulmonary nodules, and the time difference between the CT examinations |
Training data location and time-period | 33 centers in US participating in NLST. CT examinations were acquired between 2002 – 2004 |
Reference standard | histopathological confirmation for lung cancers |
Model type | Convolutional neural network |
Usage instructions¶
Assuming the following information about the Patient and the Pulmonary Nodule in question:
PatientID | 1234 | |||
LesionID | 1 | |||
Study Date for current CT examination (DD-MM-YYYY) | 01-02-2006 | |||
Study Date for prior CT examination | 01-02-2005 | |||
World coordinate of pulmonary nodule at current CT (X, Y, Z) | (96.4, 101.06, 1597) | |||
World coordinate of pulmonary nodule at prior CT | (82.38, 96.38, 1514.15) |
Then generate the following JSON files as described below.
First JSON input, nodule-locations.json
, which should contain the following:
{
"type": "Multiple points",
"points": [
{
"point": [
96.41,
101.06,
1597
],
"name": "1234_1_20060102"
}
],
"version": {
"major": 1,
"minor": 0
}
}
Second JSON input, prior-nodule-locations.json
, which should contain the following:
{
"type": "Multiple points",
"points": [
{
"point": [
82.38,
96.38,
1514.15
],
"name": "1234_1_20050102"
}
],
"version": {
"major": 1,
"minor": 0
}
}
These two JSONs, along with the CT examinations, should be uploaded inside Try-out Algorithm.
Validation and Performance
Uses and Directions
For research use only. The algorithm is intended to be used only on pulmonary nodules detected at low-dose chest CT examinations of high-risk participants eligible for lung cancer screening programs.
Benefits: Early detection of lung cancer may improve patient prognosis and potentially reduce treatment costs.
Target population and use case: High-risk population participating in lung cancer screening undergo annual low-dose CT scans. Radiologists reading chest CT scans will have to provide appropriate follow-up recommendations based on their assessment of pulmonary nodules.
General use: This model is intended to be used by radiologists for predicting the malignancy in pulmonary nodules detected at screening CT. The model is not a diagnostic for cancer and is not meant to guide or drive clinical care. This model is intended to complement other pieces of patient information in order to determine the appropriate follow-up recommendation.
Appropriate decision support: The model identifies pulmonary nodule X as at a high risk of being malignant. The referring radiologist reviews the prediction along with other clinical information and decides the appropriate follow-up recommendation for the patient.
Before using this model: Test the model retrospectively and prospectively on a diagnostic cohort that reflects the target population that the model will be used upon to confirm the validity of the model within a local setting.
Safety and efficacy evaluation: TBD
Warnings
Risks: Even if used appropriately, clinicians using this model can misdiagnose cancer. Delays in cancer diagnosis can lead to metastasis and mortality. Patients who are incorrectly treated for cancer can be exposed to risks associated with unnecessary interventions and treatment costs related to follow-ups.
Inappropriate Settings: This model was not trained on CT examinations of patients from a clinical or diagnostic setting. Do not use the model in the clinic without further evaluation. This model was trained to differentiate malignant pulmonary nodules from benign nodules in a high-risk population undergoing lung cancer screening with low-dose CT examinations. Do not use this model elsewhere.
Clinical rationale: The model is not interpretable and does not provide a rationale for high-risk scores. Clinical end users are expected to place the model output in context with other clinical information to make the final determination of diagnosis.
Inappropriate decision support: This model may not be accurate outside of the target population and was developed on a high-risk population undergoing lung cancer screening. This model is not a diagnostic and is not designed to guide clinical diagnosis and treatment for malignant pulmonary nodules.
Generalizability: This model was primarily developed with pulmonary nodules from the National Lung Screening Trial and validated with pulmonary nodules from the Danish Lung Cancer Screening Trial and the Multicentric Italian Lung Detection Trial. Do not use this model in an external setting without further evaluation.
Discontinue use if: Clinical staff raise concerns about the utility of the model for the intended use case or large, systematic changes occur at the data level that necessitates re-training of the model.
Common Error Messages
Information on this algorithm has been provided by the Algorithm Editors, following the Model Facts labels guidelines from Sendak, M.P., Gao, M., Brajer, N. et al. Presenting machine learning model information to clinical end users with model facts labels. npj Digit. Med. 3, 41 (2020). 10.1038/s41746-020-0253-3