Public Training and Development Dataset: Updates and Fixes ¶
By: anindo on May 13, 2022, 12:05 a.m.
The PI-CAI: Public Training and Development Dataset, consisting of 1500 multi-center, multi-vendor cases, is now online! Learn more about all the considerations that we've made to curate and release the all-new largest public training dataset for prostate cancer detection in MRI: pi-cai.grand-challenge.org/DATA/. Please monitor this thread for all updates and fixes regarding this dataset.
Imaging data has been released via: zenodo.org/record/6624726 (DOI: 10.5281/zenodo.6624726). Annotations have been released and are maintained via: github.com/DIAGNijmegen/picai_labels.
Updates since v1.0:
- Diffusion b-value for all high b-value DWI scans, as present in the DICOM attribute
(0018,9087)
. - File
10121_1000121_t2w.mha
(i.e. T2W imaging for study ID1000121
, under patient ID10121
) was corrupted. - Folder
10403
(including all imaging for study ID1000409
, under patient ID10403
) was missing. - Clinical outcome
lesion_GS
for study1001040
under patient11020
, as stated in the overall clinical information marksheet, was missing. - High b-value DWI scan for study
1000715
under patient10699
, was incorrect. - Clinical variable
patient_age
, as stated in the metadata/header of each MRI scan and the overall clinical information marksheet, was inconsistent or incorrect for nearly half of all training cases. - Intensity values for Philips-based MRI scans were incorrectly rescaled during conversion from DICOM to MHA (see #1766 for more details).
Pending Updates (scheduled to be added in 2-4 weeks):
- AI-derived csPCa lesion delineations for all training cases.
- AI-derived prostate gland delineations for all training cases.
Pending Fixes: None at the moment. Please do not hesitate to let us know if you come across any other issues!