Out of the 1500 cases shared in the Public Training and Development Dataset, 1075 cases have benign tissue or indolent PCa (i.e. their labels should be empty or full of 0s) and 425 cases have csPCa (i.e. their labels should have lesion blobs of value 2, 3, 4 or 5). Out of these 425 positive cases, only 220 cases carry an annotation derived by a human expert. Remaining 205 positive cases have not been annotated. In other words, only 17% (220/1295) of the annotations provided in picai_labels/csPCa_lesion_delineations/human_expert should have csPCa lesion annotations, while the remaining 83% (1075/1295) of annotations should be empty.
For more details, please check out the following page where this has been documented more extensively: https://pi-cai.grand-challenge.org/DATA/
Indeed. During evaluation, PSA (if reported during clinical routine), PSA density (if reported during clinical routine), prostate volume (if reported during clinical routine), patient age (always), MRI scanner manufacturer (always), MRI scanner model name (always) and diffusion b-value of the high b-value DWI/HBV scan (always), will be available to every AI algorithm per validation/testing case.
For the Public Training and Development Dataset, these clinical variables can be found in the marksheet.
Hope this helps.