XML Annotations comparison to PNG Masks ¶
By: arian.arab on Feb. 3, 2022, 4:13 p.m.
Thanks for organizing the challenge. I have a question regarding the annotation of the PNG files compated to the annotations from the XML files.
Firstly, I guess one annotation is missing for "TC" files and 27 annotations for the "TCGA" files. (The tiff image is available but the XML file is missing), please correct me if I am wrong.
Secondly, I have looked at the data both loading the annotations from the XML files and the PNG masks (JSON and PNG mask).
In the XML annotations there are 9 labels corresponding to: "ROI", "Invasive Tumor", "Tumor Associated Stroma", "In-situ Tumor", "Healthy Glands", "Necrosis not in situ", "Inflamed Stroma", "Rest" and “Lymphocytes and plasma cells".
From the PNG masks there are 8 different labels corresponding to: "ROI", "Invasive Tumor", "Tumor Associated Stroma", "In-situ Tumor", "Healthy Glands", "Necrosis not in situ", "Inflamed Stroma", "Rest"
The "Lymphocytes and plasma cells" locations are stored in the JSON file correctly.
However, I noticed that for some TIFF files, when looking at the XML file, some part of the ROI is annotated as "tumor-associated stroma" and this is the only annotation for that specific ROI. However, for the same file when I look at the PNG file, I can see that the annotations are labeled for the “invasive tumor” and “tumor-associated stroma”. It seems like that the ROI box is assigned to the invasive tumor. Why is that and how is it done? How one decides to correspond the not-labeled annotations to different classes? By looking at the XML annotations, it seems like that ROI is the correcponding larger bounding box targeting the region of interest to annotate further.
Are these XML annotations, manually converted into PNG files or is there an algorithm?
The PNG masks have a class with the value of zero, which my guess is that coming from the ROI box.
I am not sure how the PNG masks for the class zero are created, because there is also another class as "Rest".
I have prepared several examples in a PowerPoint file; I could share it if that helps.
Thanks,
Arian