Problems such as inconsistencies or errors in the training data
By: hebingdou on Nov. 1, 2024, 3:46 p.m.
1. The XML annotations of the ROI regions deviate heavily from the tissue-mask regions. Visualization shows the error in at least the following images: ['D_P000012_mask.tif', 'D_P000006_mask.tif', 'D_P000015_mask.tif', 'D_P000014_mask.tif', 'D_P000013_mask.tif', 'D_P000018_mask.tif', 'D_P000003_mask.tif', 'D_P000011_mask.tif', 'C_P000038_mask.tif', 'C_P000030_mask.tif', 'D_P000010_mask.tif', 'D_P000009_mask.tif', 'D_P000019_mask.tif', 'D_P000016_mask.tif', 'D_P000017_mask.tif'].
2. Two different cell-type labels are assigned to the same cell, for example in A_P000006_PAS_CPG.tif. This error occurs quite often, but checking for it manually is very time consuming. It could be tracked down with code, but given the time constraints I would like to hand the issue over to you for fixing.
3. Some labeled cell sites fall outside the ROI region.
4. The number of labeled cells in the JSON and the number of labels in the XML are grossly unequal, for example in ['B_P000005', 'B_P000010'].
5. The label XY positions in the XML deviate seriously from the XY positions in the JSON, so evaluating against the JSON file gives a score of almost 0, for example in ['A_P000033', 'A_P000022'].
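Items 2 to 5 could in principle be checked automatically. The following is only a rough sketch, assuming the XML and JSON annotations have already been parsed into lists of (x, y, label) tuples and the ROIs into lists of (x, y) vertices. The function names, the tuple layout, and the `radius` threshold are hypothetical placeholders, not part of the challenge tooling.

```python
import math


def count_mismatch(xml_points, json_points):
    """Return (n_xml, n_json) so unequal label counts can be spotted (item 4)."""
    return len(xml_points), len(json_points)


def nearest_distances(xml_points, json_points):
    """Distance from each XML point to its closest JSON point (item 5).

    Brute force; large values suggest the two files disagree on positions.
    """
    if not json_points:
        return [math.inf] * len(xml_points)
    return [
        min(math.hypot(x - jx, y - jy) for jx, jy, _ in json_points)
        for x, y, _ in xml_points
    ]


def conflicting_labels(points, radius):
    """Pairs of points closer than `radius` with different labels (item 2).

    `radius` is an assumed threshold for "the same cell"; tune it to the data.
    """
    conflicts = []
    for i, (x1, y1, l1) in enumerate(points):
        for x2, y2, l2 in points[i + 1:]:
            if l1 != l2 and math.hypot(x1 - x2, y1 - y2) < radius:
                conflicts.append(((x1, y1, l1), (x2, y2, l2)))
    return conflicts


def point_in_polygon(x, y, polygon):
    """Ray-casting test: True if (x, y) lies inside the polygon."""
    inside = False
    n = len(polygon)
    for i in range(n):
        x1, y1 = polygon[i]
        x2, y2 = polygon[(i + 1) % n]
        # Only edges that straddle the horizontal line through y can be crossed.
        if (y1 > y) != (y2 > y):
            x_cross = x1 + (y - y1) * (x2 - x1) / (y2 - y1)
            if x < x_cross:
                inside = not inside
    return inside


def points_outside_roi(points, roi_polygons):
    """Points that are not inside any ROI polygon (item 3)."""
    return [
        p for p in points
        if not any(point_in_polygon(p[0], p[1], poly) for poly in roi_polygons)
    ]
```

Running these per case ID and listing every case with a count mismatch, a large nearest-point distance, a label conflict, or an out-of-ROI point would give a complete list of affected files rather than the partial one above. The checks are quadratic in the number of points, so a spatial index would be needed for very dense slides.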
I have shown only a subset of the errors above. Since they appear in the training set, they may also be present in the validation and test sets. Some of them are acceptable, but others will seriously mislead training and evaluation. I therefore ask the organizers to review these errors carefully and update the training, validation, and test data.