Interestingly, the performance of radiologists to diagnose COVID from CT-scans on the STOIC database in also 80% (see Table 2. in https://pubs.rsna.org/doi/10.1148/radiol.2021210384). It reinforces the hypothesis that it is the maximum achievable on this cohort.
Other interesting facts from the paper for the severity prediction tasl (not linked to this thread) :
- CT acquisitions were performed without contrast material administration except when pulmonary embolism was suspected as a confounding diagnosis to COVID-19 pneumonia at presentation. Injected CT are quite easy to detect on images so may be an indirect way to add the feature "suspected pulmonary embolism"
- In Table 3 we see a list of variables significantly associated with a severe outcome: age, sex, oxygen supplementation, diabetes, coronary atery disease, hypertension, coronary calcification score, emphysema and lung disease extent
- In Figure 5, we see that a logistic regression get 69% AUC to predict severity based on some of these variables. I find it surprinsingly low compared to the litterature and the performances obtained on the leaderboard (~80%).