Question about runtime evaluation ¶
By: XA on July 14, 2022, 3:17 a.m.
Hi,
In the released evaluation code, we can see that runtime computation per testing case starts from the docker container initialization and ends at the container close. This evaluation manner includes repeated calculation for environment initialization cost, e.g., docker deploy and gpu warmup, which may not reveal the real runtime of the testing case inference. In practical deployment scenario, we usually initialize the docker container once, and then execute the inference tasks sequentially. We can observe that the first task often takes longer time, and later tasks will have a stable runtime performance. Therefore, maybe start docker container once, inference all testing cases, measure the complete time and get the average is a better solution for inference runtime evalution.