Dear Moderator,

I am writing to request a re-evaluation of past submissions and a correction of the scores on the leaderboard. Upon careful examination, I have identified a discrepancy between the calculated scores and the aggregate values in the recently updated evaluation container, specifically related to the output of dice_ps values.

The observed differences range from about -0.001 to 0.002, and even a variation as small as 0.0002 can significantly affect the top rankings. To keep the leaderboard accurate, I kindly ask that you re-evaluate the past submissions using the updated evaluation system and adjust the scores where necessary.

For your convenience, I have prepared a Colab link that can be used to reproduce the results: Colab Link. This will help verify the calculations and ensure the fairness of the evaluation process.

I sincerely apologize if my calculations are wrong; if so, please correct me.

Thank you for your attention to this matter.

Sincerely,

Marawan Elbatel

name             presented   calculated   difference (calculated - presented)
gregor.koe       0.939487    0.939996      0.000508632
NKCGP            0.931228    0.932307      0.00107997
fangyijie.wang   0.928281    0.930615      0.00233378
坤坤kk           0.922983    0.922011     -0.000972009
柑橘乌云         0.897364    0.898451      0.00108722
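
For anyone who wants to check the arithmetic without opening the Colab, here is a minimal sketch that recomputes the difference column from the rounded scores listed above. It assumes the difference is calculated minus presented, which matches the signs in the table; the variable names are only illustrative, and the results agree with the table only up to the rounding of the listed scores.

```python
# Scores copied from the table above: (name, presented, calculated).
rows = [
    ("gregor.koe", 0.939487, 0.939996),
    ("NKCGP", 0.931228, 0.932307),
    ("fangyijie.wang", 0.928281, 0.930615),
    ("坤坤kk", 0.922983, 0.922011),
    ("柑橘乌云", 0.897364, 0.898451),
]

# Assumption: difference = calculated - presented (matches the table's signs).
diffs = {name: calculated - presented for name, presented, calculated in rows}

for name, diff in diffs.items():
    print(f"{name}: {diff:+.6f}")

# Smallest and largest difference across the listed submissions.
smallest = min(diffs.values())
largest = max(diffs.values())
print(f"range: {smallest:+.6f} to {largest:+.6f}")
```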