shm limitation trigers bus error in nnUNetv2

shm limitation trigers bus error in nnUNetv2  

  By: Zhack on July 28, 2023, 6:20 p.m.

Dear organisers,

I am using a modified version of the nnUNet_baseline, to fit nnUNetv2

I have noticed the docker settings on shared memory size (i.e.: --shm-size="128m") does not seem to allow nnUNetV2's prediction pipeline to run, raising a bus error (SIGBUS) on the preprocessing workers (even if there is only one worker) in my test environment

I am taking a guess that this is also a problem in the validation environment, as I have isolated this limitation as the only problem, as test.sh runs without problem once we set --shm-size="1g".

As a side note, setting the ipc="host" flag also allows the image to run properly

Is this setup locked, or can this limitation be lifted? Or am I simply missing something else?

Regards, Zhack

 Last edited by: Zhack on Aug. 15, 2023, 12:59 p.m., edited 1 time in total.

Re: shm limitation trigers bus error in nnUNetv2  

  By: Isensee on Aug. 4, 2023, 9:25 p.m.

Wasn't aware of your forum post and ran into the same issue today, see here I found a workaround for this though. No secrets in my process.py so here it is. Feel free to adapt as needed. Since the Docker will always only be applied too one test case at a time (inefficient. meh.) there is no benefit from using nnU-Net's complicated interleaving of preprocessing, prediction and segmentation export.

Google Drive Link

Best and good luck!

Fabian

 Last edited by: Isensee on Feb. 6, 2024, 9:43 a.m., edited 8 times in total.

Re: shm limitation trigers bus error in nnUNetv2  

  By: Zhack on Aug. 5, 2023, 6:24 p.m.

Thank you so much for your answer!

I'm glad I figured the problem correctly, I can see we went through the same (painful) debugging steps :')

Just to let you know in case this wasn't intended, the drive link requires to send an access request

Best of luck to you, and thanks again!

Zhack

Re: shm limitation trigers bus error in nnUNetv2  

  By: Isensee on Aug. 7, 2023, 7:41 a.m.

Hey, the link should now work. Apologies! Best, Fabian