The algorithm failed on one or more cases.
By: tpvagenas on July 14, 2023, 7:01 p.m.
Hello, could you give me more details on the error I received today? Thank you.
By: apepe on July 15, 2023, 1:45 p.m.
Hi
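Here is your output. A DataLoader worker was killed by a bus error, which the log attributes to insufficient shared memory, so you might need to use a lower number of workers. I have seen that you had a successful submission today.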
2023-07-14T18:43:51.395000+00:00 ERROR: Unexpected bus error encountered in worker. This might be caused by insufficient shared memory (shm).
2023-07-14T18:44:04.399000+00:00 Traceback (most recent call last):
2023-07-14T18:44:04.399000+00:00   File "/home/user/.local/lib/python3.8/site-packages/torch/utils/data/dataloader.py", line 1132, in _try_get_data
2023-07-14T18:44:04.399000+00:00     data = self._data_queue.get(timeout=timeout)
2023-07-14T18:44:04.399000+00:00   File "/usr/local/lib/python3.8/multiprocessing/queues.py", line 107, in get
2023-07-14T18:44:04.399000+00:00     if not self._poll(timeout):
2023-07-14T18:44:04.399000+00:00   File "/usr/local/lib/python3.8/multiprocessing/connection.py", line 257, in poll
2023-07-14T18:44:04.399000+00:00     return self._poll(timeout)
2023-07-14T18:44:04.399000+00:00   File "/usr/local/lib/python3.8/multiprocessing/connection.py", line 424, in _poll
2023-07-14T18:44:04.399000+00:00     r = wait([self], timeout)
2023-07-14T18:44:04.399000+00:00   File "/usr/local/lib/python3.8/multiprocessing/connection.py", line 931, in wait
2023-07-14T18:44:04.399000+00:00     ready = selector.select(timeout)
2023-07-14T18:44:04.399000+00:00   File "/usr/local/lib/python3.8/selectors.py", line 415, in select
2023-07-14T18:44:04.399000+00:00     fd_event_list = self._selector.poll(timeout)
2023-07-14T18:44:04.399000+00:00   File "/home/user/.local/lib/python3.8/site-packages/torch/utils/data/_utils/signal_handling.py", line 66, in handler
2023-07-14T18:44:04.399000+00:00     _error_if_any_worker_fails()
2023-07-14T18:44:04.399000+00:00 RuntimeError: DataLoader worker (pid 40) is killed by signal: Bus error. It is possible that dataloader's workers are out of shared memory. Please try to raise your shared memory limit.
2023-07-14T18:44:04.399000+00:00
2023-07-14T18:44:04.399000+00:00 The above exception was the direct cause of the following exception:
2023-07-14T18:44:04.399000+00:00
2023-07-14T18:44:04.399000+00:00 Traceback (most recent call last):
2023-07-14T18:44:04.399000+00:00   File "/usr/local/lib/python3.8/runpy.py", line 194, in _run_module_as_main
2023-07-14T18:44:04.399000+00:00     return _run_code(code, main_globals, None,
2023-07-14T18:44:04.399000+00:00   File "/usr/local/lib/python3.8/runpy.py", line 87, in _run_code
2023-07-14T18:44:04.399000+00:00     exec(code, run_globals)
2023-07-14T18:44:04.399000+00:00   File "/opt/app/process.py", line 201, in <module>
2023-07-14T18:44:04.399000+00:00     Segaalgorithm().process()
2023-07-14T18:44:04.399000+00:00   File "/home/user/.local/lib/python3.8/site-packages/evalutils/evalutils.py", line 183, in process
2023-07-14T18:44:04.399000+00:00     self.process_cases()
2023-07-14T18:44:04.399000+00:00   File "/home/user/.local/lib/python3.8/site-packages/evalutils/evalutils.py", line 191, in process_cases
2023-07-14T18:44:04.399000+00:00     self._case_results.append(self.process_case(idx=idx, case=case))
2023-07-14T18:44:04.399000+00:00   File "/opt/app/process.py", line 125, in process_case
2023-07-14T18:44:04.399000+00:00     predictions = self.predict(input_image=input_image,filepath=input_image_file_path)
2023-07-14T18:44:04.399000+00:00   File "/opt/app/process.py", line 170, in predict
2023-07-14T18:44:04.399000+00:00     for val_data in test_org_loader:
2023-07-14T18:44:04.399000+00:00   File "/home/user/.local/lib/python3.8/site-packages/torch/utils/data/dataloader.py", line 633, in __next__
2023-07-14T18:44:04.399000+00:00     data = self._next_data()
2023-07-14T18:44:04.399000+00:00   File "/home/user/.local/lib/python3.8/site-packages/torch/utils/data/dataloader.py", line 1328, in _next_data
2023-07-14T18:44:04.399000+00:00     idx, data = self._get_data()
2023-07-14T18:44:04.399000+00:00   File "/home/user/.local/lib/python3.8/site-packages/torch/utils/data/dataloader.py", line 1294, in _get_data
2023-07-14T18:44:04.399000+00:00     success, data = self._try_get_data()
2023-07-14T18:44:04.399000+00:00   File "/home/user/.local/lib/python3.8/site-packages/torch/utils/data/dataloader.py", line 1145, in _try_get_data
2023-07-14T18:44:04.400000+00:00     raise RuntimeError('DataLoader worker (pid(s) {}) exited unexpectedly'.format(pids_str)) from e
2023-07-14T18:44:04.400000+00:00 RuntimeError: DataLoader worker (pid(s) 40) exited unexpectedly
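
As a follow-up to the suggestion above about using fewer workers, here is a minimal sketch of one way to pick a worker count from the shared memory that is actually free in the container. The helper names `free_shared_memory_bytes` and `pick_num_workers` and the 512 MiB per-worker budget are assumptions made for illustration, not part of PyTorch, evalutils, or the platform. The real per-worker requirement depends on your image sizes and batch size.

```python
import os
import shutil


def free_shared_memory_bytes(shm_path="/dev/shm"):
    """Return the free bytes on the shared-memory mount, or None if it is missing."""
    if not os.path.isdir(shm_path):
        return None
    return shutil.disk_usage(shm_path).free


def pick_num_workers(requested, free_bytes, bytes_per_worker=512 * 1024**2):
    """Cap the worker count by how many per-worker budgets fit in free shared memory.

    bytes_per_worker is an assumed budget (hypothetical default of 512 MiB).
    Returns 0 when the free space is unknown or smaller than one budget;
    0 means the data is loaded in the main process without worker subprocesses.
    """
    if free_bytes is None:
        return 0
    return max(0, min(requested, free_bytes // bytes_per_worker))


if __name__ == "__main__":
    workers = pick_num_workers(requested=4, free_bytes=free_shared_memory_bytes())
    print(f"num_workers to use: {workers}")
```

The result can then be passed as the `num_workers` argument when building the loader in `process.py`, for example `DataLoader(dataset, batch_size=1, num_workers=workers)`. Setting `num_workers=0` avoids passing batches between processes through shared memory entirely, at the cost of slower loading, so it is a reasonable first thing to try when this bus error appears.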