Unable to access large gpu even when they are free and people are using them ¶
By: agaldran on March 6, 2024, 9:15 p.m.
Hello!
So this has been happening to me all day: I launch a jobman process requesting large-gpu, and it remains waiting forever. However, when I check the queue, there are like two or three users of that resource, so I should be able to access it, right? In the attached screenshot shows my current situation. The most annoying thing is that all day I have been seeing that resource being used by a variable number of users, which means that the problem is on my end?
I have been emailing the organizers all day, asking what could be happenning, but no answer. I thought that maybe some caritative soul in the forum has encountered the same situation and knows how to resolve it? It's very frustrating that, so close to the deadline, I am only able to train toy models...