Hi there,
This morning, I found several nodes of our cluster unreachable by the queueing system, i.e. their status became "au" or "E" (pls see the attached pic.). Indeed, I can access them by ssh, e.g., "ssh compute-0-10", but it seems that the queueing system cannot find them.
Any idea about why this happens and any possible solution?
Thanks a lot.