Qblg002 is drained frequently

From HPC users
Revision as of 09:29, 6 January 2022 by Harfst (talk | contribs) (Created page with "Der Knoten qblg002 wird häufiger (alle paar Monate) in den Status drained gesetzt. Tickets: 20220105-0193 Status des Knotens $ scontrol show node qblg002 ... Reason=Ki...")
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigationJump to search

Der Knoten qblg002 wird häufiger (alle paar Monate) in den Status drained gesetzt.

Tickets: 20220105-0193

Status des Knotens

$ scontrol show node qblg002
...
  Reason=Kill task failed [root@2022-01-05T16:45:45]

Beheben mit

$ scontrol update node=qblg002 state=undrain