8000 feature request: show slurm job state code in batch connect session viewer · Issue #4391 · OSC/ondemand · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
feature request: show slurm job state code in batch connect session viewer #4391
Open
@simonLeary42

Description

@simonLeary42

if my batch connect job was OOM killed, it would be nice to see that information in my batch connect session viewer instead of "completed".

https://slurm.schedmd.com/sacct.html#SECTION_JOB-STATE-CODES

JOB STATE CODES
       BF  BOOT_FAIL       Job  terminated due to launch failure, typically due to a hardware failure (e.g. unable to boot the node or block and the job
                           can not be requeued).
       CA  CANCELLED       Job was explicitly cancelled by the user or system administrator.  The job may or may not have been initiated.
       CD  COMPLETED       Job has terminated all processes on all nodes with an exit code of zero.
       DL  DEADLINE        Job terminated on deadline.
       F   FAILED          Job terminated with non-zero exit code or other failure condition.
       NF  NODE_FAIL       Job terminated due to failure of one or more allocated nodes.
       OOM OUT_OF_MEMORY   Job experienced out of memory error.
       PD  PENDING         Job is awaiting resource allocation.
       PR  PREEMPTED       Job terminated due to preemption.
       R   RUNNING         Job currently has an allocation.
       RQ  REQUEUED        Job was requeued.
       RS  RESIZING        Job is about to change size.
       RV  REVOKED         Sibling was removed from cluster due to other cluster starting the job.
       S   SUSPENDED       Job has an allocation, but execution has been suspended and CPUs have been released for other jobs.
       TO  TIMEOUT         Job terminated upon reaching its time limit.

from #4275, I know that OOD already detects the "suspended" state, so I don't think this would be too difficult to implement.

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions

      0