Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

  • sinfo shows existing queues
  • scontrol show job <JobID> shows information about specific job
  • sstat <JobID> shows resources used by a specific job
  • squeue shows information about queues and used nodes
  • smap curses-graphic of queues and nodes
  • sbatch <script> submits a batch job
  • salloc <resources> requests access to compute nodes for interactive use
  • scancel <JobID> cancels a batch job
  • srun <ressources> <executable> starts a (parallel) code
  • sshare and sprio give information on fair share value and job priority

Do's & Don'ts

  • Do not use srun for simple non-parallel jobs like cplnrm, g[un]zip
  • Make use of parallel srun p[gu]igz instead of g[un]zip if you have allocated more than one CPU already
  • Do not allocate costly resources (like fat/gpu nodes) if you not need them. Check the CPU/Memory-Efficiency of your jobs with info.sh -S

Example Scripts

Job arrays

...