Account
To help attributing the usage of computing resources to the groups and projects of AWI, which is needed for reporting, on Albedo it is necessary to specify an account.
This is done by setting
-A, --account=<account>
Possible slurm accounts are listed after login. To enforce setting an account, no (valid) default account is set.
Users are, however, able to change this setting on their own:
sacctmgr modify user <user> set DefaultAccount=<account>
Partitions
Albedo’s compute nodes are divided into the following partitions, which are shown in the table below.
The smp
partiton is the default and is for jobs with cores.
Nodes in the mpp
partition are exclusively reserved. This partition is used when one ore more nodes are needed.
The fat nodes can be selected via the fat
partition. This partition resembles the smp
partition but each node has much more memory.
Similarly, the GPU nodes can be accessed via the gpu
partition. Note, that the type and number of GPUs need to be specified. More infos about the hardware specification of each node can be found in the System Overview.
Partition | Nodes | Description |
---|---|---|
smp | prod-[001-240] |
|
mpp | prod-[001-240] |
|
fat | fat-00[1-4] |
|
gpu | gpu-00[1-2] |
|
Quality of service (QOS)
By default, the QOS 30min
is used. It has a max. walltime of 30 minutes and jobs with this QOS get a higher priority and have access to a special SLURM reservation during working time (TODO: add details when set up), to facilitate development and testing. For longer runs, another QOS (and walltime) has to be specified. See table below. Note: long running jobs (longer than 12 hours, up to 48 hours) “cost” more in terms of fairshare.
QOS | max. walltime | UsageFactor | Priority QOS_factor |
---|---|---|---|
short | 0:30:00 | 1 | 50 |
12h | 12:00:00 | 1 | 0 |
48h | 48:00:00 | 2 | 0 |
A short note on the definitions:
UsageFactor: A float that is factored into a job’s TRES usage (e.g. RawUsage, …)
and RawUsage= cpu-seconds (#CPUs * seconds). → Jobs using the 48h
QOS are twice as expensive when calculating job priorities (see Scheduling).