Model	User	A40 vs. A100
tensorflow-gpu AI application	vhelm	no difference
python3, matrix operations with with numpy (fat) vs cupy (gpu)	sviquera

Disk Access

NVMe: Infinniband GPFS: albedo

		albedo			ollie
Application	user	node internal	/tmp (NVMe)	100 Gb Infiniband /albedo (GPFS)	10 Gb Ethernet /isibhv (NVMe)	node internal /tmp (SSD)	100 Gb	Omnipath /	work (BeeGFS)	10 Gb Ethernet /isibhv (NVMe)
idm	vhelm	~9 sec	10~13 sec	8~11 sec spikes up to 181 sec	27~29 sec	27~37 sec	29~60 sec spikes up to 98 sec
ls # default with stat/color ls -f	directory with 30000 entires	0.08 sec 0.19 sec	0.04 sec 6~15 sec	0.03 sec 0.2 sec	0.1 sec 0.4 sec	0.2 sec 1.6 sec	0.08 sec 0.3~0.7 sec

Runtime compared with ollie

...