Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Table of Contents

Introduction

The HSM (Hierarchical Storage ManagementHierarchical Storage Management) at AWI provides 

  1. slightly smiling face (nearly) unlimited storage space and
  2. (smile)  two replicates on tape (in different buildings for security) and
  3. slightly smiling face a third copy on disk (for selected data, smaller files for faster access) → VTL (Virtual Tape Library, Silent Brick)
  4. slightly smiling face for a (comparable) low cost

However, there are two caveats:

  • (warning) Your (larger) Data is archived on tape, so it will take a few minutes to get it back (unless it is online or has a disk-copy).
  • (warning) You need a project to archive your data (apply for a project here: eResources).


HSM at AWI

Purpose(mainly) PANGAEA and Registry on how Open to all usersArchive of sensorawi.de(tick)IB, IPisirep-...Disaster recovery only fast
Four domains for tape archiving at AWI

Image Modified


Domain
Usage
PurposePathDisk CopyDescriptionHow to apply

A

PANGAEA

/hs/platforms

(tick)

Archive of sensor.awi.de

Permanent archive for irrecoverable data. Metadata is needed for this domain. Please contact PANGAEA
on how to submit your data.
/hs/usero
/hs/pangaea
(tick)PANGAEA
Note: Former /hs/usero has moved to /hs/pangaea/data/legacy/ on 2024-06-17
PProjects

/hs/D-P

/projects
/hs/D-P/s3projects

(tick)Project data

Long term project data with predefined life time. A project can be created with eResources https://cloud.awi.de/#/projects 
The setgid on directories ensures that new (sub-)directories belong to the project (POSIX) group automatically

IB, IPDesaster recoveryDaily (or weekly) automatic replication of online project data from the Isilon (when selected in eResources) in Bremerhaven and Potsdam, respectivelyDInternal IT stuffFor internal IT use (used for additional backups and storing of expired user/project data)
DomainPathDisk CopyDescriptionHow to apply

A

/hs/platforms

(tick)

.

Only possible with sufficient metadata, see links.
/hs/usero
/hs/pangaea
(tick)PANGAEA
P

/hs/projects
/hs/s3projects


Project data

Please use eResources to create a project and request storage resources. 

CDepricated

/hs/usera
/hs/userc

/hs/userm


Have vanished early 2020


IDesaster recovery/hs/
D-I/(minus)Project replica from the Isilons (Bhv, Pot)
Daily (or weekly) automatic replication of online project data from the Isilon (when selected in eResources) in Bremerhaven and Potsdam, respectively. Used for disaster recovery only (=hopefully never (wink) ).
DInternal IT stuff/hs/backup(tick)samfs-dumps, ScoutFS-dumps, logfiles, VeeamIT internal use only
/hs/store(minus)10-year storing of expired user and project data
/hs/s3gateway
experimental storage

A disk copyis available for specific (smaller) files for some file systems. This allows a

significantly faster access of offline files. The availability of this disk archive depends on the actual resources/usage and might change.

Concept

Principle Idea 

  • HSM:  A (H)ierarchical (S)torage (M)anagement system consists of (at least) two storage systems: A cache speeds up access, and the hierarchy reduces cost. Based on a set of rules, data is stored on certain connected storage devices (tapes and optional disks).
  • ScoutFS is used at AWI since 2024 and supersedes samfs (used since 2004)  www.versity.com/products/scoutfs/  www.scoutfs.org
  • ScoutAM is used as (A)rchive (M)anager www.versity.com/products/scoutam/ and supersedes OHSM/VSM

...

  • 2 TFinity Tape Libraries in two buildings
  • each has 3100 licensed slots for LTO tapes
  • 600 LTO-9 tapes in each library (600x ~20 TB → ~12 PB)

Image Modified

TFin-E


Image ModifiedTFin-D

Live View Frame #2 or Frame #3
User: viewer PW: Viewer123!

Live View Frame #2 or Frame #3
User: viewer PW: Viewer123!

Server, Switches and disks

  • Three Dell R760xd server (hssrv2[a-c].dmawi.de)
  • 4 Brocade (32Gb) 6610 SAN switches
  • 270 TB SSD primary cache (Dell ME5084-SSD)
  • 488 TB extended HDD-cache (Dell ME5084-HDD)
  • Ab Ende Planed for end of 2024:
    1120 TB Virtual Tape Library (VTL) Silent Brick von Fast-LTA 
    • Controller G5200
    • SAS Switch
    • 7x SilentBrick Max

...