wiki:Public/User_Guide/News

Version 12 (modified by Jochen Kreutz, 4 years ago) (diff)

Latest news on the DEEP-EST prototype system

This is a summary of the latest news concerning the system. For a list of known problems related to the system, please refer to this page.

Last update: 2020-09-07

System will be in maintenance in CW37 (Monday, 2020-09-07 to Friday 2020-09-11)

System hardware

CM nodes

ESB nodes

  • remaining two ESB racks have been installed: now 75 ESB nodes present
  • currently connected via IB EDR, to be changed to Extoll network later this year

DAM nodes

  • all nodes still run in app-direct mode for the DCPMM (see #2366)
  • BIOS update is being tested and will be rolled out to all DAM nodes soon

Network Federation Gateways

  • two gateway nodes for IB ↔ Extoll bridging between CM/ESB and DAM are available for the users
  • for an example on how to use the gateway nodes and for further information, please refer to the batchsystem wiki page.

Global resources

File Systems

  • DEEP-EST storage is being rebuilt in CW37 for performance reasons
    • affecting /usr/local and /work
    • BeeGFS servers and clients will be updated
  • Due to the BeeGFS updates for the DEEP-EST storage the SDV will be de-couled meaning that after CW37 the SDV nodes will only mount /sdv-work (not /work) and the DEEP-EST nodes will only mount /work (not /sdv-work)
  • BeeGFS (/work) will get user quotas (see section "User management")
  • It is possible to access the $ARCHIVE file system from the deepv login node under /arch. For more information about $ARCHIVE, please refer to the Filesystems page and see also the hint in the MOTD for efficient usage of the archive filesystem
  • DCPMM usage within BeeGFS currently being tested on dp-dam03
    • more tests (with CentOS 8) to follow

System software

SW updates

  • new SLURM features are being integrated and will be rolled out in near future:
    • extended logging and improved resource management for jobs within a workflow
    • burst buffer pluggin
  • new 2020a Easybuild stage is currently being set up
  • new ParaStationMPI version (5.4.6-1) has been installed and is used as default now

User management

BeeGFS Quotas

  • a quota for the BeeGFS file system (mounted to /work) is being implemented