
Version 26 (modified by Jochen Kreutz, 2 years ago)

general update to reflect current system status

Latest news on the DEEP-EST prototype system

This is a summary of the latest news concerning the system. For a list of known problems related to the system, please refer to this page.

Last update: 2022-01-21

The compute nodes are being moved to Rocky Linux 8.5; please expect limited system access during the transition. See also the System software section.

System hardware

CM nodes

  • the cluster nodes now have direct EDR IB access to the SSSM storage nodes (without using the IB ↔ 40 GbE gateway)

ESB nodes

  • all ESB nodes (dp-esb[01-75]) now use the EDR InfiniBand interconnect (Extoll is no longer used)
  • SSSM and AFSM file servers can be directly accessed through IB

DAM nodes

  • DAM nodes now use EDR InfiniBand (instead of 40 GbE and Extoll)
  • SSSM and AFSM file servers can be directly accessed through IB

Network Federation Gateways

  • with the all-IB solution, NFGWs are no longer needed; this also affects heterogeneous jobs
  • dp-nfgw[01,04] (IB EDR ↔ 40 GbE) are still present, but no longer in use
  • remaining NFGWs are being used for BXI testing now: dp-nfgw[02,03,05,06]

SDV

  • a 4-node BXI test setup has been installed using the former GW nodes
  • FPGA test nodes are available for using FPGAs with oneAPI and OpenCL:
    • Arria10: deeper-sdv[09,10]
    • Stratix10: dp-sdv-esb[01,02]
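
The bracketed node names in the hardware lists above (for example dp-esb[01-75] and dp-nfgw[02,03,05,06]) are a compact notation for groups of hosts. The following is a minimal sketch of how such a name can be expanded into individual host names. The function name expand_hostlist is hypothetical and not part of the system software, and the sketch assumes a single bracket group at the end of the name.

```python
import re

def expand_hostlist(expr):
    """Expand a name such as 'dp-esb[01-75]' into individual host names.

    Illustrative helper (hypothetical name); handles one trailing
    bracket group containing comma-separated numbers and ranges.
    """
    match = re.fullmatch(r"([^\[\]]+)\[([^\[\]]+)\]", expr)
    if match is None:
        return [expr]  # plain host name without a bracket group
    prefix, body = match.groups()
    hosts = []
    for part in body.split(","):
        if "-" in part:
            first, last = part.split("-")
            width = len(first)  # keep the zero padding, e.g. 01 .. 75
            for number in range(int(first), int(last) + 1):
                hosts.append(f"{prefix}{number:0{width}d}")
        else:
            hosts.append(prefix + part)
    return hosts

print(expand_hostlist("dp-nfgw[02,03,05,06]"))
# ['dp-nfgw02', 'dp-nfgw03', 'dp-nfgw05', 'dp-nfgw06']
print(len(expand_hostlist("dp-esb[01-75]")))
# 75
```

If the Slurm command line tools are available on your system, `scontrol show hostnames` performs the same kind of expansion without any custom code.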

File Systems

Please also refer to the Filesystems overview.

  • the All Flash Storage Module (AFSM) provides a fast work file system mounted at /afsm (symbolic link to /work) on all compute nodes (CM, DAM, ESB) and the login node (deepv)
  • the older System Services and Storage Module (SSSM) work file system is obsolete, but still available at /work_old for data migration
  • SSSM still serves the /usr/local/software file system, with the following changes:
    • starting with the Rocky 8 image, /usr/local will be a local file system
    • /usr/local/software will be shared and provided by the SSSM storage
    • in addition to the EasyBuild software stack, the shared /usr/local/software file system will contain manually installed software in a legacy subfolder
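
Since the old SSSM work file system remains available at /work_old for data migration, it can help to check which files have not yet been copied to the new work file system. The following is a minimal sketch under stated assumptions: the function name pending_migration and the project directory myproject are hypothetical, and the sketch only compares relative file paths, not file contents or timestamps.

```python
import os

def pending_migration(old_root, new_root):
    """Return relative paths of files found under old_root but not under new_root."""
    missing = []
    for dirpath, _dirnames, filenames in os.walk(old_root):
        for name in filenames:
            rel = os.path.relpath(os.path.join(dirpath, name), old_root)
            if not os.path.exists(os.path.join(new_root, rel)):
                missing.append(rel)
    return sorted(missing)

if __name__ == "__main__":
    # Hypothetical project directory; adjust to your own layout.
    for rel in pending_migration("/work_old/myproject", "/work/myproject"):
        print(rel)
```

An empty result only means that every file name under the old directory also exists under the new one; it does not verify that the copies are complete.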

System software

  • the ParaStation (psmgmt) update to 5.1.45-3 has been performed

OS

  • the decision was taken to use one Rocky Linux image for all compute nodes
  • the transition from CentOS to Rocky Linux 8.5 is ongoing (starting with the CM nodes and continuing with ESB and DAM)
  • the login node deepv has moved to Rocky Linux 8 as well
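
While the transition is ongoing, nodes may run either CentOS or Rocky Linux. The following is a minimal sketch of how a script could tell the two apart by parsing the standard /etc/os-release file. The function names are hypothetical, and the sample content is written for illustration rather than copied from a DEEP-EST node.

```python
def parse_os_release(text):
    """Parse the KEY=value lines of an os-release file into a dict."""
    info = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, value = line.split("=", 1)
        info[key] = value.strip().strip('"')
    return info

def is_rocky8(info):
    """True if the parsed data describes a Rocky Linux 8.x system."""
    major = info.get("VERSION_ID", "").split(".")[0]
    return info.get("ID") == "rocky" and major == "8"

# Illustrative sample; on a node, read the text from /etc/os-release instead.
sample = 'NAME="Rocky Linux"\nID="rocky"\nVERSION_ID="8.5"\n'
print(is_rocky8(parse_os_release(sample)))
# True
```

On a node, the same check can be run with `is_rocky8(parse_os_release(open("/etc/os-release").read()))`.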

EasyBuild

  • 2022 EasyBuild stage is being set up
    • a first version (basic components such as compilers) is expected to be available in February 2022