[[TOC]]

= Latest news on the DEEP-EST prototype system =

This is a summary of the latest news concerning the system. For a list of known problems related to the system, please refer to [wiki:Public/User_Guide/PaS this page].

''Last update: 2020-09-07''

[[span(style=color: #FF0000, System will be in maintenance in CW37 (Monday, 2020-09-07 to Friday 2020-09-11))]]

== System hardware ==

=== CM nodes ===

=== ESB nodes ===

 - remaining two ESB racks have been installed: now 75 ESB nodes present
 - currently connected via IB EDR, to be changed to Extoll network later this year

=== DAM nodes ===

 - all nodes still run in app-direct mode for the DCPMM (see #2366)
 - BIOS update is being tested and will be rolled out to all DAM nodes soon

=== Network Federation Gateways ===

 - two gateway nodes for IB <-> Extoll bridging between CM/ESB and DAM are available to users
 - for an example of how to use the gateway nodes and for further information, please refer to the [wiki:/Public/User_Guide/Batch_system#HeterogeneousjobswithMPIcommunicationacrossmodules batch system] wiki page

=== Global resources ===

{{{#!comment
==== NAM ====
 - A mock-up of the NAM management software will be developed in the next weeks to allow working in parallel on the development of the real NAM manager and its integration with the system software (`psmgmnt` and `psslurm`).
}}}

=== File Systems ===

 - DEEP-EST storage is being rebuilt in CW37 for performance reasons
   - affecting `/usr/local` and `/work`
   - BeeGFS servers and clients will be updated
   - **Due to the BeeGFS updates for the DEEP-EST storage, the SDV will be decoupled: after CW37 the SDV nodes will only mount `/sdv-work` (not `/work`) and the DEEP-EST nodes will only mount `/work` (not `/sdv-work`)**
 - BeeGFS (`/work`) will get user quotas (see section "User management")
 - It is possible to access the `$ARCHIVE` file system from the `deepv` login node under `/arch`.
   For more information about `$ARCHIVE`, please refer to the [wiki:Filesystems Filesystems page] and see the hint in the MOTD on efficient usage of the archive file system
 - DCPMM usage within BeeGFS is currently being tested on `dp-dam03`
   - more tests (with CentOS 8) to follow

== System software ==

=== SW updates ===

 - new SLURM features are being integrated and will be rolled out in the near future:
   - extended logging and improved resource management for jobs within a workflow
   - burst buffer plugin
 - a new 2020a EasyBuild stage is currently being set up
 - a new ParaStationMPI version (5.4.6-1) has been installed and is now the default

{{{#!comment
 - modules successfully tested in the Devel stage will be included in the production (default) stage
 - currently, the latest ParaStationMPI (including pscom) can be tested in the Devel stage
}}}

=== User management ===

==== BeeGFS Quotas ====

 - a quota for the BeeGFS file system (mounted at `/work`) is being implemented
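
To get a rough idea of how much space a directory on `/work` occupies before the quota takes effect, the file sizes below it can simply be summed up. The following is a minimal sketch only: the helper name `tree_size_bytes` is hypothetical, it measures one directory tree rather than everything a user owns, and the numbers reported by the BeeGFS quota accounting may differ.

```python
import os


def tree_size_bytes(root):
    """Return the total apparent size in bytes of all regular files below root.

    Hypothetical helper for a rough estimate; not the BeeGFS quota accounting.
    """
    total = 0
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            # Skip symbolic links so that link targets are not counted.
            if os.path.islink(path):
                continue
            try:
                total += os.path.getsize(path)
            except OSError:
                # File vanished or is not readable; leave it out of the estimate.
                continue
    return total
```

For example, `tree_size_bytes("/work/<your directory>") / 1024**3` gives the size in GiB; the path is a placeholder and has to be replaced by an existing directory. Walking a large tree creates many metadata requests, so the sketch is best run sparingly.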