[[TOC]] = Latest news on the DEEP-EST prototype system = This is a summary of the latest news concerning the system. For a list of known problems related to the system, please refer to [wiki:Public/User_Guide/PaS this page]. == System hardware == === CM nodes === === ESB nodes === - currently only 1 rack installed in the system (25 nodes). === DAM nodes === - all nodes still run in app-direct mode for the DCPMM (see #2366) === Network Federation Gateways === - two gateway nodes for IB <-> Extoll bridging between CM/ESB and DAM are available for the users - for an example on how to use the gateway nodes and for further information, please refer to the [wiki:/Public/User_Guide/Batch_system#HeterogeneousjobswithMPIcommunicationacrossmodules batchsystem] wiki page. === Global resources === {{{#!comment ==== NAM ==== - A mock-up of the NAM management software will be developed in the next weeks to allow working in parallel on the development of the real NAM manager and its integration with the system software (`psmgmnt` and `psslurm`). }}} == System software == === SW updates === - upcoming SLURM update to version 19.5.5 - will require a maintenance for the full system - no date fixed yet - Easybuild has been updated to version 4.1.2 - modules successfully tested in the Devel stage will be included in the production (default) stage - !ParaStation Health Checker now includes a check for GPU presence on the DAM and ESB nodes === User management === ==== QoS ==== - the quality of service groups (QoS) within Slurm (determining the jobs priority) are currently being re-organized with regards to the accounting introduced with the EAP - please, report any issues in starting jobs or in specifying an account for your jobs using the `-A` option ==== BeeGFS Quotas ==== - it's currently under consideration to implement a quota for the BeeGFS file system (mounted to /work)