Latest news on the DEEP-EST prototype system
This is a summary of the latest news concerning the system. For a list of known problems related to the system, please refer to this page.
System hardware
CM nodes
ESB nodes
- currently only one rack (25 nodes) is installed in the system
DAM nodes
- the DCPMM on all nodes is still configured in app-direct mode (see #2366)
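In app-direct mode the DCPMM is not a transparent extension of DRAM; applications usually reach it by memory-mapping a file on a file system that sits on the persistent memory. The following is a minimal Python sketch of that access pattern, not a description of the actual DAM node setup: the function name write_persistent and the example path are hypothetical, and the mount point of the DCPMM on the DAM nodes is not stated on this page.

```python
import mmap
import os


def write_persistent(path: str, payload: bytes, size: int = 4096) -> bytes:
    """Map `path` into memory, store `payload` at offset 0, and read it back."""
    if len(payload) > size:
        raise ValueError("payload does not fit into the mapped region")
    # Create (or reuse) the backing file and make sure it is `size` bytes long.
    fd = os.open(path, os.O_RDWR | os.O_CREAT, 0o600)
    try:
        os.ftruncate(fd, size)
        with mmap.mmap(fd, size) as region:
            region[: len(payload)] = payload
            region.flush()  # ask the OS to write the mapped pages back
            return bytes(region[: len(payload)])
    finally:
        os.close(fd)
```

A call such as write_persistent("/mnt/pmem0/example.bin", b"hello") would exercise it, where /mnt/pmem0 is an assumed mount point used only for illustration.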
Network Federation Gateways
- two gateway nodes for IB ↔ Extoll bridging between CM/ESB and DAM are available to users
- for an example of how to use the gateway nodes and for further information, please refer to the batchsystem wiki page.
Global resources
System software
SW updates
- upcoming Slurm update to version 19.05.5
- the update will require a maintenance window for the full system
- no date has been fixed yet
- EasyBuild has been updated to version 4.1.2
- modules successfully tested in the Devel stage will be included in the production (default) stage
- ParaStation Health Checker now includes a check for GPU presence on the DAM and ESB nodes
User management
QoS
- the quality of service (QoS) groups within Slurm, which determine job priority, are currently being reorganized with regard to the accounting introduced with the EAP
- please report any issues with starting jobs or with specifying an account for your jobs using the -A option
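For reference, the account is passed to sbatch with -A followed by the account name. The small Python sketch below only assembles such a command line so that it can be inspected or logged before submission; the helper name sbatch_command and the values "myproject" and "job.sh" are placeholders, not real names on the system.

```python
import shlex
from typing import List


def sbatch_command(account: str, script: str) -> List[str]:
    """Build an sbatch argument list that sets the account via -A."""
    if not account:
        raise ValueError("an account name is required")
    return ["sbatch", "-A", account, script]


# Placeholder values; replace them with your own account and job script.
print(shlex.join(sbatch_command("myproject", "job.sh")))
# prints: sbatch -A myproject job.sh
```

The returned list can be handed to subprocess.run on a login node. When reporting a problem, including the exact command line makes it easier to reproduce.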
BeeGFS Quotas
- a quota for the BeeGFS file system (mounted at /work) is currently under consideration
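No quota limits have been decided, so none are given here. To get a rough idea of how much space a directory below /work occupies today, a sketch like the following could be used. It is an assumption-laden illustration: the function name directory_usage is hypothetical, and it walks the tree itself rather than querying BeeGFS, so it is best run on a small subtree to avoid putting a heavy metadata load on the file system.

```python
import os
import stat


def directory_usage(root: str) -> int:
    """Return the total size in bytes of regular files below `root`."""
    total = 0
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            try:
                st = os.lstat(path)  # do not follow symbolic links
            except OSError:
                continue  # file vanished or is not readable; skip it
            if stat.S_ISREG(st.st_mode):
                total += st.st_size
    return total
```

The result is the sum of apparent file sizes, which may differ from what a future quota would actually count.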