Public/Energy (Version 7, edited Sep 28, 2020 by Alessio Netti)

== Energy Measurement

For the CM and ESB modules there is fine-grained energy measurement in place using Megware's energy meters, attached to the respective compute nodes. In addition, the DCDB monitoring framework is deployed on the entire DEEP-EST prototype, providing a variety of sensor measurements beyond energy consumption. There are different ways to get information about the energy consumption of your jobs (and the nodes). Preferred methods are:

=== SLURM sacct command

This is probably the easiest way to get the energy consumption of your interactive and batch jobs. Once your job has finished, you can use the `sacct` command to query the Slurm database about its energy consumption. For further information and an example of how to use the command for energy measurements, please see [wiki:/Public/User_Guide/Batch_system#Informationonpastjobsandaccounting accounting].

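For example, a query of this kind can be sketched as follows (the job ID is hypothetical, and the `ConsumedEnergy` field is only populated if energy accounting is enabled in the Slurm configuration):

{{{
sacct -j 17435 --format=JobID,JobName,Elapsed,ConsumedEnergy
}}}
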
=== Using DCDB

The "Data Center Data Base" (DCDB) stores measured values from node and infrastructure sensors at a high frequency (every 10 seconds), including the power and energy consumption of the compute nodes. This allows for a very fine-grained analysis of the energy consumed by your jobs, e.g. by specifying precise time-stamp ranges and by accessing measured data from different components like CPU, GPU or memory, instead of using only single accumulated values. Hence, it offers a convenient way to analyze SLURM jobs.

DCDB is currently deployed on the DEEP-EST prototype and is continuously collecting sensor data. A total of 62595 sensors are available, each with a time-to-live of 30 days. Please refer to D5.4 for the full list of monitored sensors and further details about the DCDB deployment. The DCDB Gitlab [https://dcdb.it repository] is also a good source of documentation and usage examples. Two types of data are available:

* **Sensor Data**: ordinary sensors sampled from the compute nodes or the surrounding infrastructure (e.g., power, temperature, CPU instructions).
* **Job Data**: a selected number of sensors aggregated on a per-job basis, i.e., from all the compute nodes on which a given job was running.

DCDB is installed on the DEEP-EST {{{deepv}}} login node, from which data can be queried. To prepare the shell environment and expose the DCDB tools, please run the following:

{{{
source /opt/dcdb/dcdb.bash
}}}

At this point DCDB data can be queried using the {{{dcdbquery}}} and {{{dcdbconfig}}} tools. A distributed Cassandra instance is running on the dp-mngt01, dp-mngt02 and dp-mngt03 machines: only these three should be queried by the DCDB tools. By default, there is no need to specify the target host, as the DCDB tools use a pre-defined value; in case you wish to explicitly select a host to be queried, you can do so with the {{{-h}}} option.
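
For instance, to explicitly direct a query at one of these hosts (query syntax as detailed in the following sections):

{{{
dcdbquery -h dp-mngt01 /deepest/cn/s01/power now-5m now
}}}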

==== The dcdbconfig Tool

The {{{dcdbconfig}}} tool can be used to perform a variety of metadata-related actions. Running the following command will list all available actions:

{{{
dcdbconfig help
}}}

We will present the main dcdbconfig actions that may be of use. To print the complete list of sensors that can be queried on the prototype, run the following command:

{{{
dcdbconfig sensor listpublic
}}}

To show the metadata for a specific sensor name, run the following command:

{{{
Template: dcdbconfig sensor show <SENSORNAME>
Example:  dcdbconfig sensor show /deepest/esb/s25/MemUsed
}}}

Sensor names are slash-separated strings structured like MQTT topics, where each segment indicates a level of the system (e.g., the {{{deepest}}} system, the {{{esb}}} module, the {{{s25}}} compute node). To show the list of jobs stored in the DCDB database that are currently running on the prototype, run the following command:

{{{
dcdbconfig job running
}}}

To show the details of a specific job (e.g., node list, start time), run the following command:

{{{
Template: dcdbconfig job show <JOBID>
Example:  dcdbconfig job show 17435
}}}

A series of other actions to manage the sensor database are available; however, users are advised not to use them, so as not to alter the system's behavior.

==== The dcdbquery Tool

The {{{dcdbquery}}} tool allows you to perform sensor queries, returning results in CSV format. It requires a sensor name and a time range. To perform a basic query, run the following:

{{{
Template: dcdbquery <SENSORNAME> <TSTART> <TEND>
Example:  dcdbquery /deepest/cn/s01/power now-5m now
}}}

Instead of a single sensor, a list of space-separated sensor names can be supplied as well; these will be queried sequentially. Instead of relative time-stamps, absolute time-stamps (in nanoseconds) can also be used:

{{{
Example: dcdbquery /deepest/cn/s01/power 159654356469315800 1596543788812750000
}}}
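
Since {{{dcdbquery}}} returns CSV, its output can be post-processed with standard tools. A minimal sketch that averages the value column, here fed with hypothetical readings instead of a live query (the Sensor,Time,Value column layout is an assumption; check the header of your actual output):

```shell
# Average the third CSV column, skipping the header row.
# In practice, pipe `dcdbquery <SENSORNAME> <TSTART> <TEND>` into awk instead.
printf 'Sensor,Time,Value\n/deepest/cn/s01/power,1596543564,120\n/deepest/cn/s01/power,1596543574,140\n' |
awk -F, 'NR>1 {s+=$3; n++} END {print s/n}'
```

For a power sensor, this yields the average power over the queried interval.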

==== Using dcdbquery for Job Queries

DCDB collects a series of aggregated metrics for each job running on the system, every 10s. These can be queried using the same {{{dcdbquery}}} syntax as above. The sensor name, however, should be constructed as follows:

{{{
Template: /job<JOBID>/<SENSORNAME><STATNAME>
Example:  /job788354/pkg-energy.avg
}}}

The {{{<JOBID>}}} field identifies the job to be queried. {{{<SENSORNAME>}}} instead identifies the aggregated metric and can be one of the following:

* energy
* dram-energy
* pkg-energy
* MemUsed
* instructions
* cpu-cycles
* cache-misses
* cache-misses-l2
* cache-misses-l3
* scalar-double
* scalar-double-128
* scalar-double-256
* scalar-double-512
* gpu-energy (DAM, ESB)
* ib-portXmitData (CN, ESB)
* ib-portRcvData (CN, ESB)

Finally, the {{{<STATNAME>}}} field identifies the type of aggregation performed on the readings of the queried sensor, combining the data of all compute nodes on which the job was running. It can be one of the following:

* .avg (average)
* .med (median)
* .sum (sum)

In addition, for jobs running on multiple modules of the prototype, per-module aggregated metrics are also available. In this case, the sensor name is built as follows:

{{{
Template: /job<JOBID>/<MODULE>/<SENSORNAME><STATNAME>
Example:  /job1001/esb/instructions.med
}}}

The {{{<MODULE>}}} field identifies the prototype module to query, and can be one of the following:

* cn (Cluster Module)
* esb (Extreme-Scale Booster)
* dam (Data Analytics Module)

==== Long-term Sub-sampled Sensor Data

The sensor data collected by DCDB (including per-job data) has a time-to-live of 30 days; after this interval, the data is automatically erased from the database. However, to keep track of the system's history, DCDB automatically computes sub-sampled versions of all sensors. These are kept in the database indefinitely, and can be queried as follows:

{{{
Template: dcdbquery "<OPNAME>(<SENSORNAME>)" <TSTART> <TEND>
Example:  dcdbquery ".avg300(/deepest/cn/s01/power)" now-1h now
}}}

In this case, quoting is required to ensure proper parsing of the command by the shell. The {{{<OPNAME>}}} field defines the type of sub-sampling to be queried for the given sensor. The following alternatives are available:

* .avg300 (5-minute averages)
* .avg3600 (hourly averages)

DCDB computes sub-sampled versions of all sensors, with the exception of CPU core-level sensors (e.g., performance counters for specific CPU cores) and per-job sensors, to avoid inflating the size of the database indefinitely. In general, the operations available for a DCDB sensor are shown along with its metadata when using the {{{dcdbconfig sensor show}}} command.

==== Grafana Visualization

A Grafana instance is also running and is reachable at {{{grafanavm:3000}}}. The {{{grafanavm}}} host is visible only from the prototype's network, so SSH tunneling is required to reach it. In order to use Grafana, an account must be created: if you wish to have access, please contact Alessio Netti (alessio.netti@lrz.de) and one will be created for you. Normally, the Grafana instance should already be set up and you will not need to configure the DCDB data source. If that is not the case, please follow the instructions below.
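
To reach the web frontend from your own machine, a local port forward through the login node can be used; a sketch, assuming the {{{deepv}}} login node as gateway (replace the user name with your own):

{{{
ssh -L 3000:grafanavm:3000 <USER>@deepv
}}}

Grafana is then available in your browser at {{{http://localhost:3000}}}.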

Log into the Grafana web frontend and navigate to the {{{Configuration->Data Sources}}} menu. Here, click on {{{Add Data Source}}} and select DCDB. Please use the following values for the required fields in the configuration window of the DCDB data source:

{{{
URL: https://dp-mngt01:8081
Access: Server (Default)
User and Password: admin
Check "Basic Auth" and "Skip TLS Verify" under the "Auth" section
}}}

At this point, click on {{{Save & Test}}} once and, if the procedure was successful, you will be ready to add dashboards and panels using DCDB data.

=== Using /sys files

The energy meters provide their measured values through the "/sys" filesystem on the compute nodes using different files.
To query the overall energy (in Joules) a node has consumed so far, you can use the `energy_j` file. In your SLURM job script, read this file before and after you `srun` your commands; the difference between the two readings is the energy consumed by your commands (applications):

Unit=[Joules]

{{{
srun sh -c 'if [ $SLURM_LOCALID == 0 ]; then echo ${SLURM_NODEID}: $(cat /sys/devices/platform/sem.5/energy_j); fi'
}}}
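
A minimal job-script sketch along these lines (single-node case; the application name is a placeholder, and the meter path follows the example above):

{{{
#!/bin/bash
#SBATCH -N 1

E0=$(cat /sys/devices/platform/sem.5/energy_j)   # reading before
srun ./your_app
E1=$(cat /sys/devices/platform/sem.5/energy_j)   # reading after
echo "Consumed energy: $((E1 - E0)) J"
}}}

Note that the batch script itself runs on the first allocated node only; for multi-node jobs, per-node readings as in the {{{srun}}} line above are needed.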