Context Navigation

OmpSs-2

Timestamp:: Jun 14, 2019, 3:47:45 PM (6 years ago)
Author:: Pedro Martinez-Ferror
Comment:: —

Legend:

: Unmodified
: Added
: Removed
: Modified

Public/User_Guide/OmpSs-2

-                      v36
+                      v37
 * [#QuickSetuponDEEPSystemforaPureOmpSs-2Application Quick Setup on DEEP System for a Pure OmpSs-2 Application]
 * [#UsingtheRepositories Using the Repositories]
+* [#multisaxpybenchmarkOmpSs-2 multisaxpy benchmark (OmpSs-2)]
+* [#dot-productbenchmarkOmpSs-2 dot-product benchmark (OmpSs-2)]
+* [#mergesortbenchmarkOmpSs-2 mergesort benchmark (OmpSs-2)]
+* [#nqueensbenchmarkOmpSs-2 nqueens benchmark (OmpSs-2)]
+* [#matmulbenchmarkOmpSs-2 matmul benchmark (OmpSs-2)]
+* [#CholeskybenchmarkOmpSs-2MKL Cholesky benchmark (OmpSs-2+MKL)]
+* [#nbodybenchmarkMPI+OmpSs-2TAMPI nbody benchmark (MPI+OmpSs-2+TAMPI)]
+* [#heatbenchmarkMPI+OmpSs-2TAMPI heat benchmark (MPI+OmpSs-2+TAMPI)]
+* Examples:
+  * [#AStep-By-StepDetailedGuidetoExecutetheMultisaxpyBenchmark A Step-By-Step Detailed Guide to Execute the Multisaxpy Benchmark (OmpSs-2)]
+  * [#Dot-productBenchmarkOmpSs-2 Dot-product Benchmark (OmpSs-2)]
+  * [#MergesortBenchmarkOmpSs-2 Mergesort Benchmark (OmpSs-2)]
+  * [#NqueensBenchmarkOmpSs-2 Nqueens Benchmark (OmpSs-2)]
+  * [#MatmulBenchmarkOmpSs-2 Matmul Benchmark (OmpSs-2)]
+  * [#CholeskyBenchmarkOmpSs-2MKL Cholesky Nenchmark (OmpSs-2+MKL)]
+  * [#NbodyBenchmarkMPI+OmpSs-2TAMPI Nbody Nenchmark (MPI+OmpSs-2+TAMPI)]
+  * [#HeatBenchmarkMPI+OmpSs-2TAMPI Heat Benchmark (MPI+OmpSs-2+TAMPI)]
 ----
 …
 All the examples shown here are publicly available at [https://pm.bsc.es/gitlab/ompss-2/examples].  Users must clone/download each example's repository and then transfer it to a DEEP working directory.
 == System configuration ==
+== System Configuration ==
 Please refer to section [#QuickSetuponDEEPSystem Quick Setup on DEEP System] to get a functional version of !OmpSs-2 on DEEP. It is also recommended to run !OmpSs-2 via an interactive session on a cluster module (CM) node.
 == Building and running the examples ==
+== Building and Running the Examples ==
 All the examples come with a Makefile already configured to build (e.g. `make`) and run (e.g. `make run`) them.  You can clean the directory with the command `make clean`.
 == Controlling available threads ==
+== Controlling the Available Threads ==
 In order to limit or constraint the available threads for an application, the Unix **taskset** tool can be used to launch applications with a given thread affinity.  In order to use taskset, simply precede the application's binary with taskset followed by a list of CPU IDs specifying the desired affinity:
 …
 The example above will run **application** with 4 cores: 0, 2, 3, 4.
 == Dependency graphs ==
+== Creating Dependency Graphs ==
 Nanos6 allows for a graphical representation of data dependencies to be extracted. In order to generate said graph, run the application with the **NANOS6** environment variable set to **graph**:
 …
 The result will be a PDF file with several pages, each representing the graph at a certain point in time. For best results, we suggest to display the PDF with **single page** view, showing a full page and to advance page by page.
 == Obtaining statistics ==
+== Obtaining Statistics ==
 Another equally interesting feature of Nanos6 is obtaining statistics. To do so, simply run the application as:
 …
 ----
 = multisaxpy benchmark (!OmpSs-2) =
+= Multisaxpy Benchmark (!OmpSs-2) =
 Users must clone/download this example's repository from [https://pm.bsc.es/gitlab/ompss-2/examples/multisaxpy] and transfer it to a DEEP working directory.
 …
 There are **7 implementations** of this benchmark.
 == Execution instructions ==
+== Execution Instructions ==
 `./multisaxpy SIZE BLOCK_SIZE INTERATIONS`
 …
 * `ITERATIONS` is the number of times the SAXPY operation is executed.
 == Example output ==
+== Example Output ==
 {{{
 …
 = dot-product benchmark (!OmpSs-2) =
+= Dot-product Benchmark (!OmpSs-2) =
 Users must clone/download this example's repository from [https://pm.bsc.es/gitlab/ompss-2/examples/dot-product] and transfer it to a DEEP working directory.
 …
 There are **3 implementations** of this benchmark.
 == Execution instructions ==
+== Execution Instructions ==
 `./dot_product SIZE CHUNK_SIZE`
 …
 = mergesort benchmark (!OmpSs-2) =
+= Mergesort Benchmark (!OmpSs-2) =
 Users must clone/download this example's repository from [https://pm.bsc.es/gitlab/ompss-2/examples/mergesort] and transfer it to a DEEP working directory.
 …
 There are **6 implementations** of this benchmark.
 == Execution instructions ==
+== Execution Instructions ==
 `./mergesort N BLOCK_SIZE`
 …
 = nqueens benchmark (!OmpSs-2) =
+= Nqueens Benchmark (!OmpSs-2) =
 Users must clone/download this example's repository from [https://pm.bsc.es/gitlab/ompss-2/examples/nqueens] and transfer it to a DEEP working directory.
 …
 There are **7 implementations** of this benchmark.
 == Execution instructions ==
+== Execution Instructions ==
 `./n-queens N [threshold]`
 …
 = matmul benchmark (!OmpSs-2) =
+= Matmul Benchmark (!OmpSs-2) =
 Users must clone/download this example's repository from [https://pm.bsc.es/gitlab/ompss-2/examples/matmul] and transfer it to a DEEP working directory.
 …
 There are **3 implementations** of this benchmark.
 == Execution instructions ==
+== Execution Instructions ==
 `./matmul N M P BLOCK_SIZE`
 …
 = Cholesky benchmark (!OmpSs-2+MKL) =
+= Cholesky Benchmark (!OmpSs-2+MKL) =
 Users must clone/download this example's repository from [https://pm.bsc.es/gitlab/ompss-2/examples/cholesky] and transfer it to a DEEP working directory.
 …
 For the graph instrumentation, it is recommended to view the resulting PDF in single page mode and to advance through the pages. This will show the actual instantiation and execution of the code. For the extrae instrumentation, extrae must be loaded and available at least through the `LD_LIBRARY_PATH` environment variable.
 == Execution instructions ==
+== Execution Instructions ==
 `./cholesky SIZE BLOCK_SIZE`
 …
 = nbody benchmark (MPI+!OmpSs-2+TAMPI) =
+= Nbody Benchmark (MPI+!OmpSs-2+TAMPI) =
 Users must clone/download this example's repository from [https://pm.bsc.es/gitlab/ompss-2/examples/nbody] and transfer it to a DEEP working directory.
 …
 The interoperability versions (MPI+!OmpSs-2+TAMPI) are compiled only if the environment variable `TAMPI_HOME` is set to the Task-Aware MPI (TAMPI) library's installation directory.
 == Execution instructions ==
+== Execution Instructions ==
 The binaries accept several options. The most relevant options are the number
 …
 = heat benchmark (MPI+!OmpSs-2+TAMPI) =
+= Heat Benchmark (MPI+!OmpSs-2+TAMPI) =
 Users must clone/download this example's repository from [https://pm.bsc.es/gitlab/ompss-2/examples/heat] and transfer it to a DEEP working directory.
 …
 The interoperability versions (MPI+!OmpSs-2+TAMPI) are compiled only if the environment variable `TAMPI_HOME` is set to the Task-Aware MPI (TAMPI) library's installation directory.
 == Execution instructions ==
+== Execution Instructions ==
 The binaries accept several options. The most relevant options are the size
 …
 ----
 = krist benchmark (!OmpSs-2+CUDA) =
+= Krist Benchmark (!OmpSs-2+CUDA) =
 Users must clone/download this example's repository from [https://pm.bsc.es/gitlab/ompss-2/examples/krist] and transfer it to a DEEP working directory.
 …
 There are **2 implementations** of this benchmark, ''krist'' and ''krist-unified'' using regular and unified CUDA memory, repectively.
 == Execution instructions ==
+== Execution Instructions ==
 `./krist N_A N_R`