Changes between Version 49 and Version 50 of Public/User_Guide/OmpSs-2


Ignore:
Timestamp:
Jun 17, 2019, 3:12:05 PM (5 years ago)
Author:
Pedro Martinez-Ferror
Comment:

Legend:

Unmodified
Added
Removed
Modified
  • Public/User_Guide/OmpSs-2

    v49 v50  
    88* [#UsingtheRepositories Using the Repositories]
    99* Examples:
    10   * [#AStep-By-StepDetailedGuidetoExecutetheMultisaxpyBenchmarkOmpSs-2 A Step-By-Step Detailed Guide to Execute the Multisaxpy Benchmark (OmpSs-2)]
    11   * [#Dot-productBenchmarkOmpSs-2 Dot-product Benchmark (OmpSs-2)]
    12   * [#MergesortBenchmarkOmpSs-2 Mergesort Benchmark (OmpSs-2)]
    13   * [#NqueensBenchmarkOmpSs-2 Nqueens Benchmark (OmpSs-2)]
    14   * [#MatmulBenchmarkOmpSs-2 Matmul Benchmark (OmpSs-2)]
     10  * [#AStep-By-StepDetailedGuidetoExecutetheMultisaxpyBenchmark A Step-By-Step Detailed Guide to Execute the Multisaxpy Benchmark]
     11  * [#Dot-productBenchmark Dot-product Benchmark]
     12  * [#MergesortBenchmark Mergesort Benchmark]
     13  * [#NqueensBenchmark Nqueens Benchmark]
     14  * [#MatmulBenchmark Matmul Benchmark]
    1515  * [#CholeskyBenchmarkOmpSs-2MKL Cholesky Nenchmark (OmpSs-2+MKL)]
    1616  * [#NbodyBenchmarkMPI+OmpSs-2TAMPI Nbody Nenchmark (MPI+OmpSs-2+TAMPI)]
     
    156156----
    157157
    158 = A Step-By-Step Detailed Guide to Execute the Multisaxpy Benchmark (!OmpSs-2) =
     158= A Step-By-Step Detailed Guide to Execute the Multisaxpy Benchmark =
    159159
    160160Users must clone/download this example's repository from [https://pm.bsc.es/gitlab/ompss-2/examples/multisaxpy] and transfer it to a DEEP working directory.
     
    560560
    561561
    562 = Dot-product Benchmark (!OmpSs-2) =
     562= Dot-product Benchmark =
    563563
    564564Users must clone/download this example's repository from [https://pm.bsc.es/gitlab/ompss-2/examples/dot-product] and transfer it to a DEEP working directory.
     
    586586
    587587
    588 = Mergesort Benchmark (!OmpSs-2) =
     588= Mergesort Benchmark =
    589589
    590590Users must clone/download this example's repository from [https://pm.bsc.es/gitlab/ompss-2/examples/mergesort] and transfer it to a DEEP working directory.
     
    613613
    614614
    615 = Nqueens Benchmark (!OmpSs-2) =
     615= Nqueens Benchmark =
    616616
    617617Users must clone/download this example's repository from [https://pm.bsc.es/gitlab/ompss-2/examples/nqueens] and transfer it to a DEEP working directory.
     
    644644
    645645
    646 = Matmul Benchmark (!OmpSs-2) =
     646= Matmul Benchmark =
    647647
    648648Users must clone/download this example's repository from [https://pm.bsc.es/gitlab/ompss-2/examples/matmul] and transfer it to a DEEP working directory.
     
    708708* [https://pm.bsc.es/ftp/ompss-2/doc/examples/02-examples/cholesky-mkl/README.html]
    709709* [https://en.wikipedia.org/wiki/Eight_queens_puzzle]
    710 
    711 
    712 ----
    713 
    714 
    715 = Nbody Benchmark (MPI+!OmpSs-2+TAMPI) =
    716 
    717 Users must clone/download this example's repository from [https://pm.bsc.es/gitlab/ompss-2/examples/nbody] and transfer it to a DEEP working directory.
    718 
    719 == Description ==
    720 
    721 This benchmark represents an N-body simulation to numerically approximate the evolution of a system of bodies in which each body continuously interacts with every other body.  A familiar example is an astrophysical simulation in which each body represents a galaxy or an individual star, and the bodies attract each other through the gravitational force.
    722 
    723 There are **7 implementations** of this benchmark which are compiled in different
    724 binaries by executing the command `make`. These versions can be blocking,
    725 when the particle space is divided into smaller blocks, or non-blocking, when
    726 it is not.
    727 
    728 The interoperability versions (MPI+!OmpSs-2+TAMPI) are compiled only if the environment variable `TAMPI_HOME` is set to the Task-Aware MPI (TAMPI) library's installation directory.
    729 
    730 == Execution Instructions ==
    731 
    732 The binaries accept several options. The most relevant options are the number
    733 of total particles (`-p`) and the number of timesteps (`-t`). More options
    734 can be seen with the `-h` option. An example of execution could be:
    735 
    736 `mpiexec -n 4 -bind-to hwthread:16 ./nbody -t 100 -p 8192`
    737 
    738 in which the application will perform 100 timesteps in 4 MPI processes with 16 hardware threads in each process (used by the !OmpSs-2 runtime). The total number of particles will be 8192 so that each process will have 2048 particles (2 blocks per process).
    739 
    740 == References ==
    741 
    742 * [https://pm.bsc.es/gitlab/ompss-2/examples/nbody]
    743 * [https://en.wikipedia.org/wiki/N-body_simulation]
    744 
    745 
    746 ----
    747 
    748 
    749 = Heat Benchmark (MPI+!OmpSs-2+TAMPI) =
    750 
    751 Users must clone/download this example's repository from [https://pm.bsc.es/gitlab/ompss-2/examples/heat] and transfer it to a DEEP working directory.
    752 
    753 == Description ==
    754 
    755 This benchmark uses an iterative Gauss-Seidel method to solve the heat equation,
    756 which is a parabolic partial differential equation that describes the distribution of heat (or variation in temperature) in a given region over time. The heat equation is of fundamental importance in a wide range of science fields. In
    757 mathematics, it is the parabolic partial differential equation par excellence. In statistics, it is related to the study of the Brownian motion. Also, the diffusion equation is a generic version of the heat equation, and it is related to the study of chemical diffusion processes.
    758 
    759 There are **9 implementations** of this benchmark which are compiled in different
    760 binaries by executing the command `make`.
    761 
    762 The interoperability versions (MPI+!OmpSs-2+TAMPI) are compiled only if the environment variable `TAMPI_HOME` is set to the Task-Aware MPI (TAMPI) library's installation directory.
    763 
    764 == Execution Instructions ==
    765 
    766 The binaries accept several options. The most relevant options are the size
    767 of the matrix in each dimension (`-s`) and the number of timesteps (`-t`). More options can be seen with the `-h` option. An example of execution
    768 could be:
    769 
    770 `mpiexec -n 4 -bind-to hwthread:16 ./heat -t 150 -s 8192`
    771 
    772 in which the application will perform 150 timesteps in 4 MPI processes with 16
    773 hardware threads in each process (used by the !OmpSs-2 runtime). The size of the
    774 matrix in each dimension will be 8192 (8192^2^ elements in total), this means
    775 that each process will have 2048x8192 elements (16 blocks per process).
    776 
    777 == References ==
    778 
    779 * [https://pm.bsc.es/gitlab/ompss-2/examples/heat]
    780 * [https://pm.bsc.es/ftp/ompss-2/doc/examples/local/sphinx/04-mpi+ompss-2.html]
    781 * [https://en.wikipedia.org/wiki/Heat_equation]
    782 
    783 ----
    784 
    785 = Krist Benchmark (!OmpSs-2+CUDA) =
    786 
    787 Users must clone/download this example's repository from [https://pm.bsc.es/gitlab/ompss-2/examples/krist] and transfer it to a DEEP working directory.
    788 
    789 == Description ==
    790 
    791 This benchmark represents the krist kernel, which is used in crystallography to find the exact shape of a molecule using Rntgen diffraction on single crystals or powders.
    792 
    793 There are **2 implementations** of this benchmark, ''krist'' and ''krist-unified'' using regular and unified CUDA memory, repectively.
    794 
    795 == Execution Instructions ==
    796 
    797 `./krist N_A N_R`
    798 
    799 where:
    800 * `N_A` is the number of atoms (1000 by default).
    801 * `N_R` is the umber of reflections (10000 by default).
    802 
    803 == References ==
    804 
    805 * [https://pm.bsc.es/gitlab/ompss-2/examples/krist]