Changes between Version 7 and Version 8 of Public/User_Guide/Offloading_hybrid_apps


Ignore:
Timestamp:
Sep 17, 2019, 2:55:07 PM (5 years ago)
Author:
Kevin Sala
Comment:

Legend:

Unmodified
Added
Removed
Modified
  • Public/User_Guide/Offloading_hybrid_apps

    v7 v8  
    157157simulated). For this reason, the application provides `-g` option in order to
    158158control the maximum number of GPU processes. By default, the number of GPU processes
    159 will be half of the total number of processes.
     159will be half of the total number of processes. Also note that the non-CUDA variants
     160cannot compute kernels on the GPU. In these cases, the structure of the application
     161is kept but the CUDA tasks are replaced by regular CPU tasks.
    160162
    161 Also note that the non-CUDA variants cannot compute kernels on the GPU. In this
    162 cases, the structure of the application is kept but the CUDA tasks are replaced
    163 by regular CPU tasks.
     163Finally, the OpenMP variants can be executed similarly, but setting the `OMP_NUM_THREADS`
     164to the corresponding number of CPUs per process. As an example, we could execute the
     165following command:
    164166
     167{{{#!bash
     168$ OMP_NUM_THREADS=24 srun -n 4 -c 24 ./nbody.tampi.omp.2048bs.bin -t 100 -p 8912 -g 2
     169}}}
    165170
    166171== References ==