Changes between Version 7 and Version 8 of Public/User_Guide/Offloading_hybrid_apps
- Timestamp:
- Sep 17, 2019, 2:55:07 PM (5 years ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
Public/User_Guide/Offloading_hybrid_apps
v7 v8 157 157 simulated). For this reason, the application provides `-g` option in order to 158 158 control the maximum number of GPU processes. By default, the number of GPU processes 159 will be half of the total number of processes. 159 will be half of the total number of processes. Also note that the non-CUDA variants 160 cannot compute kernels on the GPU. In these cases, the structure of the application 161 is kept but the CUDA tasks are replaced by regular CPU tasks. 160 162 161 Also note that the non-CUDA variants cannot compute kernels on the GPU. In this 162 cases, the structure of the application is kept but the CUDA tasks are replaced 163 by regular CPU tasks. 163 Finally, the OpenMP variants can be executed similarly, but setting the `OMP_NUM_THREADS` 164 to the corresponding number of CPUs per process. As an example, we could execute the 165 following command: 164 166 167 {{{#!bash 168 $ OMP_NUM_THREADS=24 srun -n 4 -c 24 ./nbody.tampi.omp.2048bs.bin -t 100 -p 8912 -g 2 169 }}} 165 170 166 171 == References ==