Changes between Version 26 and Version 27 of Public/User_Guide/Batch_system
- Timestamp:
- Mar 11, 2020, 3:20:56 PM (4 years ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
Public/User_Guide/Batch_system
v26 v27 23 23 || dp-dam || dp-dam[01-16], DEEP-EST Dam nodes (Xeon Cascadelake + 1 V100 + 1 Stratix 10) || 24 24 || dp-dam-ext || dp-dam[09-16], DEEP-EST Dam nodes connected with Extoll Tourmalet || 25 || dp-esb || dp-esb[01-25], DEEP-EST Esb nodes (Xeon Cascadelake + 1 V100) || 25 26 || dp-sdv-esb || dp-sdv-esb[01-02], DEEP-EST ESB Test nodes (Xeon Cascadelake + 1 V100) || 26 27 || ml-gpu || ml-gpu[01-03], GPU test nodes for ML applications (up to 4 V100 cards) || … … 57 58 First, start a shell on a node. You would like to run your mpi task on 4 machines with 2 tasks per machine: 58 59 {{{ 59 [kreutz1@deepv /p/project/cdeep/kreutz1/Temp]$ srun - p dp-cn -N 4 -n 8 -t 00:30:00 --pty /bin/bash -i60 [kreutz1@deepv /p/project/cdeep/kreutz1/Temp]$ srun -A deep -p dp-cn -N 4 -n 8 -t 00:30:00 --pty /bin/bash -i 60 61 [kreutz1@dp-cn01 /p/project/cdeep/kreutz1/Temp]$ 61 62 }}} 62 63 63 The environment is transported to the remote shell, no {{{.profile}}}, {{{.bashrc}}}, ... are sourced (especially not the modules default from {{{/etc/profile.d/modules.sh}}}). 64 The environment is transported to the remote shell, no {{{.profile}}}, {{{.bashrc}}}, ... are sourced (especially not the modules default from {{{/etc/profile.d/modules.sh}}}). As of March 2020, an account has to be specified using the `--account` (short `-A`) option, which is "deep" for the project members. For people not included in the DEEP-EST project, please use the "Budget" name you received along with your account creation. 64 65 65 66 Once you get to the compute node, start your application using {{{srun}}}. Note that the number of tasks used is the same as specified in the initial {{{srun}}} command above (4 nodes with two tasks each): 66 67 {{{ 67 [kreutz1@deepv Temp]$ srun - p dp-cn -N 4 -n 8 -t 00:30:00 --pty /bin/bash -i68 [kreutz1@deepv Temp]$ srun -A deep -p dp-cn -N 4 -n 8 -t 00:30:00 --pty /bin/bash -i 68 69 [kreutz1@dp-cn01 Temp]$ srun ./MPI_HelloWorld 69 70 Hello World from rank 3 of 8 on dp-cn02 … … 84 85 85 86 {{{ 86 [kreutz1@deepv Temp]$ srun - p dp-cn -N 4 -n 8 -t 00:30:00 ./MPI_HelloWorld87 [kreutz1@deepv Temp]$ srun -A deep -p dp-cn -N 4 -n 8 -t 00:30:00 ./MPI_HelloWorld 87 88 Hello World from rank 7 of 8 on dp-cn04 88 89 Hello World from rank 3 of 8 on dp-cn02 … … 98 99 99 100 {{{ 100 [kreutz1@deepv Temp]$ salloc - p dp-cn -N 4 -n 8 -t 00:30:00101 [kreutz1@deepv Temp]$ salloc -A deep -p dp-cn -N 4 -n 8 -t 00:30:00 101 102 salloc: Granted job allocation 69263 102 103 [kreutz1@deepv Temp]$ srun ./MPI_HelloWorld … … 125 126 126 127 #SBATCH --partition=dp-cn 128 #SBATCH -A deep 127 129 #SBATCH -N 4 128 130 #SBATCH -n 8 … … 168 170 169 171 {{{ 170 srun -- partition=dp-cn -N 1 -n 1 hostname : --partition=dp-dam -N 1 -n 1 hostname172 srun --account=deep --partition=dp-cn -N 1 -n 1 hostname : --partition=dp-dam -N 1 -n 1 hostname 171 173 dp-cn01 172 174 dp-dam01