Changes between Version 54 and Version 55 of Public/User_Guide/PaS
- Timestamp:
- Sep 21, 2022, 1:35:36 PM (3 years ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
TabularUnified Public/User_Guide/PaS
v54 v55 3 3 This page is intended to give a short overview on known issues and to provide potential solutions and workarounds to the issues seen. 4 4 5 ''Last update: 2022-09- 15''5 ''Last update: 2022-09-21'' 6 6 7 7 … … 17 17 === CM nodes === 18 18 * dp-cn25: SEL ProblemsFW issues (#2769) 19 20 * dp-cn27: MCE Errors found (#2919) 19 * dp-cn30: Image update needed (#2991) 20 * dp-cn35: Image update needed (#3005) 21 * dp-cn36: Image update needed (fixed EM issue, see #2992) 22 * dp-cn37: Image update needed (fixed EM issue, see #2993) 23 * dp-cn[47-50]: BeeOnd testbed 21 24 22 25 === DAM nodes === 23 26 * dp-dam02: reserved for FPGA tests 24 27 * dp-dam03: PCI link speed degraded (#2931) 25 * dp-dam 10: PMEM module issue (#2875)28 * dp-dam08: no turbo mode (#2974) 26 29 * dp-dam16: testbed 27 30 28 31 === ESB nodes === 29 32 * dp-esb[07]: used for Rocky 8.6 tests 30 * dp-esb[11]: memory issues 33 * dp-esb[11]: memory issues (#2857) 34 * dp-esb[25]: Image update needed 35 * dp-esb[31]: GPU issues (#2949) 36 * dp-esb[47]: SEL Problems (#2998) 37 * dp-esb[61]: Eth connections issues (#3010) 38 * dp-esb[65]: Eth connection issues (#2978) 31 39 32 40 === SDV nodes === … … 37 45 * knl01: serves as golden client for imaging only 38 46 39 * dp-sdv-esb[01,02]: Slurm update required47 * dp-sdv-esb[01,02]: will only be powered on demand 40 48 41 49 == Software issues == 50 51 {{{#!comment solved with EB 2022 stage 42 52 === nvidia driver mismatch === 43 53 * loading CUDA module and trying to run `nvidia-smi` (or any application trying to use the GPU) leads to … … 48 58 * workaround is to unload the unload the driver module: `ml -nvidia-driver/.default` 49 59 * for furhter information, please also see [https://gitlab.jsc.fz-juelich.de/hps-public/easybuild-repository/-/wikis/Failed-to-initialize-NVML-Driver-library-version-mismatch-message here][[BR]] 60 }}} 61 50 62 51 63 === nvidia profiling tools === … … 60 72 * you will still see a warning "OpenGL Version check failed. Falling back to Mesa software rendering.", but the profling tool (e.g. `nsight-sys`) should start up 61 73 62 === Easybuild ===63 * Moving the new Easybuild stage 2022 (in February) might cause unexpected behavior and problems with the installed software components:64 65 {{{#!comment JK: invalid66 67 === GPU direct usage with Extoll on DAM ===68 * new Extoll driver for GPU direct over Extoll still shows low performance on the DAM nodes69 * available via Developer stage, for testing load:70 71 {{{72 ml --force purge73 ml use $OTHERSTAGES74 ml load Stages/Devel-202075 ml load Intel76 ml load ParaStationMPI77 }}}78 * expect performance (and maybe also stability) issues79 80 }}}