<p>11/05/2022<br />
Dear users,</p>
<p>This is to inform you that the DGX maintenance has been completed and the<br />
cluster has been back in production since yesterday evening.</p>
<p>During this maintenance:</p>
<p> * SLURM has been updated to version 21.08.8-2<br />
* a new version of the NVIDIA HPC SDK has been installed (2022 - 22.3)<br />
* the maximum wall time of the QoS "dgx_qos_sprod" has been extended from 12 hours to 48 hours<br />
* a new partition, "dgx_usr_preempt", has been defined. It is free of charge and there is no limit on the number of running jobs per user, but it has low priority and your jobs may be killed at any moment if a high-priority job requests the resources<br />
* the DGX User Guide has been updated; please visit the webpage<br /><a href="https://wiki.u-gov.it/confluence/display/SCAIUS/UG3.4%3A+DGX+A100+UserGuide">https://wiki.u-gov.it/confluence/display/SCAIUS/UG3.4%3A+DGX+A100+UserGuide</a></p>
<p>Best regards,</p>
<p>HPC User Support @CINECA</p>