CRI Gardner upgrade news
Operating System Upgrade - The operating system will be upgraded from Red Hat Linux 6.7 to 7.6. This provides a newer kernel that supports a more modern software ecosystem. For example, software such as TensorFlow will not run on Red Hat 6.
GPFS Upgrade - GPFS storage clients will be upgraded from GPFS 4.2 to GPFS 5. This will improve the performance of metadata operations such as creating, listing, and deleting files.
SLURM Scheduling - The Torque/Moab scheduling on the system will be replaced with SLURM. SLURM is an open-source scheduler that has become a de facto standard across many HPC sites.
Deep Learning capabilities - Last year, the CRI purchased two deep learning servers with 8 NVIDIA V100s per server. These servers have been open to users who requested access. With the upgrade, the deep learning systems will be added to the general scheduling queue.
Increased container capabilities - With the upgrade to Red Hat 7, users will be able to create their own Singularity containers.
Authentication/Authorization - The authentication/authorization clients we currently run are rather outdated. Upgrading them should provide a more stable environment for logging in to the cluster and accessing files.
Upgraded compilers - The compilers on the cluster will be upgraded to the latest versions: gcc-10.2.0, llvm-11.0, intel-2020.2, and nvhpc-20.9. These compilers provide implementations of the latest standards for C, C++, and Fortran.
Enhanced accounting - We will provide accounting for both jobs submitted to the cluster and software use across the environment.
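Because the move from Torque/Moab to SLURM means existing job scripts will need new scheduler directives, the sketch below shows how a few common `#PBS` lines could be rewritten as `#SBATCH` lines. It is only an illustration, not an official migration tool: the helper names (`translate_directive`, `translate_script`) are hypothetical, only a handful of directives are covered, and the assumption that a Torque queue name maps directly to a SLURM partition of the same name may not hold on this cluster. Anything the sketch does not recognize is left as a comment for manual review.

```python
import re

# Common Torque/PBS flags and their usual SLURM counterparts.
# Assumption: a Torque queue name maps to a SLURM partition of the same name.
SIMPLE_FLAGS = {
    "-N": "--job-name={}",
    "-q": "--partition={}",
    "-o": "--output={}",
    "-e": "--error={}",
}


def translate_directive(line):
    """Return the SLURM line(s) for one script line; non-#PBS lines pass through."""
    stripped = line.strip()
    if not stripped.startswith("#PBS"):
        return [line]
    parts = stripped.split(None, 2)  # ["#PBS", flag, value]
    if len(parts) < 3:
        return ["# TODO (untranslated): " + stripped]
    flag, value = parts[1], parts[2]
    if flag in SIMPLE_FLAGS:
        return ["#SBATCH " + SIMPLE_FLAGS[flag].format(value)]
    if flag == "-l":
        out = []
        for resource in value.split(","):
            match = re.fullmatch(r"nodes=(\d+)(?::ppn=(\d+))?", resource)
            if match:
                out.append("#SBATCH --nodes=" + match.group(1))
                if match.group(2):
                    out.append("#SBATCH --ntasks-per-node=" + match.group(2))
            elif resource.startswith("walltime="):
                out.append("#SBATCH --time=" + resource.split("=", 1)[1])
            else:
                # Memory and other resources differ in meaning; review by hand.
                out.append("# TODO (untranslated): #PBS -l " + resource)
        return out
    return ["# TODO (untranslated): " + stripped]


def translate_script(text):
    """Translate every line of a Torque job script and rejoin the result."""
    out = []
    for line in text.splitlines():
        out.extend(translate_directive(line))
    return "\n".join(out)


if __name__ == "__main__":
    torque_script = "\n".join([
        "#!/bin/bash",
        "#PBS -N align",
        "#PBS -l nodes=2:ppn=4,walltime=01:00:00",
        "./run_alignment.sh",  # hypothetical user command
    ])
    print(translate_script(torque_script))
```

On the command line, the usual counterparts are `sbatch` for `qsub`, `squeue` for `qstat`, and `scancel` for `qdel`.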