CHI@TACC NVIDIA GPU node unavailable

Resolved Posted by Cody Hammock on October 27, 2023
Outage start Thursday, October 26, 2023 10 p.m.
Expected end Monday, October 30, 2023 3:44 p.m.

Resolved The switch has been restored, and running instances have been reconnected.

The network switch connecting the nodes equiped with NVIDIA M40, K80, P100, and V100 GPUs has failed. A replacement switch is expected to be installed on Oct 30, 2023. 

Existing leases for the affected nodes will be extended to prevent instances from being shut down.