RCS System Outage

As of June 29th, 8am RCS systems are steadily coming back online. It is currently unknown when all systems will be fully operational and it may extend past to the previous estimate of June 29th, 9am.

We will distribute further notifications as we assess our systems and can give a concrete estimate of when each system will be back online.

RCS System Outage

There was an unplanned power outage in the UAF Butro Data Center this morning. OIT and Facilities Services have replaced the critical equipment.

This was a hard power failure and Research Computing Systems (RCS) is currently assessing the impacts to our hardware and services.
Network on UAF campus has been restored and all RCS HPC, storage, and web services are planned to be back online by 9 AM AKST, June 29, 2017.

We will distribute notifications as more information is available.

Unplanned Chinook Outage

The $CENTER1 Lustre filesystem became temporarily unavailable to the Chinook compute nodes on May 11th around 3pm AKDT, causing some submitted jobs to fail immediately. To resolve this issue the job partitions were taken down, and any submitted jobs were placed into a waiting queue until the partitions were brought back online.

Any jobs that were in the process of running during that timeframe should be unaffected.

Pages

Subscribe to RSS - Outage