2025
Status | Type | Start | End | Quarter | Scope | User Impact | Reason |
---|---|---|---|---|---|---|---|
Resolved | Full | 03 February 2025 08:00 | 03 February 2025 16:00 | 2024_q1 | Cirrus |
No login access No access to any data on the system Jobs will not run. |
Essential work on E1000 which hosts the cirrus work file system |
2024
Status | Type | Start | End | Quarter | Scope | User Impact | Reason |
---|---|---|---|---|---|---|---|
Resolved | Full | 3 June 2024 08:00 | 4 June 2024 11:00 | 2024_q2 | Cirrus |
No login access No access to any data on the system Jobs will not run. |
Cooling Ditribution Unit (CDU) maintenance for Cirrus. System will be brought back with new boot image which includes an updated CUDA driver. |
Completed | Full | 12 March 2024 09:00 | 12 March 2024 17:00 | 2024_q1 | Cirrus |
No login access No access to any data on the system Jobs will not run, and queued jobs will be deleted. |
Migration to E1000 including the change in authentication protocol and addition of new file system. |
2023
Status | Type | Start | End | Quarter | Scope | User Impact | Reason |
---|---|---|---|---|---|---|---|
Completed | Partial | 18 September 2023 09:00 | 22 September 2023 11.55 | 2023_q3 | Cirrus |
No login access No access to any data on the system Jobs will continue to run, and queued jobs will be started as usual The SAFE will be available during the outage but there will be reduced functionality due to the unavailability of the connection to ARCHER2 such as resetting of passwords or new account creation. |
Upgrade of network |
Completed | Full Maintenance | 2023-07-25 14:00 | 2023-07-26 09:20 | 2023_q3 | Cirrus | Cirrus will not be available to users. This includes the login nodes, compute nodes and access to the filesystems. We will notify users when Cirrus is returned to service. | A fix to the volume issue on the Cirrus CXFS /scratch file system which will be performed by the vendor, HPE. |
Planned | Scratch filesystem | 2023-05-15 16:00 | 2303-05-19 estimated | 2023_q2 | Cirrus /scratch |
The solid-state storage (/scratch) will be unavailable on Cirrus from Monday 15th May at 1600. We expect the disk to be unavailable until Friday 19th May but we will notify users once it is available again. This means that users will not be able to access any data on the /scratch filesystem during this time. |
The maintenance is to improve the resiliency and reliability of the solid-state storage (/scratch) by applying software updates, failover policy implementation and deploying additional packages. |
Planned | Partial Maintenance | 2023-02-07 09:00 | 2023-02-07 17:00 | 2023_q1 | Cirrus | CPU and GPU compute nodes will be unavailable. Login access and access to data will still be available. | Essential maintenance to the Cirrus liquid cooling system. |
2022
Status | Type | Start | End | Quarter | Scope | User Impact | Reason |
---|---|---|---|---|---|---|---|
Planned | Full Maintenance | 2022-12-07 09:00 | 2022-12-07 17:00 | 2022_q4 | Cirrus | Cirrus will not be available to users. This includes the login nodes, compute nodes and access to the filesystems. We will notify users when it is returned to service. | Upgrade to the slurm batch scheduler. |
Planned | Full Maintenance | 2022-02-21 09:00 | 2022-03-16 17:00 | 2022_q1 | Cirrus | There will be a full rebuild of the Cirrus Service. It will be unavailable during maintenance session. | Attach new storage storage, bring the system software up to date. |
2021
Status | Type | Start | End | Quarter | Scope | User Impact | Reason |
---|---|---|---|---|---|---|---|
Planned | Outage | 2021-12-01 09:00 | 2021-12-01 17:00 | 2021_q4 | Cirrus | Full system will be unavailable during maintenance session. | Third-party maintenance on cooling system. |
Completed | At-Risk | 2021-10-27 09:00 | 2021-10-27 17:00 | 2021_q4 | Cirrus | Period of up to 30mins when external connections are not possible. Compute nodes will continue to run jobs. | Network upgrade at the Advanced Computing Facility (ACF) |