All Systems Operational

About This Site

New Zealand eScience Infrastructure High Performance Compute and Storage Service Status

Apply for Access Operational
Data Transfer Operational
Submit new HPC Jobs Operational
Jobs running on HPC Operational
NeSI OnDemand Operational
99.95% uptime over the past 90 days
HPC Storage Operational
User Support System Operational
Flexible High Performance Cloud Operational
Long-term Storage (Freezer) Operational
99.92% uptime over the past 90 days
Flexible High Performance Cloud Services Operational
99.99% uptime over the past 90 days
Virtual Compute Service Operational
Bare Metal Compute Service Operational
FlexiHPC Dashboard (web interface) Operational
100.0% uptime over the past 90 days
FlexiHPC CLI interface Operational
100.0% uptime over the past 90 days
Public API of the FlexiHPC Service Operational
99.99% uptime over the past 90 days
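
The 90-day uptime percentages listed above can be turned into an approximate amount of downtime. The following is a minimal sketch, assuming each percentage covers a full 90-day window and is rounded to two decimal places on this page, so the result is only an estimate. The function name `downtime_minutes` is illustrative and not part of any NeSI tooling.

```python
def downtime_minutes(uptime_percent: float, window_days: int = 90) -> float:
    """Estimate downtime in minutes implied by an uptime percentage.

    Assumes the percentage covers the whole window (90 days by default).
    """
    if not 0.0 <= uptime_percent <= 100.0:
        raise ValueError("uptime_percent must be between 0 and 100")
    total_minutes = window_days * 24 * 60  # minutes in the window
    return total_minutes * (100.0 - uptime_percent) / 100.0


if __name__ == "__main__":
    # Percentages taken from the component list on this page.
    for pct in (99.95, 99.92, 99.99, 100.0):
        print(f"{pct}% uptime -> about {downtime_minutes(pct):.1f} minutes of downtime")
```

For example, 99.95% uptime over 90 days corresponds to roughly 65 minutes of downtime.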
Status legend: Operational, Degraded Performance, Partial Outage, Major Outage, Maintenance

Scheduled Maintenance

WEKA filesystem and compute core changes - Announcement Sep 8, 2025 09:00-17:00 NZST

To get our new WEKA filesystems up to their best possible performance, we need to dedicate some cores on each compute node for exclusive use by WEKA. As a result, Slurm jobs will only be able to use 126 cores per node rather than 128 on Milan nodes, and 166 rather than 168 on Genoa nodes.
This change is already under way: it has been applied to all of the Milan nodes and the majority of the Genoa nodes. We expect to do the last of the Genoa nodes on September 8th.
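
Jobs that previously filled a whole node may now need more nodes. The following is a rough sketch of that arithmetic, assuming each task's CPUs must fit within a single node and that tasks are packed as densely as possible. The dictionary keys `"milan"` and `"genoa"` and the function `nodes_needed` are illustrative names, not Slurm partition names or NeSI tooling.

```python
import math

# Cores available to Slurm jobs per node after the WEKA reservation
# (figures from the announcement above; the keys are illustrative labels).
USABLE_CORES = {"milan": 126, "genoa": 166}


def nodes_needed(total_tasks: int, cpus_per_task: int, node_type: str) -> int:
    """Return the minimum number of nodes needed for a job.

    Assumes every task runs entirely on one node and tasks are
    packed as densely as the usable core count allows.
    """
    usable = USABLE_CORES[node_type]
    if cpus_per_task > usable:
        raise ValueError(
            f"A task needing {cpus_per_task} CPUs cannot fit on a "
            f"{node_type} node with {usable} usable cores"
        )
    tasks_per_node = usable // cpus_per_task
    return math.ceil(total_tasks / tasks_per_node)


if __name__ == "__main__":
    # 128 single-core tasks used to fit on one Milan node; now they need two.
    print(nodes_needed(128, 1, "milan"))
    # Four 64-core tasks used to pack two per Milan node; now only one fits per node.
    print(nodes_needed(4, 64, "milan"))
```

Under these assumptions, job scripts that request exactly 128 cores per Milan node or 168 per Genoa node would need to be adjusted to the new limits.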

Posted on Sep 04, 2025 - 12:10 NZST

my.nesi.org.nz system update Oct 28, 2025 16:00-17:30 NZDT

We will be undergoing scheduled maintenance during this time.
Posted on Oct 13, 2025 - 10:45 NZDT
Oct 18, 2025

No incidents reported today.

Oct 17, 2025

No incidents reported.

Oct 16, 2025

No incidents reported.

Oct 15, 2025
Completed - The scheduled maintenance has been completed.
Oct 15, 10:57 NZDT
Update - Scheduled maintenance is still in progress. We will provide updates as necessary.
Oct 15, 10:56 NZDT
In progress - Scheduled maintenance is currently in progress. We will provide updates as necessary.
Oct 15, 09:30 NZDT
Scheduled - We will be undergoing scheduled maintenance during this time to update the system.
Oct 14, 09:10 NZDT
Oct 14, 2025

No incidents reported.

Oct 13, 2025

No incidents reported.

Oct 12, 2025

No incidents reported.

Oct 11, 2025

No incidents reported.

Oct 10, 2025

No incidents reported.

Oct 9, 2025

No incidents reported.

Oct 8, 2025
Resolved - We have made various changes to improve the WEKA filesystem performance, so we consider this to be resolved. However, should you experience any I/O slowdowns, please advise us via support@nesi.org.nz.
Oct 8, 16:27 NZDT
Update - We are continuing to tune the filesystem and monitor performance. We have had occasional reports of slow interactive metadata performance (e.g. when extracting many files from a bundle/archive or pulling code from a remote git repository). These issues appear to be limited to specific nodes/clients and we have recently made changes on login03 which have improved performance on that primary login node. However, if you notice anything out of the ordinary please report it to Support.
Oct 1, 17:36 NZDT
Update - The filesystem has been stable today; however, several users have reported a degraded interactive I/O experience. We expect this is caused by ongoing heavy metadata load from the continuing background integrity check. Based on current progress, we unfortunately expect this to continue into next week.

There has been no major impact on jobs, though some workloads paused when trying to write to the filesystem during the incident, and as a result a few jobs have timed out. If you see this and need help resolving it, please contact support.

Aug 28, 16:08 NZST
Update - We are continuing to monitor for any further issues.
Aug 28, 00:16 NZST
Monitoring - Full filesystem functionality was restored at approximately 11pm NZST. The issue appears to have been triggered by a brief backend network disruption; WEKA support are investigating why the filesystem didn't recover automatically. Ongoing data integrity checks may impact I/O performance for a while longer.

Thankfully there seem to be no widespread job impacts, however we will check this more thoroughly in the morning and contact any users who may have had work impacted. Apologies again for the disruption (and goodnight)!

Aug 28, 00:16 NZST
Investigating - We have identified an ongoing issue with our high performance filesystems. This is impacting scratch/nobackup, project and home, and is likely impacting any new logins to the HPC and OnDemand services. At present, existing jobs are continuing to run and complete; however, we anticipate there may be job failures as a result of this problem. We are currently awaiting urgent vendor support. Apologies for the inconvenience and disruption; we'll update as soon as we know more.
Aug 27, 21:19 NZST
Oct 7, 2025

No incidents reported.

Oct 6, 2025

No incidents reported.

Oct 5, 2025

No incidents reported.

Oct 4, 2025

No incidents reported.