Vilje filesystem is back

Vilje filesystem has been fixed with good help from DDN and we are now open for business.

Please be aware that some files may have been lost.
Always back up your files.

Vilje is online

The infiniband error was due to a controller module with bad connection. This has been corrected.

The queueing system is back online. Also: 19 additional nodes has been recovered.

Three jobs were lost. We apologize for the inconvenience.

 

 

Vilje is back online

Vilje is online.

The outage was caused by the loss of infiniband connectivity/loss of two infiniband switches.

36 nodes will remain out of production.

There may still be dns issues with connectivity from innside the cluster to outside (i.e: licence server lookups). Please report any issues to: support@metacenter.no

 

550 nodes down on Fram

550 nodes went down at 00:00 Monday morning. We are investigating the issue and will bring nodes back online as soon as possible