550 nodes went down at 00:00 Monday morning. We are investigating the issue and will bring nodes back online as soon as possible
Some of the Lustre object storage servers crashed during the night, making parts of the /cluster file system unaccessible. We working on the problem and will keep you updated.
A sudden high pressure in the cooling system around 13 o’clock, has taken one of the cooling units down. Starting it back affected the other unit as well.
This triggered a safety stop for some of the computing nodes, leading to premature crash for some of the running jobs.
Affected jobs has been re-queued.
Apologies for the inconvenience it has created.
- 2018-06-12 17:16 Access it NIRD is reopened now.
- 2018-06-12 16:05 Service are started and back in production on NIRD Service Platform.
- 2018-06-12 15:55 Queue reservation is now removed and jobs are running on part of Fram. Rest of the nodes will be added back to the queue as soon as they are updated.
- 2018-06-12 14:55 Access is re-opened to Fram. Queue reservation is still in place.
- 2018-06-12 08:34 Maintenance has started.
Dear Fram and NIRD user,
We will have a one day planned maintenance on 12th of June starting from 08:30 AM.
Fram, NIRD and the Service Platform will be affected. One storage enclosure must be replaced, needing downtime for the file systems served from NIRD.
There is a system reservation in place on Fram starting on 12.06.2018 08:45 AM. Jobs not being able to finish before the maintenance window, will be left pending in the queue with a Reason “ReqNodeNotAvail” and will be started when the maintenance is over.
We will keep you updated via OpsLog/Twitter.
Thank you for your consideration!
Update 15:50: NIRD login node is up again and user access reopened.
We have to urgently reboot the NIRD login node.
This post will be updated when login to NIRD is possible again.
As many of you know, we have a special setup for development jobs, i.e., short jobs meant for quick development. Now, we see that it is quite challenging to fulfill all development needs with one permanent setup. Hence, if you have proven needs for development of a temporary nature, and those needs do not fit in the devel QoS (https://documentation.sigma2.no/jobs/jobtypes.html), please contact us at email@example.com and we will try to help you.
We will have a four hour maintenance on the cooling system for Fram on 16th of May from 09:00. To limit cooling requirements, only half of the compute nodes will be operational during this time.
This might lead to longer queue times.
Thank you for your understanding!
Some of the compute nodes and additionally Fram login nodes lost connection to the NFS mounted $HOME.
Login nodes were rebooted to cleanup hanging processes and blocking I/O.
We are investigating this issue and working on a solution.
Thank you for your understanding!
Dear NIRD Users,
It is with great excitement that we in UNINETT Sigma2 hereby announce the launch of the easyDMP, a new service that offers researchers, with minimal experience in data management, a simple way of creating a Data Management Plan (DMP). This is achieved by transforming any funding agency’s or institution’s data management guidelines and policies into a series of easy to answer questions, many containing a simple list of canned answers to pick from. The resulting plan can be used as a blueprint for researchers to put in place the necessary elements that ensure their data are adequately managed. The plan can be edited and shared, and also duplicated to serve as a starting point for other datasets.
EasyDMP is free of charge and available to any researcher in Norway and in Europe:
EasyDMP has been developed and is operated by Sigma2 in collaboration with the EUDAT2020 project. EasyDMP presently implements the EU H2020 recommendations, but the service has been design to easily integrate other schemas, for example institutional specific recommendations. Please do not hesitate to contact us if you want to integrate the easyDMP with your own tailored DMP questionnaire scheme.
Improvements to the tool will be driven by your needs. Thanks to the continuous deployment method, the easyDMP service will be adding new functionalities continuously. We can already anticipate that the next release will have functionality that enables other services to make use of the plan output in compliance with the FAIR principles.
We are now working to establish an external reference group for the service, that will include experts from user communities, librarians and curators and national service providers. This because we really believe that the easyDMP service will benefit from a wide national pool of competence and stakeholders.
Please do feel free to test it and start using it, and please do not hesitate to give us feedback at (support @easydmp.sigma2.no).
More info about easyDMP here:
We apologize for the long downtime and appreciate your patience.