Skip to main content

47 posts tagged with "Maintenance"

View All Tags

· 2 min read

Publication date: November 9, 2022

The scheduled maintenance of the NIG supercomputer is scheduled on the following date and time in accordance with the legal power outage of the NIG. The supercomputer will not be available during the scheduled maintenance.

Period

December 2, 17:00 - December 8, 2022, 00:00(24h)

Work schedule

  • 12/2(Fri.) 17:00~  Supercomputer outage
  • 12/3(Sat.)      Legal power outage
  • 12/4(Sun.)~12/7(Wed.) Supercomputer scheduled maintenance work (UPS maintenance, Lustre maintenance, software updates, etc.)
  • 12/8(Thu.) is a spare day.

Main contents of Scheduled Maintenance

  1. Run yum update on all compute nodes to get the latest version of Cent OS 7.9.

  2. Singularity has been renamed Apptainer. Along with this, the following updates will be made.

  • Current Singularity 3.8.7 ⇒ Apptainer 1.1
  • SingularityCE 3.10.2 added
  • For more information, refer to the Apptainer description page.
  1. unification of SSH public key reflection flow

Until now, when the SSH public key was registered, it took about 24 hours to be reflected in the gateway gw.ddbj.nig.ac.jp in the general analysis division, and about 10 minutes in gw2.ddbj.nig.ac.jp. After the scheduled maintenance in December, the processing flow will be unified and both gw and gw2 will be reflected in about 10 minutes.

Notes

  • Running jobs will be deleted, so please resubmit jobs after the scheduled maintenance.

· One min read

Publication date: October 20, 2022

The Lustre7 high-speed storage system in the General Analysis division has experienced an equipment failure, and as of 11:41 a.m. on Thursday, October 21, there has been no impact on users. The equipment will be replaced at the following time and date.

Date

Friday, October 21, 2022 10:00 - 12 noon (24h notation)

Scope of impact

  • In the general analysis division, I/O suspensions to Lustre7 are expected to occur before and after the work for approximately 4 minutes each.
  • The personal genome analysis division will not be affected.
  • DDBJ services and other services are not affected.

· One min read

Publication date: October 6, 2022

Following several incidents of poor performance of the Lustre8 high-speed storage system for the personal genome analysis division, the MDS of Lustre8 will be upgraded to prevent this.

  • The update was completed at around 10:40.
  • I/O suspension occurred between 10:19 and 10:23 due to this work.

Date

Thursday, October 6, 2022 10:00 - 12 noon (24h notation)

Scope of impact

  • The personal genome analysis division will suspend I/O for a few minutes during the work. I/O will be automatically restored after the work is completed.
  • The general analysis division will not be affected.
  • DDBJ services and other services are not affected.

· One min read

Due to the NIG's mail server was down, token codes for connecting to the VPN was not sent during the following period.

22:50, Saturday, August 27 - around 9:00, Monday, August 29 (24h)

The problem was restored at around 9:00 on 29 Aug (Mon).

Please try the VPN connection again.

Scope of impact

  • The general analysis division and the personal gemone analysis division will not be affected.
  • It is possible to log in to the supercomputer. There is no impact on runnning jobs, etc.

We apologize for the inconvenience.

· One min read

Publication date: August 23, 2022

Due to SINET6 maintenance, the network will be temporarily out of service during the following time period.

  • Date: Monday, September 12, 2022 01:30 - 03:00 (24h notation)

    • Communication interruptions of 15 minutes will occur 1-3 times during the above time period.
  • Scope of Impact

    • During the communication breakdown, login to the supercomputer and data transfer operations will not be available.
    • No jobs in operation will be suspended.

We appreciate your understanding and cooperation.

· One min read

Publication date: August 1, 2022

Overview

Due to the firmware configuration of the NIG supercomputer, the application system for use will not be available during the following period.

Work schedule

Monday, August 1, 2022, 10:00-13:00(24h)

Scope of impact

· One min read

From 01:50 Saturday, July 9, a failure occurred in the application for new use system. The application for new use could not be accepted.

The system has been restored at 16:30 on 11 July.

Scope of impact

  • An error occurred during the last data transmission when trying to apply for use in the "the application for new use system".
  • The general analysis division and the personal gemone analysis division will not be affected.
  • It is possible to log in to the supercomputer. There is no impact on runnning jobs, etc.

We apologize for the inconvenience.

· One min read

Publication date: 2022年6月15日

Overview

Currently, we are working on the renewal of the account application system of the NIG supercomputer. Therefor, we will suspend the acceptance of application during the following period in order to perform the data migration.

Suspension period

Friday, June 17th - Tuesday, 21th, 2022 Monday, July 4th, 2022

Scope of Impact

  • There will be no system outages or other impacts associated with this work.

· One min read

Publication date: June 8, 2022

Overview

Currently, we are working on the development to renew the user registration and year-end renewal system of the NIG supercomputer. To test the system migration on the general analysis division, gateway gw2.ddbj.nig.ac.jp, you may not be able to login from gw2, the network may be irregularly interrupted and etc..

Work schedule

Wednesday, June 8th - Tuesday, 28th, 2022 (until regular maintenance)

Scope of Impact

  • One of the general analysis division gateways, gw2.ddbj.nig.ac.jp, can be affected.
  • The general analysis division, gw.ddbj.nig.ac.jp, will not be affected.
  • The personal genome analysis division will not be affected.
  • There will be no system outages or other impacts associated with this work.

· One min read

Publication date: May 19, 2022

On Thursday, May 19 at 0:36, one of the Lustre8 controllers has detected a failure. Accordingly, this controller status is Failover.

The controller will be replaced at the following date and time.

  • Date and time: 14:00 - 16:00, Friday, May 20, 2022

    • I/O will be suspended for up to 15 minutes during the above time period.
  • Scope of impact

    • During the above time period, Lustre8 in the personal genome analysis division will suspend I/O for a few minutes, but I/O will resume automatically.
    • The general analysis division etc. will not be affected.
    • There will be no suspension of active jobs.

Thank you for your understanding and cooperation.