The NIG Supercomputer

Advance Reservation

Overview

Advance Reservation is a service that lets users reserve a range of resources in a Grid Engine queue in advance and then use the reserved area exclusively.

Note: this service is a billing service, so you must submit a usage plan before you use it.

The red dotted square shows the resources held by the advance reservation (number of slots on the compute nodes × usage time), and the other squares show the resources occupied by jobs (number of slots used on compute nodes × calculation time). The user who made the reservation can submit jobs into the red square area.

(Users of the Advance Reservation Service can still submit ordinary jobs with the qsub command, without using the reserved quota for computational resources, in the same way as users who do not use the service.)

About the billing period

Instructions for using the Advance Reservation Service

To use the Advance Reservation Service, follow the steps below.

  1. Submit the usage plan
    • This service is a billing service, and you must submit a usage plan before you start using it. Refer to "How to use billing service" for details on the process.
  2. Get the reserved quota for computational resources
    • Get the reserved quota for computational resources with the QRSUB command.
  3. Confirm the reserved quota for computational resources
    • Confirm the reserved quota for computational resources with the qrstat command.
  4. Execute jobs
    • After getting the reserved quota, execute jobs with the qsub command, specifying the ar-id (reservation number) with the -ar option.
  5. Delete the reserved quota for computational resources
    • Delete the reserved quota for computational resources with the QRDEL command.

The QRSUB and QRDEL commands are wrapper programs around Grid Engine's qrsub and qrdel commands, created to calculate billing on the NIG supercomputer. The options of the qrsub and qrdel commands are available in QRSUB and QRDEL, respectively. For the options of each command, refer to the Grid Engine User's Manual (English only).

How to get the reserved quota for computational resources

To get the reserved quota, you can either ask the supercomputer SE to reserve it or reserve it yourself with the command.

Depending on the jobs currently running, QRSUB may fail because no resources are available. In that case, move the reservation to a later date and time and execute the QRSUB command again.

Execution example (1)

This example reserves computational resources on medium.q with one node, 4 GB of memory per slot, and 3 slots, from 13:15 on April 2, 2020 to 10:00 on May 4, 2020. The reservation allocates 12 GB of memory (4 GB of memory × 3 slots), so that much memory must be available when you make the reservation.

QRSUB -l medium -l mem_req=4G -l s_vmem=4G -pe def_slot 3 -a 202004021315 -e 202005041000
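The arithmetic behind this example can be sketched in shell. This is a minimal sketch, not part of the NIG tooling: reserved_mem_gb and print_qrsub are hypothetical helper names, and print_qrsub only prints the command line without running QRSUB.

```shell
# Hypothetical helper: total memory (GB) claimed by a reservation.
# $1 = memory per slot in GB (mem_req / s_vmem), $2 = number of slots
reserved_mem_gb() {
    echo $(( $1 * $2 ))
}

# Hypothetical helper: print (do not run) the QRSUB command for medium.q.
# $1 = GB per slot, $2 = slots, $3 = start time, $4 = end time
print_qrsub() {
    echo "QRSUB -l medium -l mem_req=${1}G -l s_vmem=${1}G -pe def_slot $2 -a $3 -e $4"
}

reserved_mem_gb 4 3                          # prints 12
print_qrsub 4 3 202004021315 202005041000    # prints the command shown above
```

Printing the command first makes it easy to check the total memory against what is currently free before actually making the reservation.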

Execution example (2)

This example reserves computational resources on medium.q for an MPI job that spans multiple nodes, from 00:00 on October 10 to 07:05 on October 20. The command below requests a parallel range of 2-10 and does not specify memory, so enough memory for the granted slots at the default per-slot value must be available when you make the reservation.

QRSUB -l medium -pe mpi 2-10 -a 10100000 -e 10200705

* Memory is not specified in this command, so the default value is used.

Execution Result

When the QRSUB command completes successfully, the following message is displayed and an ar-id (4-digit reservation number) is issued.

Your advance reservation XXXX has been granted

XXXX: ar-id (reservation number).

When the QRSUB command does not complete successfully, check the specified options and try again.
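Since the ar-id is needed later for qrstat, qsub and QRDEL, it can be convenient to capture it from the success message. The following is a minimal sketch, assuming the message has exactly the form shown above; extract_ar_id is a hypothetical helper name and 1234 is a made-up sample value.

```shell
# Hypothetical helper: read QRSUB output on stdin and print the ar-id.
# Prints nothing if the "has been granted" message is not found.
extract_ar_id() {
    sed -n 's/^Your advance reservation \([0-9][0-9]*\) has been granted.*$/\1/p'
}

# Demonstration with a sample message (1234 is a made-up ar-id).
echo 'Your advance reservation 1234 has been granted' | extract_ar_id
```

An empty result means the reservation was not granted, in which case the options should be checked as described above.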

How to specify the reservation start time and end time

Specify the reservation start time and end time with the following arguments of the QRSUB command.

  -a  reservation start time
  -e  reservation end time

Specify each time in the following format.

  [[CC]YY]MMDDhhmm[.SS]

* CC: first two digits of the year, YY: last two digits of the year, MM: month, DD: day, hh: hour, mm: minute, SS: second
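The reservation examples earlier pass times such as 202004021315, that is, year, month, day, hour and minute concatenated. A minimal sketch of producing such a value from a more readable date follows. to_ar_time is a hypothetical helper name, and it assumes the input is written as "YYYY-MM-DD hh:mm"; seconds are not handled.

```shell
# Hypothetical helper: convert "YYYY-MM-DD hh:mm" into CCYYMMDDhhmm.
to_ar_time() {
    # Keep only the digits of the input.
    stamp=$(printf '%s' "$1" | tr -cd '0-9')
    # A full year-month-day-hour-minute value has exactly 12 digits.
    if [ "${#stamp}" -ne 12 ]; then
        echo "expected YYYY-MM-DD hh:mm, got: $1" >&2
        return 1
    fi
    printf '%s\n' "$stamp"
}

to_ar_time '2020-04-02 13:15'   # prints 202004021315
to_ar_time '2020-05-04 10:00'   # prints 202005041000
```

The helper only checks the number of digits, so it does not catch impossible dates such as month 13.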

How to check the status of the reserved quota

List the status of your reserved quotas

qrstat

Execution result

ar-id      name       owner        state start at             end at               duration
------------------------------------------------------------------------------------------
      0001            test-user   w     10/10/2017 00:00:00  10/20/2017 07:05:00  247:05:00
      0002            test-user   r     01/10/2017 00:00:00  10/20/2017 00:00:00  6792:00:00

Display the details of a reserved quota

qrstat -ar ar-id[,ar-id,ar-id...]

Execution Result

--------------------------------------------------------------------------------
id                             0001
name
owner                          test-user
state                          w
start_time                     10/10/2017 00:00:00.000
end_time                       10/20/2017 07:05:00.000
duration                       247:05:00.000
project
message                        XXXXXXXXXX
submission_time                03/22/2017 14:56:47.314
group                          test
account                        sge
resource_list                  month=TRUE, medium=TRUE, mem_req=3.1G, s_vmem=3.1G
free_resources                 false
reserve_available_only         true
granted_slots_list             month_medium.q@nm5i=1,month_medium.q@nm6i=1,month_medium.q@nm7i=1
granted_parallel_environment   mpi slots 2-3
--------------------------------------------------------------------------------
id                             0002
name
owner                          test-user
state                          r
start_time                     01/10/2017 00:00:00.000
end_time                       10/20/2017 00:00:00.000
duration                       6792:00:00.000
project
message                        XXXXXXXXXX
submission_time                01/05/2017 13:23:20.001
group                          test
account                        sge
resource_list                  mem_req=3.1G, s_vmem=3.1G
free_resources                 false
reserve_available_only         true
granted_slots_list             month_hdd.q@nt013i=1
--------------------------------------------------------------------------------
id                             0003
name
owner                          test-user1
state                          w
start_time                     01/01/2018 00:00:00.000
end_time                       01/02/2018 00:00:00.000
duration                       24:00:00.000
project
message                        XXXXXXXXXX
submission_time                04/03/2017 10:50:10.285
group                          test
account                        sge
resource_list                  mem_req=10G, s_vmem=10G
free_resources                 false
reserve_available_only         true
granted_slots_list             month_ssd.q@nt060i=3
granted_parallel_environment   def_slot slots
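When several reservations are listed, the detailed output can be reduced to one line per reservation. The following is a minimal sketch, assuming the key/value layout shown in the sample output above; ar_states is a hypothetical helper name, not a Grid Engine command.

```shell
# Hypothetical helper: read "qrstat -ar ..." output on stdin and
# print "<ar-id> <state>" for each reservation block.
ar_states() {
    awk '$1 == "id"    { id = $2 }
         $1 == "state" { print id, $2 }'
}

# Demonstration on a shortened copy of the sample output.
ar_states <<'EOF'
id                             0001
owner                          test-user
state                          w
--------------------------------------------------------------------------------
id                             0002
owner                          test-user
state                          r
EOF
```

On the cluster, the helper would be used as a filter, for example by piping the output of qrstat -ar 0001,0002 into ar_states.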

How to execute the job

Specify ar-id (reservation number) with the -ar option of the qsub command, and execute the job.

For options, see the Grid Engine page.

If you submit a job with the qsub command and the ar-id before the start time of the reservation, the job starts as soon as the reservation start time arrives.

Note that a running job is forcibly terminated once the end time of the reservation has passed.
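A minimal sketch of the submission step is shown below. print_qsub_ar is a hypothetical helper that only prints the qsub command line, and job.sh and 1234 are made-up sample values; the -ar option is the one described above.

```shell
# Hypothetical helper: print (do not run) a qsub command bound to a reservation.
# $1 = ar-id (reservation number), $2 = job script
print_qsub_ar() {
    case "$1" in
        ''|*[!0-9]*)
            echo "ar-id must be numeric: $1" >&2
            return 1
            ;;
    esac
    echo "qsub -ar $1 $2"
}

print_qsub_ar 1234 job.sh   # prints: qsub -ar 1234 job.sh
```

Rejecting a non-numeric ar-id early avoids submitting a job that is not actually tied to the reservation.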

Delete the reserved quota for computational resources

Execute the QRDEL command with the ar-id (reservation number) to delete the reserved quota for computational resources.

Example:

QRDEL ar-id

Note: If step 5, "Delete the reserved quota for computational resources" (QRDEL command), is executed while a job is running, the running job is also terminated.