Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

MIG resources

...

entered the queues on Feb 26, 2024. Batch script changes will be required for some jobs to run correctly.

...



Please carefully read the below to determine if batch script changes are required for your jobs.

Details

  • The Four A100 GPUs on Puma Node r5u13n1 are each subdivided into 3 smaller virtual GPUs using the Nvidia MIG (Multi-Instance GPU) method.  Each of these MIG slices allows the use of 20 GB of GPU memory.  The vast majority of jobs run on Puma in 2023 used less than this amount of GPU memory.  The  12 MIG GPUs  will increase overall GPU availability on Puma by freeing the 32 GB V100 GPUs for users requiring larger amounts of GPU memory.

  • Jobs requesting MIG resources will ideally be scheduled more quickly than those requesting the standard V100 GPUs, so MIG resources should be preferred when sufficient.

  • A limitation is that only one MIG slice can be addressed by a single application, so MIG slices are not appropriate for jobs utilizing multiple GPUs.

...

With the inclusion of MIG GPU resources, the GPU resources allocated by the above directive may be either MIG slices or V100 GPUs, depending on which resources are available.

  • There are two conditions under which users will need to explicitly request the use of V100 (Volta) GPU resources in batch job submissions:

1. If the 20 GB of GPU memory provided by a MIG slice is insufficient for your analysis

...