Cluster Information:

The cluster consists of:

  • 5 worker nodes

Worker node details:

  • HPE ProLiant XL270d Gen10
  • Intel(R) Xeon(R) Gold 6248 CPU @ 2.50GHz
  • NUMA node0 CPU(s): 0-19
  • NUMA node1 CPU(s): 20-39
  • Thread(s) per core: 1
  • Core(s) per socket: 20
  • Socket(s): 2
  • NUMA node(s): 2
  • 8x NVIDIA V100-SXM2-32GB with NVLink
  • 768 GB Memory
  • InfiniBand EDR connectivity

Partitions:

partition    time limit   allowed qos   default time   preempt mode   user availability
volta        72 hours     volta         5 minutes      off            ML
voltadebug   4 hours      volta         5 minutes      off            ML

    The voltadebug partition allows shared nodes, so please request only the resources you need. For example, if you need 1 GPU, use "--gres=gpu:1".
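
A minimal sketch of a batch script for a single-GPU job on voltadebug is shown below. The partition name and the "--gres=gpu:1" request come from this page. The job name, the 30-minute time limit, and the 5 CPU cores (40 cores divided by 8 GPUs per node) are illustrative assumptions, not site policy. Depending on how your account is configured, you may also need to add "--qos=volta".

```shell
#!/bin/bash
#SBATCH --partition=voltadebug   # shared-node partition listed on this page
#SBATCH --gres=gpu:1             # request exactly one GPU
#SBATCH --cpus-per-task=5        # assumption: 40 cores / 8 GPUs = 5 cores per GPU
#SBATCH --time=00:30:00          # assumption: 30 minutes; the default is only 5 minutes
#SBATCH --job-name=gpu-test      # hypothetical job name

# Slurm usually sets CUDA_VISIBLE_DEVICES to the GPU(s) it allocated to the job.
gpus="${CUDA_VISIBLE_DEVICES:-none}"
echo "Allocated GPU(s): ${gpus}"
```

Save the script under a name of your choice (for example, the hypothetical job.sh) and submit it with "sbatch job.sh". Setting "--time" explicitly matters here because a job that relies on the 5-minute default is likely to be stopped before it does useful work.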

    Limits:

    • Each user can submit a maximum of 100 jobs
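
Before submitting a large batch of jobs, it can help to check how close you are to the 100-job limit. The sketch below counts your queued and running jobs with squeue. The helper name count_jobs is hypothetical, and whether each job array task counts separately toward the limit is an assumption (the "-r" option lists one array task per line).

```shell
#!/bin/bash
MAX_JOBS=100   # per-user submission limit stated on this page

# Count non-empty lines read from standard input (one line per job).
count_jobs() {
    grep -c . || true
}

# Only query the scheduler when squeue is available, i.e. on the cluster.
if command -v squeue >/dev/null 2>&1; then
    # -h drops the header line; -r prints one line per job array task.
    current=$(squeue -u "${USER:-$(id -un)}" -h -r | count_jobs)
    echo "Jobs in queue: ${current} of ${MAX_JOBS}"
    if [ "$current" -ge "$MAX_JOBS" ]; then
        echo "At the limit; further submissions are likely to be rejected." >&2
    fi
fi
```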