By Costin Caramarcu | Fri, 11/12/2021 - 16:09
Cluster Information:
The cluster consists of:
- 5 worker nodes
Each worker node has:
- HPE ProLiant XL270d Gen10
- Intel(R) Xeon(R) Gold 6248 CPU @ 2.50GHz
- NUMA node0 CPU(s): 0-19
- NUMA node1 CPU(s): 20-39
- Thread(s) per core: 1
- Core(s) per socket: 20
- Socket(s): 2
- NUMA node(s): 2
- 8x NVIDIA V100-SXM2-32GB GPUs with NVLink
- 768 GB Memory
- InfiniBand EDR connectivity
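As a rough guide for sizing requests, the node specification above works out to the following per-GPU share of cores and memory. This is a sketch only: the even split is an assumption made for illustration, not a stated site policy, and the memory actually available to jobs may be lower than the installed 768 GB.

```shell
#!/bin/bash
# Per-node totals taken from the hardware list above.
SOCKETS=2
CORES_PER_SOCKET=20
GPUS_PER_NODE=8
MEM_GB_PER_NODE=768

TOTAL_CORES=$(( SOCKETS * CORES_PER_SOCKET ))

# Assumption: the node is split evenly across its GPUs.
CORES_PER_GPU=$(( TOTAL_CORES / GPUS_PER_NODE ))
MEM_GB_PER_GPU=$(( MEM_GB_PER_NODE / GPUS_PER_NODE ))

echo "cores per GPU:  ${CORES_PER_GPU}"
echo "memory per GPU: ${MEM_GB_PER_GPU} GB"
```

With the figures above this gives 5 cores and 96 GB per GPU.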
Partitions:
partition | time limit | allowed qos | default time | preempt mode | user availability |
---|---|---|---|---|---|
volta | 72 hours | volta | 5 minutes | off | ML |
voltadebug | 4 hours | volta | 5 minutes | off | ML |
The voltadebug partition allows nodes to be shared between jobs, so please request only the resources you need. For example, if you need 1 GPU, use "--gres=gpu:1".
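A minimal batch script for a single-GPU job on voltadebug might look like the sketch below. The partition, QOS, and "--gres" values come from this page. The job name, CPU count, memory, and 30-minute time limit are illustrative assumptions. Since the default time is only 5 minutes, setting "--time" explicitly is usually worthwhile.

```shell
#!/bin/bash
# Partition and QOS as listed in the table above.
#SBATCH --partition=voltadebug
#SBATCH --qos=volta
# One GPU only, since nodes in this partition are shared.
#SBATCH --gres=gpu:1
# Explicit time limit; without it the 5-minute default applies.
#SBATCH --time=00:30:00
# Assumptions: illustrative job name, CPU count, and memory request.
#SBATCH --job-name=gpu-test
#SBATCH --ntasks=1
#SBATCH --cpus-per-task=5
#SBATCH --mem=64G

# Report where the job landed and which GPUs it can see.
# Slurm usually sets CUDA_VISIBLE_DEVICES for GPU allocations;
# outside a job it is typically unset, hence the fallback.
NODE_NAME=$(uname -n)
GPU_LIST="${CUDA_VISIBLE_DEVICES:-none}"
echo "node: ${NODE_NAME}"
echo "visible GPUs: ${GPU_LIST}"
```

Saved under a hypothetical name such as gpu-test.sh, the script would be submitted with "sbatch gpu-test.sh".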
Limits:
- Each user can submit a maximum of 100 jobs
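To see how close you are to this limit, something like the following sketch can be used. It assumes the limit counts both pending and running jobs and that each job array element counts as one job; neither detail is stated on this page.

```shell
#!/bin/bash
# Limit taken from the list above.
MAX_JOBS=100
ME="${USER:-$(id -un)}"

if command -v squeue >/dev/null 2>&1; then
    # -h drops the header line, -r prints one line per job array element.
    CURRENT_JOBS=$(( $(squeue -h -r -u "$ME" | wc -l) ))
    echo "${CURRENT_JOBS} of ${MAX_JOBS} jobs in the queue for ${ME}"
    if [ "$CURRENT_JOBS" -ge "$MAX_JOBS" ]; then
        echo "At the limit: further submissions will likely be rejected."
    fi
else
    echo "squeue not found; run this on a cluster login node."
fi
```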