System Architecture
System Architecture
Place system architecture description here.
Hardware and Component Specifications
| System Component | Configuration |
|---|---|
| Supermicro X12 Gaudi Training Nodes |
|
| CPU Type | Intel Xeon Gold 6336 |
| Habana Gaudi processors | 336 |
| Nodes | 42 |
| Training processors/Node | 8 |
| Host x86 processors/node | 2 |
| Sockets | 2 |
| Memory capacity |
* 512 GB DDR4 DRAM |
| Memory/training processor |
32 GB HDM2 |
| Local Storage |
6.4 TB local NVMe |
| Max CPU Memory bandwidth | ** GB/s |
| Intel First Generation Habana Inference Nodes | |
| CPU Type | Xeon Gold 6240 |
| First-Generation Habana Inference Processors | 16 |
| Nodes | 2 |
| First-Generation Habana Inference Cards/node | 8 |
| Cores/socket | 20 |
| Sockets | 2 |
| Clock speed | 2.5 GHz |
| Flop speed | 34.4 TFlop/s |
| Memory capacity | *384 GB DDR4 DRAM |
| Local Storage |
1.6TB Samsung PM1745b NVMe PCIe SSD |
| Max CPU Memory bandwidth | 281.6 GB/s |
| Standard Compute Nodes | |
| CPU Type | Intelx86 |
| Nodes | 36 |
| x86 processors/node | 2 |
| Memory Capacity | 384 GB |
| Local NVMe | 3.2 TB |
| Interconnect | |
| Topology | Full bi-section bandwidth switch |
| Per Node bandwidth | 6*400 Gb/s (bidirectional) |
| DISK I/O Subsystem | |
| File Systems | Ceph |
| Ceph Storage | 1 PB |