From 6b0a72f0f06f52d309ab7441721bd2e9ad48977b Mon Sep 17 00:00:00 2001 From: Michael Weisner Date: Thu, 26 Mar 2026 14:05:18 -0400 Subject: [PATCH] Update 10_spec_sheet.md added some headers to the spec sheet and an explanation of the stakeholder groups: --- docs/hpc/10_spec_sheet.md | 4 +++- 1 file changed, 3 insertions(+), 1 deletion(-) diff --git a/docs/hpc/10_spec_sheet.md b/docs/hpc/10_spec_sheet.md index d73efec187..fd53dd5771 100644 --- a/docs/hpc/10_spec_sheet.md +++ b/docs/hpc/10_spec_sheet.md @@ -3,6 +3,7 @@ The Torch cluster has 518 [Intel "Xeon Platinum 8592+ 64C"](https://www.intel.com/content/www/us/en/products/sku/237261/intel-xeon-platinum-8592-processor-320m-cache-1-90-ghz/specifications.html) CPUs, 29 NVIDIA [H200](https://nvdam.widen.net/s/nb5zzzsjdf/hpc-datasheet-sc23-h200-datasheet-3002446) GPUs & 68 NVIDIA [L40S](https://resources.nvidia.com/en-us-l40s/l40s-datasheet-28413) GPUs connected together via Infiniband NDR400 interconnect. Further details on each kind of node is provided in the table below. +## Torch System Resources | Type | Nodes | CPU Cores | GPUs | Memory (GB) | CPUs per Node | GPUs per Node | Memory per Node (GB) | |---|---|---|---|---|---|---|---| | Standard Memory | 186 | 23,808 | N/A | 95,232 | 128 | N/A | 512 | @@ -24,7 +25,8 @@ Torch was tested in June 2025 using the [LINPACK benchmark system](https://top50 Torch was recently ranked [#40 on the Green 500 list](https://top500.org/lists/green500/list/2025/06/), a global list of the most energy efficient supercomputers in the world thanks to its advanced liquid cooling system. - The breakdown of these GPUs into stakeholder buckets is as follows: +## Torch Stakeholder Resources +Many GPUs on Torch are owned by stakeholder groups who have prioirty access to the resources. The breakdown of these GPUs into stakeholder groups is as follows: | Stakeholders | H200 | L40S | H100 | A100 | RTX Pro 6000 | Total | | :--- | :---: | :---: | :---: | :---: | :---: | :---: |