Skip to content

Commit e02ef02

Browse files
author
MFC Action
committed
Docs @ 1f586b9
1 parent 14e9329 commit e02ef02

File tree

1 file changed

+28
-24
lines changed

1 file changed

+28
-24
lines changed

documentation/md_expectedPerformance.html

Lines changed: 28 additions & 24 deletions
Original file line numberDiff line numberDiff line change
@@ -163,55 +163,59 @@ <h1><a class="anchor" id="autotoc_md67"></a>
163163
<tr class="markdownTableRowEven">
164164
<td class="markdownTableBodyRight">AMD MI100 </td><td class="markdownTableBodyRight"></td><td class="markdownTableBodyRight">GPU </td><td class="markdownTableBodyRight">1 GPU </td><td class="markdownTableBodyRight">1.4 </td><td class="markdownTableBodyLeft">CCE 16.0.1 </td><td class="markdownTableBodyLeft">Cray internal system </td></tr>
165165
<tr class="markdownTableRowOdd">
166-
<td class="markdownTableBodyRight">NVIDIA L40S </td><td class="markdownTableBodyRight">Single-precision GPU </td><td class="markdownTableBodyRight">GPU </td><td class="markdownTableBodyRight">1 GPU </td><td class="markdownTableBodyRight">1.7 </td><td class="markdownTableBodyLeft">NVHPC 24.5 </td><td class="markdownTableBodyLeft">GT ICE </td></tr>
166+
<td class="markdownTableBodyRight">NVIDIA L40S </td><td class="markdownTableBodyRight">FP32-only GPU </td><td class="markdownTableBodyRight">GPU </td><td class="markdownTableBodyRight">1 GPU </td><td class="markdownTableBodyRight">1.7 </td><td class="markdownTableBodyLeft">NVHPC 24.5 </td><td class="markdownTableBodyLeft">GT ICE </td></tr>
167167
<tr class="markdownTableRowEven">
168-
<td class="markdownTableBodyRight">AMD EPYC 9654 </td><td class="markdownTableBodyRight">Genoa </td><td class="markdownTableBodyRight">CPU </td><td class="markdownTableBodyRight">96/96 cores </td><td class="markdownTableBodyRight">1.7 </td><td class="markdownTableBodyLeft">Intel oneAPI 2021.9 </td><td class="markdownTableBodyLeft">DOD Carpenter </td></tr>
168+
<td class="markdownTableBodyRight">AMD EPYC 9654 </td><td class="markdownTableBodyRight">Genoa </td><td class="markdownTableBodyRight">CPU </td><td class="markdownTableBodyRight">96 cores </td><td class="markdownTableBodyRight">1.7 </td><td class="markdownTableBodyLeft">Intel 2021.9 </td><td class="markdownTableBodyLeft">DOD Carpenter </td></tr>
169169
<tr class="markdownTableRowOdd">
170170
<td class="markdownTableBodyRight">NVIDIA P100 </td><td class="markdownTableBodyRight"></td><td class="markdownTableBodyRight">GPU </td><td class="markdownTableBodyRight">1 GPU </td><td class="markdownTableBodyRight">2.4 </td><td class="markdownTableBodyLeft">NVHPC 23.5 </td><td class="markdownTableBodyLeft">GT CSE Internal </td></tr>
171171
<tr class="markdownTableRowEven">
172-
<td class="markdownTableBodyRight">AMD EPYC 9534 </td><td class="markdownTableBodyRight">Genoa </td><td class="markdownTableBodyRight">CPU </td><td class="markdownTableBodyRight">64/64 cores </td><td class="markdownTableBodyRight">2.7 </td><td class="markdownTableBodyLeft">GNU 12.3.0 </td><td class="markdownTableBodyLeft">GT Phoenix </td></tr>
172+
<td class="markdownTableBodyRight">AMD EPYC 9534 </td><td class="markdownTableBodyRight">Genoa </td><td class="markdownTableBodyRight">CPU </td><td class="markdownTableBodyRight">64 cores </td><td class="markdownTableBodyRight">2.7 </td><td class="markdownTableBodyLeft">GNU 12.3.0 </td><td class="markdownTableBodyLeft">GT Phoenix </td></tr>
173173
<tr class="markdownTableRowOdd">
174-
<td class="markdownTableBodyRight">NVIDIA A40 </td><td class="markdownTableBodyRight">Single-precision GPU </td><td class="markdownTableBodyRight">GPU </td><td class="markdownTableBodyRight">1 GPU </td><td class="markdownTableBodyRight">3.3 </td><td class="markdownTableBodyLeft">NVHPC 22.11 </td><td class="markdownTableBodyLeft">NCSA Delta </td></tr>
174+
<td class="markdownTableBodyRight">NVIDIA A40 </td><td class="markdownTableBodyRight">FP32-only GPU </td><td class="markdownTableBodyRight">GPU </td><td class="markdownTableBodyRight">1 GPU </td><td class="markdownTableBodyRight">3.3 </td><td class="markdownTableBodyLeft">NVHPC 22.11 </td><td class="markdownTableBodyLeft">NCSA Delta </td></tr>
175175
<tr class="markdownTableRowEven">
176-
<td class="markdownTableBodyRight">NVIDIA Grace CPU </td><td class="markdownTableBodyRight">Arm, Neoverse V2 </td><td class="markdownTableBodyRight">CPU </td><td class="markdownTableBodyRight">72/72 cores </td><td class="markdownTableBodyRight">3.7 </td><td class="markdownTableBodyLeft">NVHPC 24.1 </td><td class="markdownTableBodyLeft">GT Rogues Gallery </td></tr>
176+
<td class="markdownTableBodyRight">Intel Xeon Max 9468 </td><td class="markdownTableBodyRight">Sapphire Rapids HBM </td><td class="markdownTableBodyRight">CPU </td><td class="markdownTableBodyRight">48 cores </td><td class="markdownTableBodyRight">3.5 </td><td class="markdownTableBodyLeft">NVHPC 24.5 </td><td class="markdownTableBodyLeft">GT Rogues Gallery </td></tr>
177177
<tr class="markdownTableRowOdd">
178-
<td class="markdownTableBodyRight">NVIDIA RTX6000 </td><td class="markdownTableBodyRight">Single-precision GPU </td><td class="markdownTableBodyRight">GPU </td><td class="markdownTableBodyRight">1 GPU </td><td class="markdownTableBodyRight">3.9 </td><td class="markdownTableBodyLeft">NVHPC 22.11 </td><td class="markdownTableBodyLeft">GT Phoenix </td></tr>
178+
<td class="markdownTableBodyRight">NVIDIA Grace CPU </td><td class="markdownTableBodyRight">Arm, Neoverse V2 </td><td class="markdownTableBodyRight">CPU </td><td class="markdownTableBodyRight">72 cores </td><td class="markdownTableBodyRight">3.7 </td><td class="markdownTableBodyLeft">NVHPC 24.1 </td><td class="markdownTableBodyLeft">GT Rogues Gallery </td></tr>
179179
<tr class="markdownTableRowEven">
180-
<td class="markdownTableBodyRight">AMD EPYC 7763 </td><td class="markdownTableBodyRight">Milan </td><td class="markdownTableBodyRight">CPU </td><td class="markdownTableBodyRight">64/64 cores </td><td class="markdownTableBodyRight">4.1 </td><td class="markdownTableBodyLeft">GNU 11.4.0 </td><td class="markdownTableBodyLeft">NCSA Delta </td></tr>
180+
<td class="markdownTableBodyRight">NVIDIA RTX6000 </td><td class="markdownTableBodyRight">FP32-only GPU </td><td class="markdownTableBodyRight">GPU </td><td class="markdownTableBodyRight">1 GPU </td><td class="markdownTableBodyRight">3.9 </td><td class="markdownTableBodyLeft">NVHPC 22.11 </td><td class="markdownTableBodyLeft">GT Phoenix </td></tr>
181181
<tr class="markdownTableRowOdd">
182-
<td class="markdownTableBodyRight">AMD EPYC 7713 </td><td class="markdownTableBodyRight">Milan </td><td class="markdownTableBodyRight">CPU </td><td class="markdownTableBodyRight">64/64 cores </td><td class="markdownTableBodyRight">5.0 </td><td class="markdownTableBodyLeft">GNU 12.3.0 </td><td class="markdownTableBodyLeft">GT Phoenix </td></tr>
182+
<td class="markdownTableBodyRight">AMD EPYC 7763 </td><td class="markdownTableBodyRight">Milan </td><td class="markdownTableBodyRight">CPU </td><td class="markdownTableBodyRight">64 cores </td><td class="markdownTableBodyRight">4.1 </td><td class="markdownTableBodyLeft">GNU 11.4.0 </td><td class="markdownTableBodyLeft">NCSA Delta </td></tr>
183183
<tr class="markdownTableRowEven">
184-
<td class="markdownTableBodyRight">Intel Xeon 8480CL </td><td class="markdownTableBodyRight">Sapphire Rapids </td><td class="markdownTableBodyRight">CPU </td><td class="markdownTableBodyRight">56/56 cores </td><td class="markdownTableBodyRight">5.0 </td><td class="markdownTableBodyLeft">NVHPC 24.5 </td><td class="markdownTableBodyLeft">GT Phoenix </td></tr>
184+
<td class="markdownTableBodyRight">AMD EPYC 7713 </td><td class="markdownTableBodyRight">Milan </td><td class="markdownTableBodyRight">CPU </td><td class="markdownTableBodyRight">64 cores </td><td class="markdownTableBodyRight">5.0 </td><td class="markdownTableBodyLeft">GNU 12.3.0 </td><td class="markdownTableBodyLeft">GT Phoenix </td></tr>
185185
<tr class="markdownTableRowOdd">
186-
<td class="markdownTableBodyRight">Intel Xeon 6454S </td><td class="markdownTableBodyRight">Sapphire Rapids </td><td class="markdownTableBodyRight">CPU </td><td class="markdownTableBodyRight">32/32 cores </td><td class="markdownTableBodyRight">5.6 </td><td class="markdownTableBodyLeft">NVHPC 24.5 </td><td class="markdownTableBodyLeft">GT Rogues Gallery </td></tr>
186+
<td class="markdownTableBodyRight">Intel Xeon 8480CL </td><td class="markdownTableBodyRight">Sapphire Rapids </td><td class="markdownTableBodyRight">CPU </td><td class="markdownTableBodyRight">56 cores </td><td class="markdownTableBodyRight">5.0 </td><td class="markdownTableBodyLeft">NVHPC 24.5 </td><td class="markdownTableBodyLeft">GT Phoenix </td></tr>
187187
<tr class="markdownTableRowEven">
188-
<td class="markdownTableBodyRight">Intel Xeon 8462Y+ </td><td class="markdownTableBodyRight">Sapphire Rapids </td><td class="markdownTableBodyRight">CPU </td><td class="markdownTableBodyRight">32/32 cores </td><td class="markdownTableBodyRight">6.2 </td><td class="markdownTableBodyLeft">GNU 12.3.0 </td><td class="markdownTableBodyLeft">GT ICE </td></tr>
188+
<td class="markdownTableBodyRight">Intel Xeon 6454S </td><td class="markdownTableBodyRight">Sapphire Rapids </td><td class="markdownTableBodyRight">CPU </td><td class="markdownTableBodyRight">32 cores </td><td class="markdownTableBodyRight">5.6 </td><td class="markdownTableBodyLeft">NVHPC 24.5 </td><td class="markdownTableBodyLeft">GT Rogues Gallery </td></tr>
189189
<tr class="markdownTableRowOdd">
190-
<td class="markdownTableBodyRight">Intel Xeon 6548Y+ </td><td class="markdownTableBodyRight">Emerald Rapids </td><td class="markdownTableBodyRight">CPU </td><td class="markdownTableBodyRight">32/32 cores </td><td class="markdownTableBodyRight">6.6 </td><td class="markdownTableBodyLeft">Intel oneAPI 2021.9 </td><td class="markdownTableBodyLeft">GT ICE </td></tr>
190+
<td class="markdownTableBodyRight">Intel Xeon 8462Y+ </td><td class="markdownTableBodyRight">Sapphire Rapids </td><td class="markdownTableBodyRight">CPU </td><td class="markdownTableBodyRight">32 cores </td><td class="markdownTableBodyRight">6.2 </td><td class="markdownTableBodyLeft">GNU 12.3.0 </td><td class="markdownTableBodyLeft">GT ICE </td></tr>
191191
<tr class="markdownTableRowEven">
192-
<td class="markdownTableBodyRight">Intel Xeon 8352Y </td><td class="markdownTableBodyRight">Ice Lake </td><td class="markdownTableBodyRight">CPU </td><td class="markdownTableBodyRight">32/32 cores </td><td class="markdownTableBodyRight">6.6 </td><td class="markdownTableBodyLeft">NVHPC 24.5 </td><td class="markdownTableBodyLeft">GT Rogues Gallery </td></tr>
192+
<td class="markdownTableBodyRight">Intel Xeon 6548Y+ </td><td class="markdownTableBodyRight">Emerald Rapids </td><td class="markdownTableBodyRight">CPU </td><td class="markdownTableBodyRight">32 cores </td><td class="markdownTableBodyRight">6.6 </td><td class="markdownTableBodyLeft">Intel 2021.9 </td><td class="markdownTableBodyLeft">GT ICE </td></tr>
193193
<tr class="markdownTableRowOdd">
194-
<td class="markdownTableBodyRight">Ampere Altra Q80-28 </td><td class="markdownTableBodyRight">Arm, Neoverse-N1 </td><td class="markdownTableBodyRight">CPU </td><td class="markdownTableBodyRight">80/80 cores </td><td class="markdownTableBodyRight">6.8 </td><td class="markdownTableBodyLeft">GNU 12.2.0 </td><td class="markdownTableBodyLeft">OLCF Wombat </td></tr>
194+
<td class="markdownTableBodyRight">Intel Xeon 8352Y </td><td class="markdownTableBodyRight">Ice Lake </td><td class="markdownTableBodyRight">CPU </td><td class="markdownTableBodyRight">32 cores </td><td class="markdownTableBodyRight">6.6 </td><td class="markdownTableBodyLeft">NVHPC 24.5 </td><td class="markdownTableBodyLeft">GT Rogues Gallery </td></tr>
195195
<tr class="markdownTableRowEven">
196-
<td class="markdownTableBodyRight">AMD EPYC 7513 </td><td class="markdownTableBodyRight">Milan </td><td class="markdownTableBodyRight">CPU </td><td class="markdownTableBodyRight">32/32 cores </td><td class="markdownTableBodyRight">7.4 </td><td class="markdownTableBodyLeft">GNU 12.3.0 </td><td class="markdownTableBodyLeft">GT ICE </td></tr>
196+
<td class="markdownTableBodyRight">Ampere Altra Q80-28 </td><td class="markdownTableBodyRight">Arm, Neoverse-N1 </td><td class="markdownTableBodyRight">CPU </td><td class="markdownTableBodyRight">80 cores </td><td class="markdownTableBodyRight">6.8 </td><td class="markdownTableBodyLeft">GNU 12.2.0 </td><td class="markdownTableBodyLeft">OLCF Wombat </td></tr>
197197
<tr class="markdownTableRowOdd">
198-
<td class="markdownTableBodyRight">AMD EPYC 7452 </td><td class="markdownTableBodyRight">Rome </td><td class="markdownTableBodyRight">CPU </td><td class="markdownTableBodyRight">32/32 cores </td><td class="markdownTableBodyRight">8.4 </td><td class="markdownTableBodyLeft">GNU 12.3.0 </td><td class="markdownTableBodyLeft">GT ICE </td></tr>
198+
<td class="markdownTableBodyRight">AMD EPYC 7513 </td><td class="markdownTableBodyRight">Milan </td><td class="markdownTableBodyRight">CPU </td><td class="markdownTableBodyRight">32 cores </td><td class="markdownTableBodyRight">7.4 </td><td class="markdownTableBodyLeft">GNU 12.3.0 </td><td class="markdownTableBodyLeft">GT ICE </td></tr>
199199
<tr class="markdownTableRowEven">
200-
<td class="markdownTableBodyRight">IBM Power10 </td><td class="markdownTableBodyRight"></td><td class="markdownTableBodyRight">CPU </td><td class="markdownTableBodyRight">24/24 cores </td><td class="markdownTableBodyRight">10 </td><td class="markdownTableBodyLeft">GNU 13.3.1 </td><td class="markdownTableBodyLeft">GT Rogues Gallery </td></tr>
200+
<td class="markdownTableBodyRight">AMD EPYC 7452 </td><td class="markdownTableBodyRight">Rome </td><td class="markdownTableBodyRight">CPU </td><td class="markdownTableBodyRight">32 cores </td><td class="markdownTableBodyRight">8.4 </td><td class="markdownTableBodyLeft">GNU 12.3.0 </td><td class="markdownTableBodyLeft">GT ICE </td></tr>
201201
<tr class="markdownTableRowOdd">
202-
<td class="markdownTableBodyRight">AMD EPYC 7401 </td><td class="markdownTableBodyRight">Naples </td><td class="markdownTableBodyRight">CPU </td><td class="markdownTableBodyRight">24/24 cores </td><td class="markdownTableBodyRight">10 </td><td class="markdownTableBodyLeft">GNU 10.3.1 </td><td class="markdownTableBodyLeft">LLNL Corona </td></tr>
202+
<td class="markdownTableBodyRight">IBM Power10 </td><td class="markdownTableBodyRight"></td><td class="markdownTableBodyRight">CPU </td><td class="markdownTableBodyRight">24 cores </td><td class="markdownTableBodyRight">10 </td><td class="markdownTableBodyLeft">GNU 13.3.1 </td><td class="markdownTableBodyLeft">GT Rogues Gallery </td></tr>
203203
<tr class="markdownTableRowEven">
204-
<td class="markdownTableBodyRight">Apple M1 Pro </td><td class="markdownTableBodyRight"></td><td class="markdownTableBodyRight">CPU </td><td class="markdownTableBodyRight">8/10 cores </td><td class="markdownTableBodyRight">14 </td><td class="markdownTableBodyLeft">GNU 13.2.0 </td><td class="markdownTableBodyLeft">N/A </td></tr>
204+
<td class="markdownTableBodyRight">AMD EPYC 7401 </td><td class="markdownTableBodyRight">Naples </td><td class="markdownTableBodyRight">CPU </td><td class="markdownTableBodyRight">24 cores </td><td class="markdownTableBodyRight">10 </td><td class="markdownTableBodyLeft">GNU 10.3.1 </td><td class="markdownTableBodyLeft">LLNL Corona </td></tr>
205205
<tr class="markdownTableRowOdd">
206-
<td class="markdownTableBodyRight">Intel Xeon 6226 </td><td class="markdownTableBodyRight">Cascade Lake </td><td class="markdownTableBodyRight">CPU </td><td class="markdownTableBodyRight">12/12 cores </td><td class="markdownTableBodyRight">17 </td><td class="markdownTableBodyLeft">GNU 12.3.0 </td><td class="markdownTableBodyLeft">GT ICE </td></tr>
206+
<td class="markdownTableBodyRight">Apple M1 Pro </td><td class="markdownTableBodyRight"></td><td class="markdownTableBodyRight">CPU </td><td class="markdownTableBodyRight">8 cores </td><td class="markdownTableBodyRight">14 </td><td class="markdownTableBodyLeft">GNU 13.2.0 </td><td class="markdownTableBodyLeft">N/A </td></tr>
207207
<tr class="markdownTableRowEven">
208-
<td class="markdownTableBodyRight">Apple M1 Max </td><td class="markdownTableBodyRight"></td><td class="markdownTableBodyRight">CPU </td><td class="markdownTableBodyRight">8/10 cores </td><td class="markdownTableBodyRight">18 </td><td class="markdownTableBodyLeft">GNU 14.1.0 </td><td class="markdownTableBodyLeft">N/A </td></tr>
208+
<td class="markdownTableBodyRight">Intel Xeon 6226 </td><td class="markdownTableBodyRight">Cascade Lake </td><td class="markdownTableBodyRight">CPU </td><td class="markdownTableBodyRight">12 cores </td><td class="markdownTableBodyRight">17 </td><td class="markdownTableBodyLeft">GNU 12.3.0 </td><td class="markdownTableBodyLeft">GT ICE </td></tr>
209209
<tr class="markdownTableRowOdd">
210-
<td class="markdownTableBodyRight">IBM Power9 </td><td class="markdownTableBodyRight"></td><td class="markdownTableBodyRight">CPU </td><td class="markdownTableBodyRight">20/21 cores </td><td class="markdownTableBodyRight">21 </td><td class="markdownTableBodyLeft">GNU 9.1.0 </td><td class="markdownTableBodyLeft">OLCF Summit </td></tr>
210+
<td class="markdownTableBodyRight">Apple M1 Max </td><td class="markdownTableBodyRight"></td><td class="markdownTableBodyRight">CPU </td><td class="markdownTableBodyRight">8 cores </td><td class="markdownTableBodyRight">18 </td><td class="markdownTableBodyLeft">GNU 14.1.0 </td><td class="markdownTableBodyLeft">N/A </td></tr>
211211
<tr class="markdownTableRowEven">
212-
<td class="markdownTableBodyRight">Intel Xeon E5-2650V4 </td><td class="markdownTableBodyRight">Broadwell </td><td class="markdownTableBodyRight">CPU </td><td class="markdownTableBodyRight">12/12 cores </td><td class="markdownTableBodyRight">27 </td><td class="markdownTableBodyLeft">NVHPC 23.5 </td><td class="markdownTableBodyLeft">GT CSE Internal </td></tr>
212+
<td class="markdownTableBodyRight">IBM Power9 </td><td class="markdownTableBodyRight"></td><td class="markdownTableBodyRight">CPU </td><td class="markdownTableBodyRight">20 cores </td><td class="markdownTableBodyRight">21 </td><td class="markdownTableBodyLeft">GNU 9.1.0 </td><td class="markdownTableBodyLeft">OLCF Summit </td></tr>
213213
<tr class="markdownTableRowOdd">
214-
<td class="markdownTableBodyRight">Intel Xeon E7-4850V3 </td><td class="markdownTableBodyRight">Haswell </td><td class="markdownTableBodyRight">CPU </td><td class="markdownTableBodyRight">14/14 cores </td><td class="markdownTableBodyRight">34 </td><td class="markdownTableBodyLeft">GNU 9.4.0 </td><td class="markdownTableBodyLeft">GT CSE Internal </td></tr>
214+
<td class="markdownTableBodyRight">Arm Cortex-A78AE </td><td class="markdownTableBodyRight">Arm, BlueField3 </td><td class="markdownTableBodyRight">CPU </td><td class="markdownTableBodyRight">16 cores </td><td class="markdownTableBodyRight">25 </td><td class="markdownTableBodyLeft">NVHPC 24.5 </td><td class="markdownTableBodyLeft">GT Rogues Gallery </td></tr>
215+
<tr class="markdownTableRowEven">
216+
<td class="markdownTableBodyRight">Intel Xeon E5-2650V4 </td><td class="markdownTableBodyRight">Broadwell </td><td class="markdownTableBodyRight">CPU </td><td class="markdownTableBodyRight">12 cores </td><td class="markdownTableBodyRight">27 </td><td class="markdownTableBodyLeft">NVHPC 23.5 </td><td class="markdownTableBodyLeft">GT CSE Internal </td></tr>
217+
<tr class="markdownTableRowOdd">
218+
<td class="markdownTableBodyRight">Intel Xeon E7-4850V3 </td><td class="markdownTableBodyRight">Haswell </td><td class="markdownTableBodyRight">CPU </td><td class="markdownTableBodyRight">14 cores </td><td class="markdownTableBodyRight">34 </td><td class="markdownTableBodyLeft">GNU 9.4.0 </td><td class="markdownTableBodyLeft">GT CSE Internal </td></tr>
215219
</table>
216220
<p><b>All grind times are in nanoseconds (ns) per grid point (gp) per equation (eq) per right-hand side (rhs) evaluation, so X ns/gp/eq/rhs. Lower is better.</b></p>
217221
<h1><a class="anchor" id="autotoc_md68"></a>

0 commit comments

Comments
 (0)