Commit 1cf1f16

MFC Action committed
Docs @ d292c29
1 parent 6dbeeca commit 1cf1f16

241 files changed, +253 -248 lines changed

documentation/md_expectedPerformance.html

Lines changed: 8 additions & 8 deletions
@@ -137,24 +137,24 @@
 <div class="textblock"><p><a class="anchor" id="autotoc_md54"></a> MFC has been benchmarked on several CPUs and GPU devices. This page shows a summary of these results.</p>
 <h1><a class="anchor" id="autotoc_md55"></a>
 Expected time-steps/hour</h1>
-<p>The following table outlines observed performance as nanoseconds per grid point (ns/GP) per right-hand side evaluation (lower is better). We solve an example 3D, inviscid, 5-equation model problem with two advected species (a total of 8 PDEs). The numerics are WENO5 and the HLLC approximate Riemann solver. We report results for various numbers of grid points per CPU die (or GPU device) and hardware.</p>
+<p>The following table outlines observed performance as nanoseconds per grid point (ns/GP) per equation (eq) per right-hand side (rhs) evaluation (lower is better). We solve an example 3D, inviscid, 5-equation model problem with two advected species (a total of 8 PDEs). The numerics are WENO5 and the HLLC approximate Riemann solver. This case is located in <code>examples/3D_performance_test</code>. We report results for various numbers of grid points per CPU die (or GPU device) and hardware.</p>
 <table class="markdownTable">
 <tr class="markdownTableHead">
 <th class="markdownTableHeadRight">Hardware </th><th class="markdownTableHeadCenter"></th><th class="markdownTableHeadCenter">1M GPs </th><th class="markdownTableHeadCenter">4M GPs </th><th class="markdownTableHeadCenter">8M GPs </th><th class="markdownTableHeadCenter">Compiler </th><th class="markdownTableHeadLeft">Computer </th></tr>
 <tr class="markdownTableRowOdd">
-<td class="markdownTableBodyRight">NVIDIA V100 </td><td class="markdownTableBodyCenter">1 device </td><td class="markdownTableBodyCenter">96 </td><td class="markdownTableBodyCenter">104 </td><td class="markdownTableBodyCenter">104 </td><td class="markdownTableBodyCenter">NVHPC 22.11 </td><td class="markdownTableBodyLeft">PACE Phoenix </td></tr>
+<td class="markdownTableBodyRight">NVIDIA V100 </td><td class="markdownTableBodyCenter">1 device </td><td class="markdownTableBodyCenter">12.0 </td><td class="markdownTableBodyCenter">13.0 </td><td class="markdownTableBodyCenter">13.0 </td><td class="markdownTableBodyCenter">NVHPC 22.11 </td><td class="markdownTableBodyLeft">PACE Phoenix </td></tr>
 <tr class="markdownTableRowEven">
-<td class="markdownTableBodyRight">NVIDIA V100 </td><td class="markdownTableBodyCenter">1 device </td><td class="markdownTableBodyCenter">101 </td><td class="markdownTableBodyCenter">104 </td><td class="markdownTableBodyCenter">104 </td><td class="markdownTableBodyCenter">NVHPC 22.11 </td><td class="markdownTableBodyLeft">OLCF Summit </td></tr>
+<td class="markdownTableBodyRight">NVIDIA V100 </td><td class="markdownTableBodyCenter">1 device </td><td class="markdownTableBodyCenter">12.6 </td><td class="markdownTableBodyCenter">13.0 </td><td class="markdownTableBodyCenter">13.0 </td><td class="markdownTableBodyCenter">NVHPC 22.11 </td><td class="markdownTableBodyLeft">OLCF Summit </td></tr>
 <tr class="markdownTableRowOdd">
-<td class="markdownTableBodyRight">NVIDIA A100 </td><td class="markdownTableBodyCenter">1 device </td><td class="markdownTableBodyCenter">71 </td><td class="markdownTableBodyCenter">56 </td><td class="markdownTableBodyCenter">59 </td><td class="markdownTableBodyCenter">NVHPC 23.5 </td><td class="markdownTableBodyLeft">Wingtip </td></tr>
+<td class="markdownTableBodyRight">NVIDIA A100 </td><td class="markdownTableBodyCenter">1 device </td><td class="markdownTableBodyCenter">8.9 </td><td class="markdownTableBodyCenter">7.0 </td><td class="markdownTableBodyCenter">7.4 </td><td class="markdownTableBodyCenter">NVHPC 23.5 </td><td class="markdownTableBodyLeft">Wingtip </td></tr>
 <tr class="markdownTableRowEven">
-<td class="markdownTableBodyRight">AMD MI250X </td><td class="markdownTableBodyCenter">1 GCD </td><td class="markdownTableBodyCenter">108 </td><td class="markdownTableBodyCenter">90 </td><td class="markdownTableBodyCenter">96 </td><td class="markdownTableBodyCenter">CCE 16.0.1 </td><td class="markdownTableBodyLeft">OLCF Frontier </td></tr>
+<td class="markdownTableBodyRight">AMD MI250X </td><td class="markdownTableBodyCenter">1 GCD </td><td class="markdownTableBodyCenter">13.5 </td><td class="markdownTableBodyCenter">11.3 </td><td class="markdownTableBodyCenter">12 </td><td class="markdownTableBodyCenter">CCE 16.0.1 </td><td class="markdownTableBodyLeft">OLCF Frontier </td></tr>
 <tr class="markdownTableRowOdd">
-<td class="markdownTableBodyRight">Intel Xeon Gold 6226 </td><td class="markdownTableBodyCenter">12 cores </td><td class="markdownTableBodyCenter">1963 </td><td class="markdownTableBodyCenter">1688 </td><td class="markdownTableBodyCenter">1686 </td><td class="markdownTableBodyCenter">GNU 10.3.0 </td><td class="markdownTableBodyLeft">PACE Phoenix </td></tr>
+<td class="markdownTableBodyRight">Intel Xeon Gold 6226 </td><td class="markdownTableBodyCenter">12 cores </td><td class="markdownTableBodyCenter">245 </td><td class="markdownTableBodyCenter">211 </td><td class="markdownTableBodyCenter">211 </td><td class="markdownTableBodyCenter">GNU 10.3.0 </td><td class="markdownTableBodyLeft">PACE Phoenix </td></tr>
 <tr class="markdownTableRowEven">
-<td class="markdownTableBodyRight">Apple M2 </td><td class="markdownTableBodyCenter">6 cores </td><td class="markdownTableBodyCenter">2919 </td><td class="markdownTableBodyCenter">245 </td><td class="markdownTableBodyCenter">4500 </td><td class="markdownTableBodyCenter">GNU 13.2.0 </td><td class="markdownTableBodyLeft">N/A </td></tr>
+<td class="markdownTableBodyRight">Apple M2 </td><td class="markdownTableBodyCenter">6 cores </td><td class="markdownTableBodyCenter">365 </td><td class="markdownTableBodyCenter">306 </td><td class="markdownTableBodyCenter">563 </td><td class="markdownTableBodyCenter">GNU 13.2.0 </td><td class="markdownTableBodyLeft">N/A </td></tr>
 </table>
-<p><b>All results are in nanoseconds (ns) per grid point (gp) per right-hand side (rhs) evaluation. Lower is better.</b></p>
+<p><b>All results are in nanoseconds (ns) per grid point (gp) per equation (eq) per right-hand side (rhs) evaluation, so X ns/gp/eq/rhs. Lower is better.</b></p>
 <h1><a class="anchor" id="autotoc_md56"></a>
 Weak scaling</h1>
 <p>Weak scaling results are obtained by increasing the problem size with the number of processes so that work per process remains constant.</p>
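
Note on the unit change in the hunk above: the new table values appear to be the old ns/GP/rhs values divided by the number of equations (8 PDEs, as stated in the page text). The sketch below checks that reading against the rows shown in this diff. The dictionary names and the helper `per_equation` are illustrative only and are not part of MFC. The Apple M2 row is left out because its old 4M GPs entry (245) does not follow the same ratio as its new entry (306).

```python
# Number of PDEs in the benchmark case, as stated in the page text.
NUM_EQUATIONS = 8

# Old values (ns/GP/rhs) from the removed rows: [1M GPs, 4M GPs, 8M GPs].
old_ns_per_gp = {
    "NVIDIA V100 (PACE Phoenix)": [96, 104, 104],
    "NVIDIA V100 (OLCF Summit)": [101, 104, 104],
    "NVIDIA A100 (Wingtip)": [71, 56, 59],
    "AMD MI250X (OLCF Frontier)": [108, 90, 96],
    "Intel Xeon Gold 6226 (PACE Phoenix)": [1963, 1688, 1686],
}

# New values (ns/GP/eq/rhs) from the added rows, in the same column order.
new_ns_per_gp_eq = {
    "NVIDIA V100 (PACE Phoenix)": [12.0, 13.0, 13.0],
    "NVIDIA V100 (OLCF Summit)": [12.6, 13.0, 13.0],
    "NVIDIA A100 (Wingtip)": [8.9, 7.0, 7.4],
    "AMD MI250X (OLCF Frontier)": [13.5, 11.3, 12],
    "Intel Xeon Gold 6226 (PACE Phoenix)": [245, 211, 211],
}


def per_equation(ns_per_gp, num_equations=NUM_EQUATIONS):
    """Convert ns/GP/rhs to ns/GP/eq/rhs by dividing by the equation count."""
    return ns_per_gp / num_equations


# Largest absolute gap between the recomputed and the published value per row.
max_gap = {
    hardware: max(
        abs(per_equation(old) - new)
        for old, new in zip(old_values, new_ns_per_gp_eq[hardware])
    )
    for hardware, old_values in old_ns_per_gp.items()
}

for hardware, gap in max_gap.items():
    print(f"{hardware}: max |old/8 - new| = {gap:.3f}")
```

Under this assumption every listed row agrees with the new table to within rounding of the published figures.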

simulation/doxygen_crawl.html

Lines changed: 1 addition & 1 deletion
@@ -491,11 +491,11 @@
 <a href="m__rhs_8fpp_8f90.html#a2d84a02a65f699f43818db9feca4e25d"/>
 <a href="m__rhs_8fpp_8f90.html#a340bafa0c8d745c3060facd9cfef7ec4"/>
 <a href="m__rhs_8fpp_8f90.html#a376bc3b64f78384ac773235ee7fbdd98"/>
+<a href="m__rhs_8fpp_8f90.html#a434cc22610390c8f0a8f8ff667c1c573"/>
 <a href="m__rhs_8fpp_8f90.html#a6c337b82dc4c47f2d3cc57408aa1fc9f"/>
 <a href="m__rhs_8fpp_8f90.html#a762d1c12479dcf998f8a5a47dc258ddc"/>
 <a href="m__rhs_8fpp_8f90.html#a796247036b68f5c2f04db3b4c943b5f2"/>
 <a href="m__rhs_8fpp_8f90.html#a811c16d33272b2c75d7a75f24c675795"/>
-<a href="m__rhs_8fpp_8f90.html#a8522bdf311a42fe6ff7f2b433656573f"/>
 <a href="m__rhs_8fpp_8f90.html#a87616394067f1273cd06c4d299d10add"/>
 <a href="m__rhs_8fpp_8f90.html#a8947fb914dfe4c2f0e688f4061e802ee"/>
 <a href="m__rhs_8fpp_8f90.html#a9dff053a7e3f9c1702a161f55d3e1726"/>

simulation/globals_func.html

Lines changed: 1 addition & 1 deletion
@@ -235,7 +235,7 @@ <h3><a id="index_s" name="index_s"></a>- s -</h3><ul>
 <li>s_compute_ptg_residue()&#160;:&#160;<a class="el" href="m__phase__change_8fpp_8f90.html#abdbfe3d69ae51ba911409597a6870620">m_phase_change.fpp.f90</a></li>
 <li>s_compute_qbmm_rhs()&#160;:&#160;<a class="el" href="m__qbmm_8fpp_8f90.html#a76636f7da72646919d564340441e9696">m_qbmm.fpp.f90</a></li>
 <li>s_compute_rectangle_levelset()&#160;:&#160;<a class="el" href="m__compute__levelset_8fpp_8f90.html#ae85abb116c0a730a0ce3081a2f05761d">m_compute_levelset.fpp.f90</a></li>
-<li>s_compute_rhs()&#160;:&#160;<a class="el" href="m__rhs_8fpp_8f90.html#a8522bdf311a42fe6ff7f2b433656573f">m_rhs.fpp.f90</a></li>
+<li>s_compute_rhs()&#160;:&#160;<a class="el" href="m__rhs_8fpp_8f90.html#a434cc22610390c8f0a8f8ff667c1c573">m_rhs.fpp.f90</a></li>
 <li>s_compute_speed_of_sound()&#160;:&#160;<a class="el" href="m__cbc_8fpp_8f90.html#a29014c6ce75edc1b102b057644ebd9ee">m_cbc.fpp.f90</a>, <a class="el" href="m__data__output_8fpp_8f90.html#ae249d3380b38c2174721a676b9c062cc">m_data_output.fpp.f90</a>, <a class="el" href="m__riemann__solvers_8fpp_8f90.html#ae680604e8ff6dd9b8048f2be0c0f79db">m_riemann_solvers.fpp.f90</a></li>
 <li>s_compute_sphere_levelset()&#160;:&#160;<a class="el" href="m__compute__levelset_8fpp_8f90.html#a7d56ae2993a959a227476a8c0d19e1d3">m_compute_levelset.fpp.f90</a></li>
 <li>s_compute_viscous_rhs()&#160;:&#160;<a class="el" href="m__viscous_8fpp_8f90.html#ad3952fdb60dca7f4970ec6f2ef1fb0c6">m_viscous.fpp.f90</a></li>

simulation/globals_s.html

Lines changed: 1 addition & 1 deletion
@@ -175,7 +175,7 @@ <h3><a id="index_s" name="index_s"></a>- s -</h3><ul>
 <li>s_compute_ptg_residue()&#160;:&#160;<a class="el" href="m__phase__change_8fpp_8f90.html#abdbfe3d69ae51ba911409597a6870620">m_phase_change.fpp.f90</a></li>
 <li>s_compute_qbmm_rhs()&#160;:&#160;<a class="el" href="m__qbmm_8fpp_8f90.html#a76636f7da72646919d564340441e9696">m_qbmm.fpp.f90</a></li>
 <li>s_compute_rectangle_levelset()&#160;:&#160;<a class="el" href="m__compute__levelset_8fpp_8f90.html#ae85abb116c0a730a0ce3081a2f05761d">m_compute_levelset.fpp.f90</a></li>
-<li>s_compute_rhs()&#160;:&#160;<a class="el" href="m__rhs_8fpp_8f90.html#a8522bdf311a42fe6ff7f2b433656573f">m_rhs.fpp.f90</a></li>
+<li>s_compute_rhs()&#160;:&#160;<a class="el" href="m__rhs_8fpp_8f90.html#a434cc22610390c8f0a8f8ff667c1c573">m_rhs.fpp.f90</a></li>
 <li>s_compute_speed_of_sound()&#160;:&#160;<a class="el" href="m__cbc_8fpp_8f90.html#a29014c6ce75edc1b102b057644ebd9ee">m_cbc.fpp.f90</a>, <a class="el" href="m__data__output_8fpp_8f90.html#ae249d3380b38c2174721a676b9c062cc">m_data_output.fpp.f90</a>, <a class="el" href="m__riemann__solvers_8fpp_8f90.html#ae680604e8ff6dd9b8048f2be0c0f79db">m_riemann_solvers.fpp.f90</a></li>
 <li>s_compute_sphere_levelset()&#160;:&#160;<a class="el" href="m__compute__levelset_8fpp_8f90.html#a7d56ae2993a959a227476a8c0d19e1d3">m_compute_levelset.fpp.f90</a></li>
 <li>s_compute_viscous_rhs()&#160;:&#160;<a class="el" href="m__viscous_8fpp_8f90.html#ad3952fdb60dca7f4970ec6f2ef1fb0c6">m_viscous.fpp.f90</a></li>

simulation/m__bubbles_8fpp_8f90_a1d5426908fc62e135de2e554b061f9cc_icgraph.map

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,7 @@
 <area shape="poly" id="edge3_Node000002_Node000003" title=" " alt="" coords="1174,71,1140,71,1140,66,1174,66"/>
 <area shape="rect" id="Node000004" href="$m__time__steppers_8fpp_8f90.html#a7d3b794980e24632e81102bb05044392" title="Bubble source part in Strang operator splitting scheme." alt="" coords="733,5,897,31"/>
 <area shape="poly" id="edge4_Node000003_Node000004" title=" " alt="" coords="971,56,872,33,874,28,972,50"/>
-<area shape="rect" id="Node000008" href="$m__rhs_8fpp_8f90.html#a8522bdf311a42fe6ff7f2b433656573f" title=" " alt="" coords="755,80,875,105"/>
+<area shape="rect" id="Node000008" href="$m__rhs_8fpp_8f90.html#a434cc22610390c8f0a8f8ff667c1c573" title=" " alt="" coords="755,80,875,105"/>
 <area shape="poly" id="edge8_Node000003_Node000008" title=" " alt="" coords="932,83,876,89,875,84,931,78"/>
 <area shape="rect" id="Node000005" href="$m__time__steppers_8fpp_8f90.html#a5858523edfce42df32b2b0a8af54bc0e" title="Strang splitting scheme with 3rd order TVD RK time&#45;stepping algorithm for the flux term and adaptive ..." alt="" coords="347,29,485,55"/>
 <area shape="poly" id="edge5_Node000004_Node000005" title=" " alt="" coords="720,16,632,16,539,21,471,32,470,27,538,15,632,10,720,11"/>
Lines changed: 1 addition & 1 deletion
@@ -1 +1 @@
-2f527a9294aa969f00febc8c14fe96aa
+d9685010ca1289dffd0d2db2b30c51b5

simulation/m__bubbles_8fpp_8f90_a1d5426908fc62e135de2e554b061f9cc_icgraph.svg

Lines changed: 1 addition & 1 deletion

simulation/m__bubbles_8fpp_8f90_a1d5426908fc62e135de2e554b061f9cc_icgraph_org.svg

Lines changed: 1 addition & 1 deletion

simulation/m__bubbles_8fpp_8f90_a5a0193f314079c7e9d923b9374eb20e3_icgraph.map

Lines changed: 1 addition & 1 deletion
@@ -8,7 +8,7 @@
 <area shape="poly" id="edge4_Node000002_Node000003" title=" " alt="" coords="1174,71,1140,71,1140,66,1174,66"/>
 <area shape="rect" id="Node000004" href="$m__time__steppers_8fpp_8f90.html#a7d3b794980e24632e81102bb05044392" title="Bubble source part in Strang operator splitting scheme." alt="" coords="733,5,897,31"/>
 <area shape="poly" id="edge5_Node000003_Node000004" title=" " alt="" coords="971,56,872,33,874,28,972,50"/>
-<area shape="rect" id="Node000008" href="$m__rhs_8fpp_8f90.html#a8522bdf311a42fe6ff7f2b433656573f" title=" " alt="" coords="755,80,875,105"/>
+<area shape="rect" id="Node000008" href="$m__rhs_8fpp_8f90.html#a434cc22610390c8f0a8f8ff667c1c573" title=" " alt="" coords="755,80,875,105"/>
 <area shape="poly" id="edge9_Node000003_Node000008" title=" " alt="" coords="932,83,876,89,875,84,931,78"/>
 <area shape="rect" id="Node000005" href="$m__time__steppers_8fpp_8f90.html#a5858523edfce42df32b2b0a8af54bc0e" title="Strang splitting scheme with 3rd order TVD RK time&#45;stepping algorithm for the flux term and adaptive ..." alt="" coords="347,29,485,55"/>
 <area shape="poly" id="edge6_Node000004_Node000005" title=" " alt="" coords="720,16,632,16,539,21,471,32,470,27,538,15,632,10,720,11"/>
Lines changed: 1 addition & 1 deletion
@@ -1 +1 @@
-a1649f730e7ae62a384ce2a728b355de
+46a4778f7e56ccac0d75e952e0bc996f