Commit b6b8e69

Deploying to gh-pages from @ dstackai/dstack@c45ecfb 🚀
1 parent 0be8b36 commit b6b8e69

12 files changed (+4287 -169 lines)


blog/archive/2025/index.html

Lines changed: 68 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -2880,6 +2880,17 @@
28802880
</label>
28812881
<ul class="md-nav__list" data-md-component="toc" data-md-scrollfix>
28822882

2883+
<li class="md-nav__item">
2884+
<a href="#supporting-intel-gaudi-accelerators" class="md-nav__link">
2885+
<span class="md-ellipsis">
2886+
2887+
Supporting Intel Gaudi accelerators
2888+
2889+
</span>
2890+
</a>
2891+
2892+
</li>
2893+
28832894
<li class="md-nav__item">
28842895
<a href="#efficient-distributed-training-with-aws-efa" class="md-nav__link">
28852896
<span class="md-ellipsis">
@@ -3433,6 +3444,17 @@
34333444
</label>
34343445
<ul class="md-nav__list" data-md-component="toc" data-md-scrollfix>
34353446

3447+
<li class="md-nav__item">
3448+
<a href="#supporting-intel-gaudi-accelerators" class="md-nav__link">
3449+
<span class="md-ellipsis">
3450+
3451+
Supporting Intel Gaudi accelerators
3452+
3453+
</span>
3454+
</a>
3455+
3456+
</li>
3457+
34363458
<li class="md-nav__item">
34373459
<a href="#efficient-distributed-training-with-aws-efa" class="md-nav__link">
34383460
<span class="md-ellipsis">
@@ -3495,6 +3517,52 @@ <h1 id="2025">2025<a class="headerlink" href="#2025" title="Permanent link">&par
34953517
<article class="md-post md-post--excerpt">
34963518
<header class="md-post__header">
34973519

3520+
<div class="md-post__meta md-meta">
3521+
<ul class="md-meta__list">
3522+
<li class="md-meta__item">
3523+
<time datetime="2025-02-21 00:00:00+00:00">February 21, 2025</time></li>
3524+
3525+
<li class="md-meta__item">
3526+
in
3527+
3528+
<a href="../../category/fleets/" class="md-meta__link">Fleets</a></li>
3529+
3530+
3531+
3532+
<li class="md-meta__item">
3533+
3534+
3 min read
3535+
3536+
</li>
3537+
3538+
3539+
</ul>
3540+
3541+
</div>
3542+
</header>
3543+
<div class="md-post__content md-typeset">
3544+
<h2 id="supporting-intel-gaudi-accelerators"><a class="toclink" href="../../intel-gaudi/">Supporting Intel Gaudi accelerators</a></h2>
3545+
<p>At <code>dstack</code>, our goal is to make AI container orchestration simpler and fully vendor-agnostic. That’s why we support not
3546+
just leading cloud providers and on-prem environments but also a wide range of accelerators.</p>
3547+
<p>With our latest release, we’re adding support
3548+
for <a href="https://www.intel.com/content/www/us/en/products/details/processors/ai-accelerators/gaudi.html" target="_blank">Intel Gaudi <span class="twemoji external"><svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 24 24"><path d="m11.93 5 2.83 2.83L5 17.59 6.42 19l9.76-9.75L19 12.07V5z"/></svg></span></a> and
3549+
launching a new partnership with Intel.</p>
3550+
<p><img src="https://github.com/dstackai/static-assets/blob/main/static-assets/images/dstack-intel-gaudi-and-intel-tiber-cloud-v2.png?raw=true" width="630"/></p>
3551+
3552+
3553+
<nav class="md-post__action">
3554+
<a href="../../intel-gaudi/">
3555+
Continue reading
3556+
</a>
3557+
</nav>
3558+
3559+
3560+
</div>
3561+
</article>
3562+
3563+
<article class="md-post md-post--excerpt">
3564+
<header class="md-post__header">
3565+
34983566
<div class="md-post__meta md-meta">
34993567
<ul class="md-meta__list">
35003568
<li class="md-meta__item">

blog/category/fleets/index.html

Lines changed: 68 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -3158,6 +3158,17 @@
31583158
</label>
31593159
<ul class="md-nav__list" data-md-component="toc" data-md-scrollfix>
31603160

3161+
<li class="md-nav__item">
3162+
<a href="#supporting-intel-gaudi-accelerators" class="md-nav__link">
3163+
<span class="md-ellipsis">
3164+
3165+
Supporting Intel Gaudi accelerators
3166+
3167+
</span>
3168+
</a>
3169+
3170+
</li>
3171+
31613172
<li class="md-nav__item">
31623173
<a href="#efficient-distributed-training-with-aws-efa" class="md-nav__link">
31633174
<span class="md-ellipsis">
@@ -3422,6 +3433,17 @@
34223433
</label>
34233434
<ul class="md-nav__list" data-md-component="toc" data-md-scrollfix>
34243435

3436+
<li class="md-nav__item">
3437+
<a href="#supporting-intel-gaudi-accelerators" class="md-nav__link">
3438+
<span class="md-ellipsis">
3439+
3440+
Supporting Intel Gaudi accelerators
3441+
3442+
</span>
3443+
</a>
3444+
3445+
</li>
3446+
34253447
<li class="md-nav__item">
34263448
<a href="#efficient-distributed-training-with-aws-efa" class="md-nav__link">
34273449
<span class="md-ellipsis">
@@ -3473,6 +3495,52 @@ <h1 id="fleets">Fleets<a class="headerlink" href="#fleets" title="Permanent link
34733495
<article class="md-post md-post--excerpt">
34743496
<header class="md-post__header">
34753497

3498+
<div class="md-post__meta md-meta">
3499+
<ul class="md-meta__list">
3500+
<li class="md-meta__item">
3501+
<time datetime="2025-02-21 00:00:00+00:00">February 21, 2025</time></li>
3502+
3503+
<li class="md-meta__item">
3504+
in
3505+
3506+
<a href="./" class="md-meta__link">Fleets</a></li>
3507+
3508+
3509+
3510+
<li class="md-meta__item">
3511+
3512+
3 min read
3513+
3514+
</li>
3515+
3516+
3517+
</ul>
3518+
3519+
</div>
3520+
</header>
3521+
<div class="md-post__content md-typeset">
3522+
<h2 id="supporting-intel-gaudi-accelerators"><a class="toclink" href="../../intel-gaudi/">Supporting Intel Gaudi accelerators</a></h2>
3523+
<p>At <code>dstack</code>, our goal is to make AI container orchestration simpler and fully vendor-agnostic. That’s why we support not
3524+
just leading cloud providers and on-prem environments but also a wide range of accelerators.</p>
3525+
<p>With our latest release, we’re adding support
3526+
for <a href="https://www.intel.com/content/www/us/en/products/details/processors/ai-accelerators/gaudi.html" target="_blank">Intel Gaudi <span class="twemoji external"><svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 24 24"><path d="m11.93 5 2.83 2.83L5 17.59 6.42 19l9.76-9.75L19 12.07V5z"/></svg></span></a> and
3527+
launching a new partnership with Intel.</p>
3528+
<p><img src="https://github.com/dstackai/static-assets/blob/main/static-assets/images/dstack-intel-gaudi-and-intel-tiber-cloud-v2.png?raw=true" width="630"/></p>
3529+
3530+
3531+
<nav class="md-post__action">
3532+
<a href="../../intel-gaudi/">
3533+
Continue reading
3534+
</a>
3535+
</nav>
3536+
3537+
3538+
</div>
3539+
</article>
3540+
3541+
<article class="md-post md-post--excerpt">
3542+
<header class="md-post__header">
3543+
34763544
<div class="md-post__meta md-meta">
34773545
<ul class="md-meta__list">
34783546
<li class="md-meta__item">

blog/distributed-training-with-aws-efa/index.html

Lines changed: 5 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -18,6 +18,8 @@
1818
<link rel="prev" href="../inactive-dev-environments-auto-shutdown/">
1919

2020

21+
<link rel="next" href="../intel-gaudi/">
22+
2123

2224

2325

@@ -3344,11 +3346,11 @@
33443346
<ul class="md-nav__list" data-md-component="toc" data-md-scrollfix>
33453347

33463348
<li class="md-nav__item">
3347-
<a href="#why-efa" class="md-nav__link">
3349+
<a href="#about-efa" class="md-nav__link">
33483350
<span class="md-ellipsis">
33493351

33503352
<span class="md-typeset">
3351-
Why EFA?
3353+
About EFA
33523354
</span>
33533355

33543356
</span>
@@ -3494,7 +3496,7 @@ <h1 id="efficient-distributed-training-with-aws-efa">Efficient distributed train
34943496
<p><img src="https://github.com/dstackai/static-assets/blob/main/static-assets/images/distributed-training-with-aws-efa-v2.png?raw=true" width="630"/></p>
34953497
<!-- more -->
34963498

3497-
<h2 id="why-efa">Why EFA?<a class="headerlink" href="#why-efa" title="Permanent link">&para;</a></h2>
3499+
<h2 id="about-efa">About EFA<a class="headerlink" href="#about-efa" title="Permanent link">&para;</a></h2>
34983500
<p>AWS EFA delivers up to 400 Gbps of bandwidth, enabling lightning-fast GPU-to-GPU communication across nodes. By
34993501
bypassing the kernel and providing direct network access, EFA minimizes latency and maximizes throughput. Its native
35003502
integration with the <code>nccl</code> library ensures optimal performance for large-scale distributed training.</p>

blog/index.html

Lines changed: 68 additions & 69 deletions
Original file line numberDiff line numberDiff line change
@@ -2794,6 +2794,17 @@
27942794
</label>
27952795
<ul class="md-nav__list" data-md-component="toc" data-md-scrollfix>
27962796

2797+
<li class="md-nav__item">
2798+
<a href="#supporting-intel-gaudi-accelerators" class="md-nav__link">
2799+
<span class="md-ellipsis">
2800+
2801+
Supporting Intel Gaudi accelerators
2802+
2803+
</span>
2804+
</a>
2805+
2806+
</li>
2807+
27972808
<li class="md-nav__item">
27982809
<a href="#efficient-distributed-training-with-aws-efa" class="md-nav__link">
27992810
<span class="md-ellipsis">
@@ -2891,17 +2902,6 @@
28912902
</span>
28922903
</a>
28932904

2894-
</li>
2895-
2896-
<li class="md-nav__item">
2897-
<a href="#monitoring-gpu-usage-and-other-container-metrics" class="md-nav__link">
2898-
<span class="md-ellipsis">
2899-
2900-
Monitoring GPU usage and other container metrics
2901-
2902-
</span>
2903-
</a>
2904-
29052905
</li>
29062906

29072907
</ul>
@@ -3497,6 +3497,17 @@
34973497
</label>
34983498
<ul class="md-nav__list" data-md-component="toc" data-md-scrollfix>
34993499

3500+
<li class="md-nav__item">
3501+
<a href="#supporting-intel-gaudi-accelerators" class="md-nav__link">
3502+
<span class="md-ellipsis">
3503+
3504+
Supporting Intel Gaudi accelerators
3505+
3506+
</span>
3507+
</a>
3508+
3509+
</li>
3510+
35003511
<li class="md-nav__item">
35013512
<a href="#efficient-distributed-training-with-aws-efa" class="md-nav__link">
35023513
<span class="md-ellipsis">
@@ -3594,17 +3605,6 @@
35943605
</span>
35953606
</a>
35963607

3597-
</li>
3598-
3599-
<li class="md-nav__item">
3600-
<a href="#monitoring-gpu-usage-and-other-container-metrics" class="md-nav__link">
3601-
<span class="md-ellipsis">
3602-
3603-
Monitoring GPU usage and other container metrics
3604-
3605-
</span>
3606-
</a>
3607-
36083608
</li>
36093609

36103610
</ul>
@@ -3631,6 +3631,52 @@ <h1 id="blog">Blog<a class="headerlink" href="#blog" title="Permanent link">&par
36313631
<article class="md-post md-post--excerpt">
36323632
<header class="md-post__header">
36333633

3634+
<div class="md-post__meta md-meta">
3635+
<ul class="md-meta__list">
3636+
<li class="md-meta__item">
3637+
<time datetime="2025-02-21 00:00:00+00:00">February 21, 2025</time></li>
3638+
3639+
<li class="md-meta__item">
3640+
in
3641+
3642+
<a href="category/fleets/" class="md-meta__link">Fleets</a></li>
3643+
3644+
3645+
3646+
<li class="md-meta__item">
3647+
3648+
3 min read
3649+
3650+
</li>
3651+
3652+
3653+
</ul>
3654+
3655+
</div>
3656+
</header>
3657+
<div class="md-post__content md-typeset">
3658+
<h2 id="supporting-intel-gaudi-accelerators"><a class="toclink" href="intel-gaudi/">Supporting Intel Gaudi accelerators</a></h2>
3659+
<p>At <code>dstack</code>, our goal is to make AI container orchestration simpler and fully vendor-agnostic. That’s why we support not
3660+
just leading cloud providers and on-prem environments but also a wide range of accelerators.</p>
3661+
<p>With our latest release, we’re adding support
3662+
for <a href="https://www.intel.com/content/www/us/en/products/details/processors/ai-accelerators/gaudi.html" target="_blank">Intel Gaudi <span class="twemoji external"><svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 24 24"><path d="m11.93 5 2.83 2.83L5 17.59 6.42 19l9.76-9.75L19 12.07V5z"/></svg></span></a> and
3663+
launching a new partnership with Intel.</p>
3664+
<p><img src="https://github.com/dstackai/static-assets/blob/main/static-assets/images/dstack-intel-gaudi-and-intel-tiber-cloud-v2.png?raw=true" width="630"/></p>
3665+
3666+
3667+
<nav class="md-post__action">
3668+
<a href="intel-gaudi/">
3669+
Continue reading
3670+
</a>
3671+
</nav>
3672+
3673+
3674+
</div>
3675+
</article>
3676+
3677+
<article class="md-post md-post--excerpt">
3678+
<header class="md-post__header">
3679+
36343680
<div class="md-post__meta md-meta">
36353681
<ul class="md-meta__list">
36363682
<li class="md-meta__item">
@@ -4089,53 +4135,6 @@ <h2 id="using-docker-and-docker-compose-inside-gpu-enabled-containers"><a class=
40894135
</div>
40904136
</article>
40914137

4092-
<article class="md-post md-post--excerpt">
4093-
<header class="md-post__header">
4094-
4095-
<div class="md-post__meta md-meta">
4096-
<ul class="md-meta__list">
4097-
<li class="md-meta__item">
4098-
<time datetime="2024-10-22 00:00:00+00:00">October 22, 2024</time></li>
4099-
4100-
<li class="md-meta__item">
4101-
in
4102-
4103-
<a href="category/amd/" class="md-meta__link">AMD</a>,
4104-
<a href="category/nvidia/" class="md-meta__link">NVIDIA</a>,
4105-
<a href="category/monitoring/" class="md-meta__link">Monitoring</a></li>
4106-
4107-
4108-
4109-
<li class="md-meta__item">
4110-
4111-
2 min read
4112-
4113-
</li>
4114-
4115-
4116-
</ul>
4117-
4118-
</div>
4119-
</header>
4120-
<div class="md-post__content md-typeset">
4121-
<h2 id="monitoring-gpu-usage-and-other-container-metrics"><a class="toclink" href="monitoring-gpu-usage/">Monitoring GPU usage and other container metrics</a></h2>
4122-
<h3 id="how-it-works" style="display:none"><a class="toclink" href="monitoring-gpu-usage/#how-it-works">How it works</a></h3>
4123-
<p>While it's possible to use third-party monitoring tools with <code>dstack</code>, it is often more convenient to debug your run and
4124-
track metrics out of the box. That's why, with the latest release, <code>dstack</code> introduced <a href="../docs/reference/cli/dstack/stats/"><code>dstack stats</code></a>, a new CLI (and API)
4125-
for monitoring container metrics, including GPU usage for <code>NVIDIA</code>, <code>AMD</code>, and other accelerators.</p>
4126-
<p><img src="https://github.com/dstackai/static-assets/blob/main/static-assets/images/dstack-stats-v2.png?raw=true" width="725"/></p>
4127-
4128-
4129-
<nav class="md-post__action">
4130-
<a href="monitoring-gpu-usage/">
4131-
Continue reading
4132-
</a>
4133-
</nav>
4134-
4135-
4136-
</div>
4137-
</article>
4138-
41394138

41404139

41414140
