Enable kubernetes_node_scale benchmark (up to 5k nodes) on AWS EKS with Karpenter by kiryl-filatau · Pull Request #6512 · GoogleCloudPlatform/PerfKitBenchmarker

kiryl-filatau · 2026-03-04T18:27:06Z

Summary

Enables running the kubernetes_node_scale benchmark (0→5k→0→5k nodes) on AWS EKS with Karpenter. The benchmark scales a deployment with pod anti-affinity, measures the first scale-up, the scale-down, and a second scale-up, then tears down the cluster.
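
To illustrate why pod anti-affinity makes the replica count drive the node count, here is a minimal sketch of such a Deployment manifest built in Python. It is not the template from this PR: the function name `build_scale_deployment`, the `app` label, the pause image, and the choice of a required (rather than preferred) anti-affinity rule are all assumptions made for illustration.

```python
from typing import Any, Dict


def build_scale_deployment(replicas: int, name: str = "node-scale") -> Dict[str, Any]:
    """Return a Deployment manifest whose pods refuse to share a node.

    Hypothetical helper: names, labels and image are illustrative only.
    """
    labels = {"app": name}
    # A required anti-affinity term keyed on the hostname topology means
    # no two pods carrying these labels may be scheduled onto one node.
    anti_affinity_term = {
        "labelSelector": {"matchLabels": labels},
        "topologyKey": "kubernetes.io/hostname",
    }
    return {
        "apiVersion": "apps/v1",
        "kind": "Deployment",
        "metadata": {"name": name},
        "spec": {
            "replicas": replicas,
            "selector": {"matchLabels": labels},
            "template": {
                "metadata": {"labels": labels},
                "spec": {
                    "affinity": {
                        "podAntiAffinity": {
                            "requiredDuringSchedulingIgnoredDuringExecution": [
                                anti_affinity_term
                            ]
                        }
                    },
                    # Assumed placeholder workload; the PR may use another image.
                    "containers": [
                        {"name": "pause", "image": "registry.k8s.io/pause:3.9"}
                    ],
                },
            },
        },
    }
```

With a rule like this, asking for 5000 replicas leaves the scheduler unable to place pods until the autoscaler (Karpenter here) has provisioned roughly one node per pod, which is what the benchmark times.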

Main changes

kubernetes_node_scale benchmark: Template and scaling logic (scale up, scale down, phases), metrics collection, and timeouts tuned for large runs.
EKS + Karpenter: Nodepool template (instance types including t, higher CPU limit), EKS/Karpenter cluster lifecycle and cleanup.
Teardown robustness: Orphan ENI deletion in _CleanupKarpenter now retries with backoff on AWS throttling (RequestLimitExceeded) and treats "ENI not found" as success; suppress_failure is used for these cases.
Tracker: _StopWatchingForNodeChanges now makes a single "get nodes" pass. Machine type is resolved only for nodes that currently exist, and "unknown" is used for the others, which avoids thousands of kubectl calls on 5k-node runs.
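
The teardown behaviour described above (retry on throttling, treat a missing ENI as already deleted) can be sketched roughly as follows. This is a standalone illustration, not the code in _CleanupKarpenter: `delete_eni_with_retry`, the `delete_fn` callable returning `(exit_code, stderr)`, and the retry parameters are hypothetical, and the sketch assumes the "ENI not found" case surfaces as the EC2 error code `InvalidNetworkInterfaceID.NotFound`.

```python
import time
from typing import Callable, Tuple

THROTTLE_MARKER = "RequestLimitExceeded"
# Assumption: this EC2 error code is what "ENI not found" looks like in stderr.
NOT_FOUND_MARKER = "InvalidNetworkInterfaceID.NotFound"


def delete_eni_with_retry(
    delete_fn: Callable[[str], Tuple[int, str]],
    eni_id: str,
    max_attempts: int = 5,
    base_delay_s: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> bool:
    """Delete one ENI, retrying with exponential backoff when throttled.

    delete_fn is a hypothetical callable that issues the delete and
    returns (exit_code, stderr). Returns True once the ENI is gone and
    False if every attempt was throttled.
    """
    for attempt in range(max_attempts):
        exit_code, stderr = delete_fn(eni_id)
        if exit_code == 0 or NOT_FOUND_MARKER in stderr:
            return True  # Deleted now, or someone else already removed it.
        if THROTTLE_MARKER not in stderr:
            # Any other failure is unexpected and should not be hidden.
            raise RuntimeError(f"Failed to delete {eni_id}: {stderr}")
        if attempt < max_attempts - 1:
            sleep(base_delay_s * 2**attempt)  # 1s, 2s, 4s, ...
    return False
```

Passing `sleep` as a parameter keeps the sketch easy to exercise without real delays; a production version would likely add jitter and cap the maximum delay.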

NOTE: Hardcoded values to be updated

vofish and others added 19 commits January 30, 2026 18:01

Add template and scaling logic

0991f6e

Add scaling down logic and gathering metrics

502bfab

Add scaling down logic, phases and gathering metrics

cf15c47

Refactor kubernetes_node_scale benchmark

f178b3d

Merge branch 'master' into azure-5k

dab9712

Add template and scaling logic

8f08511

Add scaling down logic and gathering metrics

3cca10e

Add scaling down logic, phases and gathering metrics

418abd6

Refactor kubernetes_node_scale benchmark

7226547

Update import and j2 template

97d4150

Fix issue where the first kubectl get command might fail

5dd8b70

Raise an error when the timeout is reached

58c3ddf

add pyink modification

214e6f0

Merge branch 'azure-5k' into aws-5k

e0b33ec

Add optional argument suppress_logging to 'GetAllNamesForResourceType' and 'GetNodeNames' methods

d448ed9

Merge branch 'azure-5k' into aws-5k

3519bf3

extend timeout for deletion

5577dee

decrease the timeout, as 1 hour is enough

5869a7e

pyink reformat

ed94853

kiryl-filatau wants to merge 19 commits into GoogleCloudPlatform:master from kiryl-filatau:aws-5k
