Add Claude AI test failure analysis to Slack notifications by robbycochran · Pull Request #3381 · stackrox/collector

robbycochran · 2026-05-20T19:54:08Z

Summary

Adds Claude CI failure analysis that runs before posting failures to Slack.

codecov-commenter · 2026-05-20T20:12:26Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 27.34%. Comparing base (01135a9) to head (4b9e0a0).
⚠️ Report is 2 commits behind head on master.
✅ All tests successful. No failed tests found.

Additional details and impacted files

@@           Coverage Diff           @@
##           master    #3381   +/-   ##
=======================================
  Coverage   27.34%   27.34%           
=======================================
  Files          95       95           
  Lines        5420     5420           
  Branches     2545     2545           
=======================================
  Hits         1482     1482           
  Misses       3211     3211           
  Partials      727      727

| Flag | Coverage Δ |
|---|---|
| collector-unit-tests | `27.34% <ø> (ø)` |


Automatically analyzes integration test failures using Claude AI and includes intelligent insights in Slack notifications to #team-acs-collector-oncall.

Key features:
- Claude has full source code access via claude-code-base-action
- Analyzes JUnit XML reports, failing test source, and git history
- Detects platform-specific patterns (arch/OS)
- Provides file:line precision and actionable recommendations
- Skill-based approach (.claude/commands/) for maintainability
- Graceful fallback if analysis fails
- Test with PR label without Slack spam

Architecture:
- Integration tests fail → collect-failures job identifies failures
- analyze-and-notify reusable workflow runs Claude skill
- Claude creates analysis-report.md with root cause analysis
- notify job posts to Slack (skipped for PR label tests)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>

Molter73

I've only skimmed through the PR, but honestly I'm not very happy about it; it looks like a lot more work than it needs to be. ~~I'm assuming this was vibe coded and~~ (Just noticed the commit author) This needs a lot of cleanup before it can go in, IMO.

Molter73 · 2026-05-21T09:11:04Z

I'm confused, why do we need a completely separate workflow for this? The integration-tests one already has GCP authenticated, it has all the reports downloaded... Can't we just add the couple of steps that do the analysis in there and update the notify step? If you want the analysis step to be reusable, I'd suggest using an action instead of a full-on workflow, but I don't see anything particularly complicated. I would venture to say all you need is to add the "Analyze test failures with Claude" step to the integration tests.
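A rough sketch of what that suggestion could look like, assuming the analysis is packaged as a local composite action and called from the job that already has GCP credentials and the downloaded reports. The action path `./.github/actions/analyze-test-failures`, the `reports-dir` input, the report path, and the step id are hypothetical names that do not appear in this PR; only the `continue-on-error` and `env` lines are taken from the diff under review.

```yaml
# Sketch only: one extra step in the existing integration-tests workflow,
# placed after the steps that authenticate to GCP and download the reports.
steps:
  # ... existing steps (GCP auth, report download) stay as they are ...

  - name: Analyze test failures with Claude
    id: analyze                      # hypothetical step id
    continue-on-error: true          # a failed analysis should not block the Slack notification
    uses: ./.github/actions/analyze-test-failures   # hypothetical local composite action
    with:
      reports-dir: ./test-reports    # hypothetical input name and path
    env:
      ANTHROPIC_VERTEX_PROJECT_ID: ${{ secrets.GCP_CLAUDE_PROJECT_ID }}
      CLOUD_ML_REGION: us-east5

  # ... the existing notify step would then include the analysis output, if any ...
```

Keeping the analysis as a single step (or a small action) would avoid re-authenticating and re-downloading artifacts in a second workflow, which seems to be the main concern raised here.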

Molter73 · 2026-05-21T09:13:32Z

Why is this in .github/scripts? Also, is this just a description of what analyze-test-failures.md does? Do we need 200+ lines of markdown to explain what a separate 100+ line markdown file does?

Molter73 · 2026-05-21T09:16:34Z

+        continue-on-error: true
+        env:
+          ANTHROPIC_VERTEX_PROJECT_ID: ${{ secrets.GCP_CLAUDE_PROJECT_ID }}
+          CLOUD_ML_REGION: us-east5


Should this be a GHA variable instead? You know... So we can change it without opening a PR if we have to update it.
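For illustration, a minimal sketch of the same `env` block reading the region from a GitHub Actions configuration variable through the `vars` context. The variable name `CLAUDE_CLOUD_ML_REGION` is an assumption made for this example and would have to be created in the repository or organization settings; the fallback keeps the value currently hardcoded in the diff.

```yaml
        continue-on-error: true
        env:
          ANTHROPIC_VERTEX_PROJECT_ID: ${{ secrets.GCP_CLAUDE_PROJECT_ID }}
          # CLAUDE_CLOUD_ML_REGION is a hypothetical variable name.
          # An unset variable evaluates to an empty string, so `||` falls back to us-east5.
          CLOUD_ML_REGION: ${{ vars.CLAUDE_CLOUD_ML_REGION || 'us-east5' }}
```

With this shape, the region could be changed from the settings page without opening a PR, while the workflow still behaves as it does today when the variable is not defined.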

robbycochran requested a review from a team as a code owner May 20, 2026 19:54

robbycochran added the test-oncall-workflow label May 20, 2026

robbycochran mentioned this pull request May 20, 2026

feat: add /ci-report slash command for oncall CI monitoring #3170

Draft

robbycochran force-pushed the add-test-analysis-job branch from bed7306 to ebe5421 Compare May 21, 2026 01:41

StackRox Automation and others added 3 commits May 21, 2026 02:07

Remove actionlint.yaml

31cea3d

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>

Fix shellcheck warnings: quote $GITHUB_OUTPUT and group redirects

37575ca

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>

Consolidate Slack notification into single step with inline fallback

4b9e0a0

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>

Molter73 reviewed May 21, 2026

robbycochran wants to merge 4 commits into master from add-test-analysis-job




