stacklok
diff --git a/‎.claude/skills/implement-story/SKILL.md‎
Lines changed: 218 additions & 0 deletions b/‎.claude/skills/implement-story/SKILL.md‎
Lines changed: 218 additions & 0 deletions
diff --git a/‎.github/workflows/claude.yml‎
Lines changed: 1 addition & 1 deletion b/‎.github/workflows/claude.yml‎
Lines changed: 1 addition & 1 deletion
diff --git a/‎cmd/thv-operator/api/v1alpha1/embeddingserver_types.go‎
Lines changed: 2 additions & 0 deletions b/‎cmd/thv-operator/api/v1alpha1/embeddingserver_types.go‎
Lines changed: 2 additions & 0 deletions
diff --git a/‎cmd/thv-operator/api/v1alpha1/mcpexternalauthconfig_types.go‎
Lines changed: 2 additions & 0 deletions b/‎cmd/thv-operator/api/v1alpha1/mcpexternalauthconfig_types.go‎
Lines changed: 2 additions & 0 deletions
diff --git a/‎cmd/thv-operator/api/v1alpha1/mcpgroup_types.go‎
Lines changed: 2 additions & 0 deletions b/‎cmd/thv-operator/api/v1alpha1/mcpgroup_types.go‎
Lines changed: 2 additions & 0 deletions
diff --git a/‎cmd/thv-operator/api/v1alpha1/mcpoidcconfig_types.go‎
Lines changed: 2 additions & 0 deletions b/‎cmd/thv-operator/api/v1alpha1/mcpoidcconfig_types.go‎
Lines changed: 2 additions & 0 deletions
diff --git a/‎cmd/thv-operator/api/v1alpha1/mcpremoteproxy_types.go‎
Lines changed: 3 additions & 0 deletions b/‎cmd/thv-operator/api/v1alpha1/mcpremoteproxy_types.go‎
Lines changed: 3 additions & 0 deletions
diff --git a/‎cmd/thv-operator/api/v1alpha1/mcpserver_types.go‎
Lines changed: 2 additions & 0 deletions b/‎cmd/thv-operator/api/v1alpha1/mcpserver_types.go‎
Lines changed: 2 additions & 0 deletions
diff --git a/‎cmd/thv-operator/api/v1alpha1/mcptelemetryconfig_types.go‎
Lines changed: 2 additions & 0 deletions b/‎cmd/thv-operator/api/v1alpha1/mcptelemetryconfig_types.go‎
Lines changed: 2 additions & 0 deletions
diff --git a/‎cmd/thv-operator/api/v1alpha1/virtualmcpcompositetooldefinition_types.go‎
Lines changed: 2 additions & 0 deletions b/‎cmd/thv-operator/api/v1alpha1/virtualmcpcompositetooldefinition_types.go‎
Lines changed: 2 additions & 0 deletions
@@ -0,0 +1,218 @@
+---
+name: implement-story
+description: Implements a GitHub user story from planning through PR creation, with research, codebase analysis, and structured commits.
+---
+
+# Implement User Story
+
+Takes a GitHub user story issue and produces well-organized PR(s) that reliably meet the acceptance criteria.
+
+## Arguments
+
+The user provides a GitHub issue number or URL. Example:
+
+```
+/implement-story #4550
+/implement-story https://github.com/stacklok/toolhive/issues/4550
+```
+
+---
+
+## Phase 1: Gather Context
+
+### 1.1 Read the Issue
+
+Fetch the issue body using GitHub tools. Extract:
+
+- **User story**: The "As a / I want / so that" statement
+- **Acceptance criteria**: The checkbox list — this is the contract
+- **Context links**: RFC links, related issues, dependencies
+- **Out of scope**: What NOT to do
+
+### 1.2 Fetch RFC Context
+
+If the issue links to an RFC (look for `THV-XXXX` references or links to `toolhive-rfcs`):
+
+1. Clone or locate the RFC repo locally (check `../toolhive-rfcs/` first)
+2. Read the full RFC document
+3. Extract design decisions relevant to this story — config shapes, algorithm details, error formats, key schemas, etc.
+
+If no RFC is linked, skip this step.
+
+### 1.3 Find Related Stories
+
+Search for sibling stories that share context with this one. These inform how to factor the code for extensibility:
+
+```bash
+# Search by keywords from the issue title
+gh search issues "<keywords>" --repo stacklok/toolhive --state open --limit 10
+
+# Search for issues linking to the same RFC
+gh search issues "THV-XXXX" --repo stacklok/toolhive --limit 10
+```
+
+For each related story, read its acceptance criteria. Ask:
+
+- Will a future story need to extend a type, interface, or package I'm creating?
+- Should I define an interface now that a sibling story will implement later?
+- Are there naming conventions or patterns I should establish that siblings will follow?
+
+**Do not implement sibling stories.** But design the code so they can be implemented without refactoring what you build here.
+
+### 1.4 Research the Codebase
+
+Use the Explore agent or direct search to understand:
+
+1. **Where does this change fit?** Identify the packages, files, and functions that need modification.
+2. **What patterns exist?** Find analogous features already implemented. For example, if adding a new middleware, study how existing middleware (auth, mcp-parser, authz) is registered and wired.
+3. **What gets generated?** Identify files that are auto-generated (CRD manifests, mocks, docs) so you know what to regenerate.
+4. **What tests exist?** Find the test patterns used for similar features (table-driven tests, testcontainers, Chainsaw E2E).
+
+Document your findings before writing any code.
+
+---
+
+## Phase 2: Plan the Work
+
+### 2.1 Map AC to Changes
+
+For each acceptance criterion, identify:
+
+- Which files need to change
+- Whether it's new code or a modification
+- What tests verify it (unit, integration, or E2E)
+
+### 2.2 Decide PR Strategy
+
+Evaluate the total scope against the project's PR guidelines:
+
+- **< 10 files changed** (excluding tests, generated code, docs)
+- **< 400 lines of code changed** (excluding tests, generated code, docs)
+
+If the story fits in one PR, use a single PR. If not, split into multiple PRs following these patterns:
+
+1. **Foundation first**: New types, interfaces, packages
+2. **Wiring second**: Integration into existing code (middleware chain, reconciler, CRD)
+3. **Tests alongside**: Each PR includes its own tests
+4. **Generated code with its trigger**: CRD type changes + `task operator-manifests operator-generate` output in the same PR
+
+### 2.3 Present the Plan
+
+Show the user a plan that covers PR boundaries AND commit boundaries within each PR:
+
+```markdown
+## Implementation Plan
+
+**Story**: #XXXX — [title]
+**PRs**: [1 or N]
+
+### PR 1: [title]
+**Commits**:
+1. [commit message] — [what changes and why]
+2. [commit message] — [what changes and why]
+3. [commit message] — [what changes and why]
+**Tests**:
+- [Unit/E2E]: [what is tested]
+**AC covered**: [which acceptance criteria this satisfies]
+**Regeneration**: [which `task` commands need to run and in which commit]
+
+### PR 2: [title] (if needed)
+...
+```
+
+Wait for user approval before proceeding. Adjust if the user has feedback.
+
+---
+
+## Phase 3: Implement
+
+### 3.1 Create a Branch
+
+```bash
+git checkout -b <user>/<short-description> main
+```
+
+### 3.2 Write Code
+
+Implement the changes from the plan. Follow these principles:
+
+- **Match existing patterns**: Don't invent new conventions. Study the codebase and follow what's there.
+- **Design for siblings**: If related stories will extend this code, use interfaces and clear extension points. But don't build speculative abstractions — just leave the door open.
+- **Tests are not optional**: Every AC that says "Unit:" or "E2E:" must have a corresponding test. Write tests as you go, not at the end.
+
+### 3.3 Commit Per the Plan
+
+Follow the commit boundaries from the plan. Each commit should:
+
+- Be independently compilable (`go build ./...` passes)
+- Have a clear, descriptive message
+- Group related changes (e.g., don't mix CRD type changes with middleware logic)
+
+### 3.4 Run Regeneration Tasks
+
+After changes that affect generated artifacts, run the appropriate tasks:
+
+| Change Type | Regeneration Command |
+|-------------|---------------------|
+| CRD type definitions (`api/v1alpha1/*_types.go`) | `task operator-manifests operator-generate` |
+| Mock interfaces | `task gen` |
+| CLI commands or API endpoints | `task docs` |
+| Helm chart values | `task helm-docs` |
+| Any Go file | `task license-fix` |
+
+Run these **before committing** the related changes. Include the generated output in the same commit as the trigger.
+
+---
+
+## Phase 4: Create PR
+
+### 4.1 Push and Create PR
+
+Follow the PR template at `.github/pull_request_template.md` and the rules in `.claude/rules/pr-creation.md`:
+
+- Title: under 70 chars, imperative mood, no conventional commit prefix
+- Summary: why first, then what. Reference the issue with `Closes #XXXX`
+- Type of change: check exactly one
+- Test plan: check every verification step actually run
+
+### 4.2 Verify AC Coverage
+
+Before submitting, review each acceptance criterion from the issue:
+
+- [ ] Is there code that implements it?
+- [ ] Is there a test that verifies it?
+- [ ] Has the test passed?
+
+If any AC is not covered, either implement it or flag it to the user with a reason.
+
+### 4.3 Babysit CI
+
+After pushing, monitor CI status:
+
+```bash
+gh pr checks <pr-number> --repo stacklok/toolhive --watch
+```
+
+If CI fails:
+1. Read the failure logs
+2. Fix the issue
+3. Push the fix as a new commit (don't amend — keep the history clean for review)
+4. Re-check CI
+
+### 4.4 Multi-PR Workflow
+
+If the story spans multiple PRs:
+
+1. Create the first PR targeting `main`
+2. After merge, create subsequent PRs targeting `main`
+3. Each PR references the story issue (`Part of #XXXX`)
+4. The final PR uses `Closes #XXXX`
+
+---
+
+## Edge Cases
+
+- **AC references another story**: If an acceptance criterion depends on work from another story (e.g., "STORY-001 core middleware exists"), check if that story is merged. If not, flag it to the user.
+- **Generated code is large**: CRD manifest regeneration can produce hundreds of lines of diff. This is expected — note it in the PR description under "Special notes for reviewers."
+- **Tests require infrastructure**: E2E tests may need a Kind cluster, Redis, or Keycloak. Document the setup in the test plan. Don't skip the test — write it even if the user will run it separately.
+- **RFC is ambiguous**: If the RFC doesn't specify a detail needed for implementation, make a pragmatic choice, document it in a code comment, and flag it in the PR description.
@@ -59,7 +59,7 @@ jobs:
 
       - name: Run Claude Code
         id: claude
-        uses: anthropics/claude-code-action@88c168b39e7e64da0286d812b6e9fbebb6708185 # v1
+        uses: anthropics/claude-code-action@1eddb334cfa79fdb21ecbe2180ca1a016e8e7d47 # v1
         with:
           anthropic_api_key: ${{ secrets.ANTHROPIC_API_KEY }}
           # Security: Restrict tools to prevent arbitrary code execution.
 
@@ -156,6 +156,8 @@ type EmbeddingStatefulSetOverrides struct {
 // EmbeddingServerStatus defines the observed state of EmbeddingServer
 type EmbeddingServerStatus struct {
 	// Conditions represent the latest available observations of the EmbeddingServer's state
+	// +listType=map
+	// +listMapKey=type
 	// +optional
 	Conditions []metav1.Condition `json:"conditions,omitempty"`
 
 
@@ -704,6 +704,8 @@ type UpstreamInjectSpec struct {
 // MCPExternalAuthConfigStatus defines the observed state of MCPExternalAuthConfig
 type MCPExternalAuthConfigStatus struct {
 	// Conditions represent the latest available observations of the MCPExternalAuthConfig's state
+	// +listType=map
+	// +listMapKey=type
 	// +optional
 	Conditions []metav1.Condition `json:"conditions,omitempty"`
 
 
@@ -42,6 +42,8 @@ type MCPGroupStatus struct {
 	RemoteProxyCount int `json:"remoteProxyCount,omitempty"`
 
 	// Conditions represent observations
+	// +listType=map
+	// +listMapKey=type
 	// +optional
 	Conditions []metav1.Condition `json:"conditions,omitempty"`
 }
 
@@ -168,6 +168,8 @@ type WorkloadReference struct {
 // MCPOIDCConfigStatus defines the observed state of MCPOIDCConfig
 type MCPOIDCConfigStatus struct {
 	// Conditions represent the latest available observations of the MCPOIDCConfig's state
+	// +listType=map
+	// +listMapKey=type
 	// +optional
 	Conditions []metav1.Condition `json:"conditions,omitempty"`
 
 
@@ -157,6 +157,8 @@ type MCPRemoteProxyStatus struct {
 	ObservedGeneration int64 `json:"observedGeneration,omitempty"`
 
 	// Conditions represent the latest available observations of the MCPRemoteProxy's state
+	// +listType=map
+	// +listMapKey=type
 	// +optional
 	Conditions []metav1.Condition `json:"conditions,omitempty"`
 
@@ -304,6 +306,7 @@ const (
 //+kubebuilder:printcolumn:name="Phase",type="string",JSONPath=".status.phase"
 //+kubebuilder:printcolumn:name="Remote URL",type="string",JSONPath=".spec.remoteURL"
 //+kubebuilder:printcolumn:name="URL",type="string",JSONPath=".status.url"
+//+kubebuilder:printcolumn:name="Ready",type="string",JSONPath=".status.conditions[?(@.type=='Ready')].status"
 //+kubebuilder:printcolumn:name="Age",type="date",JSONPath=".metadata.creationTimestamp"
 
 // MCPRemoteProxy is the Schema for the mcpremoteproxies API
 
@@ -882,6 +882,8 @@ type OpenTelemetryMetricsConfig struct {
 // MCPServerStatus defines the observed state of MCPServer
 type MCPServerStatus struct {
 	// Conditions represent the latest available observations of the MCPServer's state
+	// +listType=map
+	// +listMapKey=type
 	// +optional
 	Conditions []metav1.Condition `json:"conditions,omitempty"`
 
 
@@ -98,6 +98,8 @@ type MCPTelemetryConfigSpec struct {
 // MCPTelemetryConfigStatus defines the observed state of MCPTelemetryConfig
 type MCPTelemetryConfigStatus struct {
 	// Conditions represent the latest available observations of the MCPTelemetryConfig's state
+	// +listType=map
+	// +listMapKey=type
 	// +optional
 	Conditions []metav1.Condition `json:"conditions,omitempty"`
 
 
@@ -39,6 +39,8 @@ type VirtualMCPCompositeToolDefinitionStatus struct {
 	ObservedGeneration int64 `json:"observedGeneration,omitempty"`
 
 	// Conditions represent the latest available observations of the workflow's state
+	// +listType=map
+	// +listMapKey=type
 	// +optional
 	Conditions []metav1.Condition `json:"conditions,omitempty"`
 }
Original file line number	Diff line number	Diff line change
`@@ -42,6 +42,8 @@ type MCPGroupStatus struct {`
`42`	`42`	RemoteProxyCount int `json:"remoteProxyCount,omitempty"`
`43`	`43`
`44`	`44`	`// Conditions represent observations`
	`45`	`+ // +listType=map`
	`46`	`+ // +listMapKey=type`
`45`	`47`	`// +optional`
`46`	`48`	Conditions []metav1.Condition `json:"conditions,omitempty"`
`47`	`49`	`}`
Original file line number	Diff line number	Diff line change
`@@ -39,6 +39,8 @@ type VirtualMCPCompositeToolDefinitionStatus struct {`
`39`	`39`	ObservedGeneration int64 `json:"observedGeneration,omitempty"`
`40`	`40`
`41`	`41`	`// Conditions represent the latest available observations of the workflow's state`
	`42`	`+ // +listType=map`
	`43`	`+ // +listMapKey=type`
`42`	`44`	`// +optional`
`43`	`45`	Conditions []metav1.Condition `json:"conditions,omitempty"`
`44`	`46`	`}`