docs: add dogfood report for v3.4.0 by carlos-alm · Pull Request #611 · optave/codegraph

carlos-alm · 2026-03-26T04:37:22Z

Summary

Dogfooding report for v3.4.0 — comprehensive testing of all commands, engine comparison, benchmarks, and release-specific feature validation.

Rating: 8.5/10 — solid release with clean TypeScript migration, 1.9x faster native builds, and excellent error handling.

Key findings

All 34+ commands pass (cold start graceful degradation, full sweep)
Native engine: 1.9x faster builds, 5.5x faster complexity, 9.6% higher call confidence
Engine parity: <1% edge difference, identical query results
MCP server: 34 tools single-repo, 35 multi-repo, graceful shutdown works
No blocking bugs found
4 suggestions for improvement (DB version warning, barrel file exports, tsconfig warning, EXTENSIONS type change docs)

See generated/dogfood/DOGFOOD_REPORT_v3.4.0.md for full details.

claude · 2026-03-26T04:37:39Z

Claude finished @carlos-alm's task —— View job

Reviewing PR: docs: add dogfood report for v3.4.0

Todo List:

Read and analyze the dogfood report file
Apply Phase 1: Root Cause Analysis
Apply Phase 2: Configuration Change Detection
Apply Phase 3: Test Integrity
Apply Phase 4: Code Quality
Apply Phase 5: Backlog Compliance Assessment
Provide final structured review feedback

Root Cause Analysis

Problem Being Solved: This PR provides comprehensive validation documentation for the v3.4.0 release through systematic dogfooding. The report documents that all 34+ commands work correctly, native/WASM engine comparison results, and validation of release-specific features like TypeScript migration and performance improvements.

Why This Approach: Dogfooding reports serve as release validation documentation and provide transparency about testing thoroughness. This is preventive documentation that validates release quality rather than hiding issues.

Risk Assessment: Zero risk - this is pure documentation with no code changes, configuration modifications, or behavioral changes.

Backlog Compliance

Zero-dep: ✓ — No dependencies added, pure documentation
Foundation-aligned: ✓ — Documentation supports P5 (transparency) and validates core functionality without violating any principles
Problem-fit: 3 — Useful documentation for users to understand release quality and feature validation, but doesn't directly address the core "lost AI" problem
Breaking: No — Documentation-only addition
Tier: 1

Code Quality Analysis

Documentation Quality: Excellent

The dogfood report demonstrates exceptional thoroughness:

Comprehensive Coverage:

34+ command cold-start testing (generated/dogfood/DOGFOOD_REPORT_v3.4.0.md:28-44)
Full command sweep with status validation (lines 51-86)
Engine comparison with detailed metrics (lines 175-225)
Release-specific feature validation (lines 230-244)
Programmatic API testing (lines 250-262)

Technical Rigor:

Quantified performance metrics: "Native 1.9x faster", "5.5x faster complexity" (lines 179, 198)
Engine parity validation: "<1% edge difference" (line 224)
Error handling validation: "100% graceful degradation" (line 45)
Specific test results with concrete numbers throughout

Transparency:

Documents found issues honestly (lines 290-306)
Provides actionable improvement suggestions (lines 312-323)
Includes both positive and negative findings

Structure and Formatting: Very Good

Well-organized sections with clear progression
Comprehensive tables for test results
Proper markdown formatting
Includes both technical metrics and qualitative assessment

Accuracy Validation

The report appears technically accurate:

Claims match expected v3.4.0 features from changelog references
Performance metrics are realistic (1.9x speedup is reasonable for native vs WASM)
Error scenarios tested appropriately
Test methodology is sound

Minor Observations

File location: Report placed in generated/dogfood/ which suggests this is generated content - appropriate placement
Scope appropriateness: Report scope matches PR title - focused on v3.4.0 validation
No configuration changes: Confirmed no risky config modifications
No test modifications: No test expectations lowered or assertions weakened

Final Recommendation

Rating: ⭐⭐⭐⭐☆ (4/5)
Action: APPROVE
Reasoning: This is high-quality release validation documentation that demonstrates thorough testing methodology. The report provides valuable transparency about release quality with no code risk. The comprehensive coverage, technical rigor, and honest reporting of both successes and minor issues make this excellent documentation that serves users well.

The only reason this doesn't merit 5 stars is that documentation PRs, while valuable, don't reach the "exceptional" bar that should be reserved for code contributions that solve critical problems or significantly advance the codebase.

greptile-apps · 2026-03-26T04:39:47Z

Greptile Summary

This PR adds a comprehensive dogfood report for @optave/codegraph@3.4.0, covering a full command sweep (34+ commands), engine comparison (native vs WASM), release-specific changelog validation, MCP server testing, and programmatic API verification. The report is well-structured and the previous round-2 review concerns — engine divergence framing (now references #613), cycle-count attribution (correctly tied to findCaller fallback removal in #607), and the EXTENSIONS/IGNORE_DIRS classification (updated to CAUTION) — have all been addressed.\n\n- Addressed from prior rounds: Engine divergence reframed without "acceptable range" language; cycle reduction from 8→4 attributed to #607; EXTENSIONS/IGNORE_DIRS upgraded from PASS to CAUTION\n- Remaining factual inconsistency: The CAUTION note for EXTENSIONS (Section 7, line 260) attributes the Array→Set change to v3.4.0's TypeScript migration, but the v3.3.1 dogfood report (DOGFOOD_REPORT_v3.3.1.md:215) already shows EXTENSIONS as Set (19 items) — the change predates v3.4.0, making the attribution incorrect\n- Unverified "pre-existing" claim: BUG 1 (tsconfig.json parse failure) is marked "Not filed — pre-existing warning," but the v3.3.1 dogfood report contains no mention of this warning anywhere; the TypeScript migration is the likely trigger, making this a v3.4.0 regression rather than a pre-existing issue\n- Notable codebase-state inconsistency: Section 4 node/edge counts (10,885 nodes, 20,752 edges) differ from Section 5 (10,941 nodes, 20,849 edges) because different commits were active at capture time — this is acknowledged in the report but slightly undermines the report as a snapshot of the released binary

Confidence Score: 4/5

Safe to merge after addressing the EXTENSIONS version-attribution inconsistency; no blocking issues remain.

Prior round-2 concerns are fully resolved and the report is thorough and well-evidenced. One concrete factual error remains — the EXTENSIONS Array→Set change is attributed to v3.4.0 but the v3.3.1 dogfood report shows it was already a Set, meaning the breaking-change attribution is wrong. A targeted one-line correction in Section 7 and Section 9.4 would resolve this. The tsconfig 'pre-existing' claim is a softer P2. This is on the happy path to merge with one targeted fix remaining.

generated/dogfood/DOGFOOD_REPORT_v3.4.0.md — Section 7 EXTENSIONS row (line 260) and Section 9.4 need correction on the version attribution of the Array→Set type change.

Important Files Changed

Filename	Overview
generated/dogfood/DOGFOOD_REPORT_v3.4.0.md	Comprehensive dogfood report for v3.4.0; prior review concerns (engine divergence framing, cycle count attribution, EXTENSIONS/IGNORE_DIRS classification) all addressed, but one factual inconsistency remains: EXTENSIONS was already a Set in v3.3.1, contradicting the report's attribution of the Array→Set change to v3.4.0's TS migration.

Flowchart

%%{init: {'theme': 'neutral'}}%%
flowchart TD
    A[Install v3.4.0] --> B[Setup & Installation]
    B --> C[Cold Start — 34+ commands\ngraceful degradation]
    C --> D[codegraph build\n462 files, 10941 nodes, 20849 edges]
    D --> E[Full Command Sweep\n34+ commands — all PASS]
    D --> F[Engine Comparison\nNative vs WASM]
    F --> F1[Native 1.9x faster build\n5.5x faster complexity]
    F --> F2[~68 call-edge divergence\nfiled as #613]
    E --> G[Rebuild & Staleness Tests\nincremental, no-op, force rebuild]
    G --> H[Release-Specific Tests\n10 v3.4.0 changelog items]
    H --> I[Additional Testing\nAPI, MCP, Registry, Config]
    I --> J[Bugs Found\nBUG 1: tsconfig warning\nBUG 2: no DB version check\nBUG 3: closed — already fixed]
    J --> K[Overall Assessment\n8.5/10 — no blocking bugs\n#613 open for call-edge parity]

_{Reviews (3): Last reviewed commit: "Merge branch 'main' into docs/dogfood-re..." | Re-trigger Greptile}

greptile-apps · 2026-03-26T04:39:50Z

generated/dogfood/DOGFOOD_REPORT_v3.4.0.md

+
+---
+
+## 6. Release-Specific Tests
+


Engine divergence framed as acceptable — violates CLAUDE.md policy

The analysis concludes "The 0.5% edge difference is within acceptable range," but CLAUDE.md is explicit on this point:

"Never document bugs as expected behavior. If two engines (native vs WASM) produce different results, that is a bug in the less-accurate engine — not an acceptable 'parity gap.' Adding comments or tests that frame wrong output as 'expected' blocks future agents from ever fixing it."

The report identifies that:

Native is missing ~68 call edges vs WASM (1.7% gap on ~4,000 calls)

WASM call confidence is 71.3% vs native 80.9% — a meaningful 9.6% gap

WASM produces 101 more edges overall

These are divergences that should be filed as a bug (e.g., "native engine under-extracts ~68 call sites that WASM finds" or "WASM over-extracts at lower confidence"), not documented as tolerable. The suggestion is to either open a tracking issue for the call-edge gap or remove the "within acceptable range" framing and replace it with "filed as #XYZ for investigation."

Context Used: CLAUDE.md (source)

Fixed. Removed the "within acceptable range" framing entirely. The analysis now states the divergence plainly and references #613, which I filed to track the investigation into which engine is correct for the ~68 divergent call edges.

Also updated the overall assessment (Section 11) and issues table (Section 12) to reference #613 rather than presenting the gap as tolerable.

greptile-apps · 2026-03-26T04:39:51Z

generated/dogfood/DOGFOOD_REPORT_v3.4.0.md

+
+### Query Commands
+
+| Command | Flags Tested | Status | Notes |


Function-level cycle count halved without explanation

The v3.3.1 dogfood report records cycles --functions as native: 8, WASM: 11 (with bug #597 filed for the discrepancy). This report shows both engines at 4 — a reduction of 50% for native — but there is no mention of this in the Section 6 release-specific tests, the bugs section, or the analysis.

Either the reduction was caused by a fix in v3.4.0 (e.g., findCaller fallback removal in #607 changing the call edge set) — in which case it should be explicitly validated and noted — or it represents an unexpected change that warrants investigation. The report should document which v3.4.0 change is responsible and confirm the remaining 4 cycles are the correct set.

Fixed. Added explicit attribution in the cycles row: the reduction from 8 native / 11 WASM in v3.3.1 to 4 (both engines) in v3.4.0 is attributed to the findCaller fallback removal in #607, which eliminated misattributed call edges that created false cycles. Both engines now agree on 4 cycles.

greptile-apps · 2026-03-26T04:39:52Z

generated/dogfood/DOGFOOD_REPORT_v3.4.0.md

+
+**CJS compatibility:** `require('@optave/codegraph')` works with `await` (async wrapper). Without `await`, returns Promise. Documented in the CJS wrapper file.
+
+### MCP Server
+
+| Test | Result |
+|------|--------|
+| JSON-RPC initialize | PASS — returns protocol version + server info |


EXTENSIONS/IGNORE_DIRS breaking change inconsistently classified

Section 7 marks EXTENSIONS and IGNORE_DIRS as PASS (changed from Array to Set in TS migration), while Section 9.4 correctly flags the same change as a "subtle breaking change" for consumers using .includes(), .indexOf(), etc.

These two sections contradict each other. A breaking API change should not be marked PASS — it should either be classified as a bug requiring a deprecation shim or a CHANGELOG entry, not just a suggestion. Consider updating the Section 7 row to reflect the breakage (e.g., CAUTION — breaking API change for downstream consumers) and adding a link to a filed issue or CHANGELOG update.

Fixed. Changed Section 7 classification from PASS to CAUTION for both EXTENSIONS and IGNORE_DIRS, with a note that this is a breaking API change (Array to Set) and a cross-reference to Section 9.4.

- Remove "within acceptable range" framing for engine call-edge divergence; file #613 to track the native/WASM gap per CLAUDE.md policy - Add explanation for cycle count reduction (8→4) attributing it to findCaller fallback removal in #607 - Reclassify EXTENSIONS/IGNORE_DIRS Array→Set change from PASS to CAUTION in Section 7 to match Section 9.4 breaking change note - Update issues table and overall assessment to reference #613

carlos-alm · 2026-03-26T05:15:45Z

@greptileai

…3.4.0

- Explain Section 4 vs Section 5 node/edge count discrepancy (different codebase states during testing) - Replace speculative tsconfig.json root cause with confirmed JSONC comments diagnosis - Add explicit analysis of engine-parity direction reversal from v3.3.1 to v3.4.0, connecting it to findCaller fallback removal in #607

carlos-alm · 2026-03-26T06:52:51Z

Addressed all three items from the round 2 review:

Section 4 vs Section 5 count discrepancy: Added a note to Section 4 explaining that the two sections were captured at different codebase states (Section 4 rebuild was run before the final v3.4.0 fixes, Section 5 engine comparison was run later). Counts are internally consistent within each section.
BUG 1 unverified root cause: Replaced the speculative "likely has a comment or special character" with the confirmed diagnosis: tsconfig.json uses JSONC syntax (inline /* ... */ comments on 7 lines) which standard JSON.parse rejects.
Engine-parity direction reversal: Added explicit analysis connecting the v3.3.1-to-v3.4.0 parity flip to the findCaller fallback removal in fix(edges): remove findCaller fallback that created false cycles in WASM #607. In v3.3.1, native produced +14 calls vs WASM; in v3.4.0, WASM produces +68. The reversal is consistent with removing the native-side heuristic that over-attributed calls.

Also merged main to pick up the await_expression AST fix, which should resolve the CI failure in ast-parity.test.ts.

carlos-alm · 2026-03-26T06:53:01Z

@greptileai

docs: add dogfood report for v3.4.0

b832165

greptile-apps bot reviewed Mar 26, 2026

View reviewed changes

Merge branch 'main' into docs/dogfood-report-v3.4.0

bac908a

carlos-alm mentioned this pull request Mar 26, 2026

bug(native): native engine under-extracts ~68 call edges vs WASM #613

Open

carlos-alm added 2 commits March 26, 2026 00:51

Merge remote-tracking branch 'origin/main' into docs/dogfood-report-v…

5338a53

…3.4.0

Merge branch 'main' into docs/dogfood-report-v3.4.0

5f836b6

carlos-alm merged commit fd26125 into main Mar 26, 2026
13 checks passed

carlos-alm deleted the docs/dogfood-report-v3.4.0 branch March 26, 2026 07:17

github-actions bot locked and limited conversation to collaborators Mar 26, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs: add dogfood report for v3.4.0#611

docs: add dogfood report for v3.4.0#611
carlos-alm merged 6 commits intomainfrom
docs/dogfood-report-v3.4.0

carlos-alm commented Mar 26, 2026

Uh oh!

claude bot commented Mar 26, 2026 •

edited

Loading

Uh oh!

greptile-apps bot commented Mar 26, 2026 •

edited

Loading

Uh oh!

greptile-apps bot Mar 26, 2026

Uh oh!

carlos-alm Mar 26, 2026

Uh oh!

greptile-apps bot Mar 26, 2026

Uh oh!

carlos-alm Mar 26, 2026

Uh oh!

greptile-apps bot Mar 26, 2026

Uh oh!

carlos-alm Mar 26, 2026

Uh oh!

carlos-alm commented Mar 26, 2026

Uh oh!

carlos-alm commented Mar 26, 2026

Uh oh!

carlos-alm commented Mar 26, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant


		### Query Commands

		\| Command \| Flags Tested \| Status \| Notes \|

Conversation

carlos-alm commented Mar 26, 2026

Summary

Key findings

Uh oh!

claude bot commented Mar 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Reviewing PR: docs: add dogfood report for v3.4.0

Todo List:

Root Cause Analysis

Backlog Compliance

Code Quality Analysis

Documentation Quality: Excellent

Structure and Formatting: Very Good

Accuracy Validation

Minor Observations

Final Recommendation

Uh oh!

greptile-apps bot commented Mar 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Greptile Summary

Confidence Score: 4/5

Important Files Changed

Flowchart

Uh oh!

greptile-apps bot Mar 26, 2026

Choose a reason for hiding this comment

Uh oh!

carlos-alm Mar 26, 2026

Choose a reason for hiding this comment

Uh oh!

greptile-apps bot Mar 26, 2026

Choose a reason for hiding this comment

Uh oh!

carlos-alm Mar 26, 2026

Choose a reason for hiding this comment

Uh oh!

greptile-apps bot Mar 26, 2026

Choose a reason for hiding this comment

Uh oh!

carlos-alm Mar 26, 2026

Choose a reason for hiding this comment

Uh oh!

carlos-alm commented Mar 26, 2026

Uh oh!

carlos-alm commented Mar 26, 2026

Uh oh!

carlos-alm commented Mar 26, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

claude bot commented Mar 26, 2026 •

edited

Loading

greptile-apps bot commented Mar 26, 2026 •

edited

Loading