You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
That is a weighting effect, not a contradiction: the `>40M` band has large absolute savings and enough mass to pull the overall weighted average down even when several smaller bands are MCP-expensive.
Figure: `docs/assets/blog/codescalebench_mcp/figure_7_cost_pairing_by_model_and_size.{png,svg}` (haiku only, size-binned by estimated LOC from GitHub repo size).
This replaces the prior single pooled cost headline and is the canonical cost estimate in this report. For the most stable comparison set (haiku, `n=392` valid pairs), MCP reduces average cost per task from **$0.7333** to **$0.5121** (**-30.16%**). Model effects remain heterogeneous: sonnet is slightly cheaper with MCP, while opus is more expensive.
1112
-
1113
-
**Haiku cost by estimated codebase LOC (same canonical pairing):**
1114
-
1115
-
| Estimated LOC Band | n | BL $/task | MCP $/task | Δ $/task | MCP vs BL |
Method note: size bins are derived from GitHub repo size in KB and mapped to LOC bands;`unknown` indicates missing or unresolved repository metadata.
1109
+
This is the canonical cost estimate in this report: for haiku (`n=392` valid pairs), MCP reduces average cost per task from **$0.7333** to **$0.5121** (**-30.16%**).
0 commit comments