Costs below are recomputed on the same **251 paired tasks** used in Section 11.2, using a single method: `task_metrics.json` `cost_usd` (model-aware pricing including cache read/write tokens).
| Suite | n | Baseline Mean Cost/Task | MCP Mean Cost/Task |
MCP runs cost **37% less** on average ($0.47 vs $0.75 per task). This is driven by the truncated-source environment: with less local code to read, the agent processes fewer input tokens. The **fix** suite is the most expensive ($8.32/task baseline) due to large codebases and extensive multi-file editing. The **mcp_unique** suite is cheapest ($1.40/task) because artifact-mode tasks produce a short JSON answer rather than extensive code changes.
On this paired slice, MCP costs **~3.8% more** on average (+$0.013/task), not less. The cost impact is suite-dependent: MCP is cheaper in `design`, `document`, `understand`, and `mcp_unique`, and more expensive in `build`, `debug`, `fix`, `secure`, and `test`.
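The per-suite comparison can be sketched as follows. This is a minimal illustration, not the script used to produce the numbers above. It assumes each record has already been loaded into a dict with the hypothetical keys `suite`, `task_id`, and `config` (`"baseline"` or `"mcp"`), alongside the `cost_usd` value. Only `cost_usd` is named in this report; the other keys are assumptions.

```python
from collections import defaultdict
from statistics import mean


def paired_cost_summary(records):
    """Summarize mean cost per suite over tasks that have BOTH configs.

    `records` is an iterable of dicts with the assumed keys
    suite, task_id, config ("baseline" or "mcp"), and cost_usd.
    """
    # Group costs by (suite, task) so unpaired tasks can be dropped.
    by_task = defaultdict(dict)
    for r in records:
        by_task[(r["suite"], r["task_id"])][r["config"]] = r["cost_usd"]

    # Keep only tasks that were run under both configurations.
    per_suite = defaultdict(list)
    for (suite, _task), costs in by_task.items():
        if "baseline" in costs and "mcp" in costs:
            per_suite[suite].append((costs["baseline"], costs["mcp"]))

    summary = {}
    for suite, pairs in per_suite.items():
        base = mean(b for b, _ in pairs)
        mcp = mean(m for _, m in pairs)
        summary[suite] = {
            "n": len(pairs),
            "baseline_mean": base,
            "mcp_mean": mcp,
            # Positive means MCP is more expensive than baseline.
            "delta_pct": 100 * (mcp - base) / base if base else None,
        }
    return summary
```

Restricting to paired tasks before averaging keeps the two means comparable: a task that appears under only one configuration would otherwise shift that configuration's mean without a counterpart.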