Commit f72717a
fix: curl-security-review-001 MCP verifier — wrong mirror URL in clone manifest
The Dockerfile.sg_only clone manifest had "sg-evals/curl/curl" (invalid 3-segment
GitHub path) instead of "sg-evals/curl--09e25b9d" (the actual mirror). This caused
the clone-at-verify step to fail, inject_defects.sh to crash on the empty workspace,
and set -e to propagate the failure — killing test.sh before writing reward.txt.
Every MCP run hit RewardFileNotFoundError. Validated fix: MCP rerun scores 0.51.
Also added defensive error handling around inject_defects.sh in the verifier wrapper
so future clone failures degrade gracefully instead of crashing.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>1 parent 87f1a5d commit f72717a
File tree
2 files changed
+6
-3
lines changed- benchmarks/ccb_test/curl-security-review-001
- environment
- tests
2 files changed
+6
-3
lines changedLines changed: 1 addition & 1 deletion
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
36 | 36 | | |
37 | 37 | | |
38 | 38 | | |
39 | | - | |
| 39 | + | |
40 | 40 | | |
41 | 41 | | |
42 | 42 | | |
| |||
Lines changed: 5 additions & 2 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
151 | 151 | | |
152 | 152 | | |
153 | 153 | | |
154 | | - | |
155 | | - | |
| 154 | + | |
| 155 | + | |
| 156 | + | |
| 157 | + | |
| 158 | + | |
156 | 159 | | |
157 | 160 | | |
158 | 161 | | |
| |||
0 commit comments