Commit af29675
feat: precision improvements — tighter prompt, pruning pass, parallel execution
- Add precision guidelines to SYSTEM_PROMPT_SUFFIX (exclude test files,
docs, tangential code; aim for 1-5 files on simple bugs)
- Add prune_oracle_cli() that runs a haiku pruning pass to filter
irrelevant files from agent output
- Add --prune flag to validate_on_contextbench.py
- Add --parallel N flag with ThreadPoolExecutor for concurrent tasks
- Refactor main loop into process_one_task() worker function
Phase 1 baseline: composite=0.6426, recall=0.90, precision=0.30
Target: improve precision to push composite above 0.65 threshold.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>1 parent ce7da6c commit af29675
File tree
2 files changed
+204
-34
lines changed- scripts
2 files changed
+204
-34
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
980 | 980 | | |
981 | 981 | | |
982 | 982 | | |
| 983 | + | |
| 984 | + | |
| 985 | + | |
| 986 | + | |
| 987 | + | |
| 988 | + | |
| 989 | + | |
| 990 | + | |
| 991 | + | |
| 992 | + | |
983 | 993 | | |
984 | 994 | | |
985 | 995 | | |
| |||
1264 | 1274 | | |
1265 | 1275 | | |
1266 | 1276 | | |
| 1277 | + | |
| 1278 | + | |
| 1279 | + | |
| 1280 | + | |
| 1281 | + | |
| 1282 | + | |
| 1283 | + | |
| 1284 | + | |
| 1285 | + | |
| 1286 | + | |
| 1287 | + | |
| 1288 | + | |
| 1289 | + | |
| 1290 | + | |
| 1291 | + | |
| 1292 | + | |
| 1293 | + | |
| 1294 | + | |
| 1295 | + | |
| 1296 | + | |
| 1297 | + | |
| 1298 | + | |
| 1299 | + | |
| 1300 | + | |
| 1301 | + | |
| 1302 | + | |
| 1303 | + | |
| 1304 | + | |
| 1305 | + | |
| 1306 | + | |
| 1307 | + | |
| 1308 | + | |
| 1309 | + | |
| 1310 | + | |
| 1311 | + | |
| 1312 | + | |
| 1313 | + | |
| 1314 | + | |
| 1315 | + | |
| 1316 | + | |
| 1317 | + | |
| 1318 | + | |
| 1319 | + | |
| 1320 | + | |
| 1321 | + | |
| 1322 | + | |
| 1323 | + | |
| 1324 | + | |
| 1325 | + | |
| 1326 | + | |
| 1327 | + | |
| 1328 | + | |
| 1329 | + | |
| 1330 | + | |
| 1331 | + | |
| 1332 | + | |
| 1333 | + | |
| 1334 | + | |
| 1335 | + | |
| 1336 | + | |
| 1337 | + | |
| 1338 | + | |
| 1339 | + | |
| 1340 | + | |
| 1341 | + | |
| 1342 | + | |
| 1343 | + | |
| 1344 | + | |
| 1345 | + | |
| 1346 | + | |
| 1347 | + | |
| 1348 | + | |
| 1349 | + | |
| 1350 | + | |
| 1351 | + | |
| 1352 | + | |
| 1353 | + | |
| 1354 | + | |
| 1355 | + | |
| 1356 | + | |
| 1357 | + | |
| 1358 | + | |
| 1359 | + | |
| 1360 | + | |
| 1361 | + | |
| 1362 | + | |
| 1363 | + | |
| 1364 | + | |
| 1365 | + | |
| 1366 | + | |
| 1367 | + | |
| 1368 | + | |
| 1369 | + | |
| 1370 | + | |
| 1371 | + | |
| 1372 | + | |
| 1373 | + | |
| 1374 | + | |
| 1375 | + | |
| 1376 | + | |
| 1377 | + | |
| 1378 | + | |
| 1379 | + | |
| 1380 | + | |
| 1381 | + | |
| 1382 | + | |
| 1383 | + | |
| 1384 | + | |
| 1385 | + | |
| 1386 | + | |
| 1387 | + | |
| 1388 | + | |
| 1389 | + | |
1267 | 1390 | | |
1268 | 1391 | | |
1269 | 1392 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
39 | 39 | | |
40 | 40 | | |
41 | 41 | | |
| 42 | + | |
42 | 43 | | |
43 | 44 | | |
44 | 45 | | |
| |||
732 | 733 | | |
733 | 734 | | |
734 | 735 | | |
| 736 | + | |
| 737 | + | |
| 738 | + | |
| 739 | + | |
| 740 | + | |
| 741 | + | |
| 742 | + | |
| 743 | + | |
735 | 744 | | |
736 | 745 | | |
737 | 746 | | |
| |||
844 | 853 | | |
845 | 854 | | |
846 | 855 | | |
847 | | - | |
848 | | - | |
849 | | - | |
850 | | - | |
851 | | - | |
852 | | - | |
853 | | - | |
854 | | - | |
855 | | - | |
856 | | - | |
857 | | - | |
858 | | - | |
859 | | - | |
| 856 | + | |
| 857 | + | |
| 858 | + | |
| 859 | + | |
860 | 860 | | |
861 | 861 | | |
862 | | - | |
863 | 862 | | |
864 | 863 | | |
865 | 864 | | |
866 | 865 | | |
867 | 866 | | |
868 | | - | |
869 | | - | |
870 | 867 | | |
871 | | - | |
872 | 868 | | |
873 | 869 | | |
874 | 870 | | |
875 | 871 | | |
876 | 872 | | |
877 | | - | |
878 | | - | |
| 873 | + | |
| 874 | + | |
| 875 | + | |
| 876 | + | |
879 | 877 | | |
880 | | - | |
881 | 878 | | |
882 | 879 | | |
883 | | - | |
884 | | - | |
| 880 | + | |
| 881 | + | |
885 | 882 | | |
886 | | - | |
887 | 883 | | |
888 | 884 | | |
889 | 885 | | |
| |||
892 | 888 | | |
893 | 889 | | |
894 | 890 | | |
895 | | - | |
896 | | - | |
| 891 | + | |
| 892 | + | |
| 893 | + | |
| 894 | + | |
| 895 | + | |
| 896 | + | |
| 897 | + | |
| 898 | + | |
| 899 | + | |
| 900 | + | |
| 901 | + | |
| 902 | + | |
| 903 | + | |
| 904 | + | |
| 905 | + | |
| 906 | + | |
| 907 | + | |
| 908 | + | |
897 | 909 | | |
898 | | - | |
| 910 | + | |
| 911 | + | |
| 912 | + | |
899 | 913 | | |
900 | | - | |
901 | 914 | | |
902 | 915 | | |
903 | 916 | | |
904 | 917 | | |
905 | | - | |
906 | | - | |
| 918 | + | |
907 | 919 | | |
908 | | - | |
909 | | - | |
910 | | - | |
911 | | - | |
912 | | - | |
| 920 | + | |
| 921 | + | |
| 922 | + | |
| 923 | + | |
| 924 | + | |
| 925 | + | |
| 926 | + | |
| 927 | + | |
| 928 | + | |
| 929 | + | |
| 930 | + | |
| 931 | + | |
| 932 | + | |
| 933 | + | |
| 934 | + | |
| 935 | + | |
| 936 | + | |
| 937 | + | |
| 938 | + | |
| 939 | + | |
| 940 | + | |
| 941 | + | |
| 942 | + | |
| 943 | + | |
| 944 | + | |
| 945 | + | |
| 946 | + | |
| 947 | + | |
| 948 | + | |
| 949 | + | |
| 950 | + | |
| 951 | + | |
| 952 | + | |
| 953 | + | |
| 954 | + | |
| 955 | + | |
| 956 | + | |
| 957 | + | |
| 958 | + | |
| 959 | + | |
913 | 960 | | |
914 | 961 | | |
915 | 962 | | |
| |||
0 commit comments