Commit d68ffba
Rework evalbuff: commit learning mode, parallel agents, trace compression
Two-mode architecture: learn mode walks git history commit-by-commit,
prompt mode runs a specific task. Both use iterative doc improvement
with parallel agent execution, judging, and keep/reject loop.
Key changes:
- Add commit-task-generator: extracts tasks from git history via LLM
- Add trace-compressor: hybrid compression stores large tool results
in files with inline pointers so doc writer can see agent reasoning
- Rewrite run-evalbuff with runLearnMode/runPromptMode, parallel
agent execution (N runs per task), and iterative doc improvement
- Fix cli-runner timeout: kill entire process group via detached spawn
- Update judge with judgeTaskResult for prompt mode (no ground truth)
- Update docs-optimizer: always analyze, agent trace support, revert
logic that preserves previously-accepted doc edits
- Rewrite tests for new architecture
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>1 parent f8ee6e8 commit d68ffba
File tree
13 files changed
+1782
-1092
lines changed- docs/conventions
- evalbuff
- src
- __tests__
13 files changed
+1782
-1092
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
43 | 43 | | |
44 | 44 | | |
45 | 45 | | |
46 | | - | |
This file was deleted.
0 commit comments