[codex] Build coding-deepgent local parity baselines by psymoth · Pull Request #220 · shareAI-lab/learn-claude-code

psymoth · 2026-04-14T13:26:46Z

Summary

Build the current coding-deepgent local parity baselines:

keep Approach A MVP as a verified historical baseline
complete Circle 1 local daily-driver parity baseline
complete a local Circle 2 expanded parity baseline
keep hosted SaaS ingress, multi-user auth, public marketplace backend, and cross-machine workers explicitly out of scope

This PR is no longer just an MVP closeout branch. It now represents the current local product baseline for coding-deepgent.

Scope

Historical MVP closeout included on this branch

The branch still contains the earlier Stage 12-29 work:

context / compact / session / recovery hardening
durable task / plan / verifier boundaries
bounded subagent / fork runtime
local extension foundation
observability / evidence closeout
canonical MVP dashboard and deferred-boundary ADR work

Circle 1 local daily-driver parity baseline

Implemented on this branch:

runtime-core parity pack
session inspect / history / projection / timeline / evidence / permissions CLI surfaces
durable task / plan control CLI surfaces
active TUI background-subagent controls and runtime snapshots
local skills / MCP / hooks / plugins inspect / validate / debug surfaces
deterministic coding-deepgent acceptance circle1

Circle 2 local expanded parity baseline

Implemented on this branch:

durable local event_stream
durable local worker_runtime
local mailbox
local teams orchestration records
local remote session/control records and replay
local extension_lifecycle
local continuity artifacts
deterministic coding-deepgent acceptance circle2

What This PR Delivers Now

The branch establishes a coherent local product baseline with:

one persistent local runtime store
one session/evidence/recovery model
one CLI/TUI product surface
one local extended-control substrate

Explicit Non-Goals / Deferred Beyond This PR

These are still not claimed by the current local baseline:

hosted remote/session ingress service
multi-user auth / org policy / billing
public marketplace backend
cross-machine workers or true distributed daemon supervision
true IDE plugin implementation beyond local remote-control records and surfaces

Reviewer Guide

Recommended review order:

.trellis/project-handoff.md
.trellis/plans/coding-deepgent-full-cc-parity-roadmap.md
.trellis/plans/coding-deepgent-circle-2-expanded-parity-plan.md
coding-deepgent/src/coding_deepgent/ runtime / session / task / subagent / event / worker / mailbox / team / remote / lifecycle / continuity domains
coding-deepgent/src/coding_deepgent/cli.py and cli_service.py
coding-deepgent/frontend/cli and frontend bridge protocol updates
coding-deepgent/tests

Validation

Current branch validation:

pytest -q coding-deepgent/tests -> 438 passed
npm --prefix coding-deepgent/frontend/cli test -> passed
npm --prefix coding-deepgent/frontend/cli run typecheck -> passed
ruff check coding-deepgent/src/coding_deepgent coding-deepgent/tests .trellis/spec .trellis/plans -> passed
python3 -m mypy coding-deepgent/src/coding_deepgent -> passed
PYTHONPATH=coding-deepgent/src python3 -m coding_deepgent acceptance circle1 -> passed
PYTHONPATH=coding-deepgent/src python3 -m coding_deepgent acceptance circle2 -> passed
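
For reviewers who want to rerun the checks above in one pass, here is a minimal sketch of a runner. The commands and the PYTHONPATH value are copied from the list above; the names `CHECKS` and `run_checks` are hypothetical and not part of the branch, and the script assumes it is started from the repository root.

```python
import os
import subprocess

# (argv, extra environment) pairs copied from the Validation list.
CHECKS = [
    (["pytest", "-q", "coding-deepgent/tests"], {}),
    (["npm", "--prefix", "coding-deepgent/frontend/cli", "test"], {}),
    (["npm", "--prefix", "coding-deepgent/frontend/cli", "run", "typecheck"], {}),
    (["ruff", "check", "coding-deepgent/src/coding_deepgent",
      "coding-deepgent/tests", ".trellis/spec", ".trellis/plans"], {}),
    (["python3", "-m", "mypy", "coding-deepgent/src/coding_deepgent"], {}),
    (["python3", "-m", "coding_deepgent", "acceptance", "circle1"],
     {"PYTHONPATH": "coding-deepgent/src"}),
    (["python3", "-m", "coding_deepgent", "acceptance", "circle2"],
     {"PYTHONPATH": "coding-deepgent/src"}),
]


def run_checks(checks=CHECKS, run=subprocess.run):
    """Run every check in order and return the command lines that failed.

    `run` is injectable so the loop can be exercised without the real tools.
    """
    failed = []
    for argv, extra_env in checks:
        # Layer the per-check variables on top of the caller's environment.
        env = {**os.environ, **extra_env}
        result = run(argv, env=env)
        if result.returncode != 0:
            failed.append(" ".join(argv))
    return failed
```

Calling `run_checks()` from the repository root runs all seven commands and returns an empty list when every one exits with status 0. The runner keeps going after a failure so that a single run reports every red check.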

Residual Risks

The PR is broad and spans the full transition from MVP closeout to local Circle 1/2 baselines, so review load is still high even after body cleanup.
Circle 2 currently provides a local expanded baseline, not hosted remote parity.
If future work needs true SaaS ingress, distributed workers, or a marketplace backend, it should be scoped as an explicit new post-baseline phase rather than silently extending the current local model.

The OMX team runtime writes local state under .omx/, and worker worktrees require the leader workspace to be clean before launch. Committing the ignore rule preserves local orchestration artifacts outside source control while unblocking durable team execution.

Constraint: omx team refuses to launch with a dirty leader workspace because it provisions worker worktrees
Rejected: Stash .gitignore before launch | would make .omx/ unignored again during team execution
Confidence: high
Scope-risk: narrow
Directive: Keep .omx/ ignored; do not remove unless replacing the OMX state location
Tested: git diff showed only .omx/ ignore addition
Not-tested: team launch after commit

The first LangChain milestone needs CI evidence that the parallel s01-s06 track exists, compiles without OpenAI credentials, avoids import-time model starts, and preserves visible teaching harness primitives. This adds the guardrail tests and wires CI through requirements.txt so later LangChain dependency additions are installed consistently.

Constraint: Test lane owns tests/CI while code lane still owns agents_langchain implementation
Confidence: medium
Scope-risk: narrow
Tested: python -m py_compile tests/test_langchain_agents_smoke.py; python -m pytest tests/test_agents_smoke.py -q
Not-tested: tests/test_langchain_agents_smoke.py passes only after agents_langchain s01-s06 code lane lands
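
The credential-free import guardrail described above can be sketched as follows. This is an illustration only, not the contents of tests/test_langchain_agents_smoke.py: the helper names `without_env` and `import_all_without_key` are hypothetical, and the sketch assumes that a module which starts a model client at import time would fail once the key is unset.

```python
import importlib
import os
import pkgutil
from contextlib import contextmanager


@contextmanager
def without_env(name):
    """Temporarily remove one environment variable, restoring it afterwards."""
    saved = os.environ.pop(name, None)
    try:
        yield
    finally:
        if saved is not None:
            os.environ[name] = saved


def import_all_without_key(package_name, key="OPENAI_API_KEY"):
    """Import every top-level module of a package while `key` is unset.

    Returns the sorted list of imported module names; any import-time
    failure propagates to the caller as an exception.
    """
    imported = []
    with without_env(key):
        package = importlib.import_module(package_name)
        for info in pkgutil.iter_modules(package.__path__):
            full_name = f"{package_name}.{info.name}"
            importlib.import_module(full_name)
            imported.append(full_name)
    return sorted(imported)
```

A smoke test could call `import_all_without_key("agents_langchain")` and assert that the result is non-empty. Note that a module already imported earlier in the same process is served from the import cache, so a check like this is most meaningful in a fresh interpreter.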

The docs lane needs a stable comparison entry point before the code and test lanes are integrated, so this records where the s01-s06 LangChain/OpenAI-interface track lives, how it should be configured, and how reviewers should keep it separate from the original agents/ baseline and web UI.

Constraint: First milestone is s01-s06 only and must preserve agents/ plus web/ boundaries
Constraint: LangChain docs currently install core langchain plus langchain-openai for OpenAI integration
Rejected: Surface the track through web/ now | user explicitly scoped web UI/app out of this milestone
Confidence: high
Scope-risk: narrow
Tested: python -m pytest tests/test_agents_smoke.py -q; python -m compileall agents tests -q; git diff --check; python -m pip install --dry-run -r requirements.txt pytest
Not-tested: full pytest suite, due to a pre-existing tests/test_s_full_background.py failure unrelated to docs/deps changes

Add a parallel agents_langchain s01-s06 track so learners can compare the existing hand-written Anthropic SDK baseline against LangChain's OpenAI-interface runtime without changing the web UI or original agents.

Constraint: First milestone is s01-s06 only and must preserve agents/*.py plus web/
Rejected: Put LangChain files under agents/ | risks confusing the existing web extractor and baseline teaching boundary
Confidence: high
Scope-risk: moderate
Tested: python -m py_compile agents_langchain/*.py; python -m pytest tests/test_agents_smoke.py tests/test_langchain_agents_smoke.py -q; env -u OPENAI_API_KEY import check for agents_langchain modules

The first LangChain milestone needs to sit beside the hand-written Anthropic SDK lessons, not replace them, so this adds a separate agents_langchain package, non-live smoke tests, OpenAI-style setup docs, and CI dependency wiring while leaving the web app and original s01-s06 scripts unchanged.

Constraint: Preserve existing agents/*.py as the baseline and avoid web UI/app changes for this milestone
Constraint: Automated tests must not require OPENAI_API_KEY or network access
Rejected: Put LangChain files under agents/ | would blur the baseline boundary and risk web extractor churn
Confidence: high
Scope-risk: moderate
Tested: python -m py_compile agents_langchain/*.py tests/test_langchain_agents_smoke.py
Tested: python -m pytest tests/test_agents_smoke.py tests/test_langchain_agents_smoke.py -q
Tested: env -u OPENAI_API_KEY python -m pytest tests/test_langchain_agents_smoke.py -q
Not-tested: Full pytest suite is blocked by pre-existing tests/test_s_full_background.py failure in unmodified agents/s_full.py
Not-tested: Live LangChain/OpenAI calls intentionally not run

The integrated LangChain milestone passed its targeted checks, but full repository pytest still failed in BackgroundManagerTests because a running background task with result=None rendered as '[running] None'. Normalizing the None case to the existing running placeholder keeps the capstone behavior aligned with the test and avoids a misleading status string.

Constraint: Full post-change verification should pass before concluding the milestone
Rejected: Leave the unrelated failure unresolved | would keep full pytest red at handoff time
Confidence: high
Scope-risk: narrow
Directive: Preserve the '(running)' placeholder contract for unfinished background tasks unless tests and user-visible output are updated together
Tested: python -m py_compile agents/s_full.py agents_langchain/*.py tests/test_langchain_agents_smoke.py; python -m pytest tests -q
Not-tested: Interactive manual run of agents/s_full.py background task commands
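
The None normalization described above might look roughly like the sketch below. The function name `render_task`, its signature, and the exact output format are assumptions for illustration; the actual code in agents/s_full.py is not shown on this page and may be structured differently.

```python
from typing import Optional

# Placeholder text named in the commit message's Directive trailer.
RUNNING_PLACEHOLDER = "(running)"


def render_task(status: str, result: Optional[str]) -> str:
    """Format one background task line, hiding a missing result.

    Formatting `result` directly would print the literal string "None"
    for an unfinished task, which is the misleading output the fix removes.
    """
    shown = RUNNING_PLACEHOLDER if result is None else result
    return f"[{status}] {shown}"
```

Under these assumptions, `render_task("running", None)` yields `[running] (running)` instead of `[running] None`, while a task that has finished keeps its real result text.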

psymoth · 2026-04-19T22:51:48Z

Release-validation is complete. Current status:

- acceptance circle1: pass
- acceptance circle2: pass
- pytest -q coding-deepgent/tests: 438 passed
- frontend CLI test/typecheck: pass
- ruff + mypy: pass
- PR title/body refreshed to current local baseline scope
- branch rebased/merged with latest upstream main and README conflicts resolved

The PR is now mergeable from a code/conflict standpoint. The only remaining blockers are repository-side permissions and external Vercel authorization checks. My GitHub identity does not have permission to execute MergePullRequest for this repo, so merge must be completed by a maintainer or someone with merge rights.

CrazyBoyM and others added 30 commits April 8, 2026 05:45

feat: realign teaching path and web docs

36897b1

test(web): stabilize multi-locale browser flows

5dfe67f

omx(team): auto-checkpoint worker-2 [unknown]

951a9f7

omx(team): merge worker-2

9d3fe16

omx(team): auto-checkpoint worker-2 [unknown]

48ae0e8

omx(team): auto-checkpoint worker-2 [unknown]

6ddb1ad

omx(team): auto-checkpoint worker-3 [unknown]

4e2d448

omx(team): auto-checkpoint worker-2 [unknown]

cbcbb59

omx(team): auto-checkpoint worker-3 [unknown]

7f4b699

omx(team): auto-checkpoint worker-1 [unknown]

86e35d7

omx(team): auto-checkpoint worker-4 [unknown]

94c500b

omx(team): auto-checkpoint worker-1 [unknown]

b071b08

omx(team): auto-checkpoint worker-4 [unknown]

a95dbc8

omx(team): auto-checkpoint worker-2 [unknown]

f403f8a

omx(team): auto-checkpoint worker-2 [unknown]

2383975

omx(team): auto-checkpoint worker-2 [unknown]

4ec5d3b

omx(team): auto-checkpoint worker-1 [unknown]

8d43936

omx(team): auto-checkpoint worker-4 [unknown]

624ba20

omx(team): merge worker-3

bdbd6d1

omx(team): auto-checkpoint worker-2 [unknown]

0cc941e

omx(team): auto-checkpoint worker-2 [unknown]

9651878

omx(team): merge worker-2

881932f

omx(team): auto-checkpoint worker-3 [unknown]

cf0cd7a

Kun added 26 commits April 19, 2026 23:26

chore(task): archive 04-19-frontend-architecture-cc-cli-reuse

f46e16c

chore(task): archive 04-19-deerflow-inspired-decoupling

ca778fd

chore(task): archive 04-16-cc-highlight-alignment-discussion

28f64d9

chore(task): archive 04-17-subagent-multiagent-ch09-review

543c957

chore: record journal

5f1aceb

chore(task): archive 04-19-final-release-validation-pr-cleanup

d046349

chore(task): archive 04-20-extract-coding-agent-repo

95b2b60

feat(runtime): complete circle 1 wave 1 parity pack

e7f78b1

chore: record journal

4be8018

feat(runtime): expose wave 2 cli tui surfaces

575850f

chore(task): archive 04-20-coding-deepgent-circle-1-wave-2-runtime-su…

a5646a9

…rfaces

chore: record journal

5a109d5

feat(runtime): add wave 2 control surfaces

c9c38e4

chore(task): archive 04-20-coding-deepgent-circle-1-wave-2-control-su…

0078bce

…rfaces

chore: record journal

339170d

chore: fix journal commit references

6f4ff7a

feat(runtime): complete circle 1 local parity

7248889

chore(task): archive 04-20-coding-deepgent-circle-1-completion-remaining

386602b

chore: record journal

f073945

docs(plan): add circle 2 expanded parity plan

243be04

chore(task): archive 04-20-brainstorm-circle-2-parity-plan

b6b522c

chore: record journal

3c83b77

feat(runtime): complete circle 2 local baseline

bbe9eeb

chore(task): archive 04-20-coding-deepgent-circle-2-expanded-parity-b…

7cc5bfb

…aseline

chore: record journal

74a2ed9

docs: finalize release wording

1b38ed3

psymoth changed the title ~~[codex] Close coding-deepgent MVP local agent harness core~~ [codex] Build coding-deepgent local parity baselines Apr 19, 2026

psymoth marked this pull request as ready for review April 19, 2026 22:30

merge: resolve README conflicts with upstream

0036077

psymoth wants to merge 239 commits into shareAI-lab:main from psymoth:codex/stage-12-14-context-compact-foundation


2 participants
