docgen — documentation generator

Reusable Python library and CLI for narrated demo videos built around Manim, OpenAI TTS, and ffmpeg composition. Aimed at long-form, scripted explainers that walk through how a system works.

Suite handbook (Courseforge)

Prose + PlantUML sources for how Courseforge repositories fit together live under docs/suite/. Regenerate PNGs with ./scripts/render-suite-diagrams.sh — Java, Graphviz (dot), and the vendored JAR in third_party/plantuml/ (CI installs Graphviz and runs the same script; see .github/workflows/render-suite-diagrams.yml). The rendered site at courseforge.github.io pulls this tree on each publish from courseforge/infrastructure.

What changed: Playwright is gone

docgen no longer ships any Playwright-driven UI demo path. The previous demo-function, playwright, discover-tests, vhs, tape-lint, sync-vhs, per-function-*, and catalog commands — together with their config blocks (vhs:, playwright:, playwright_test:, discover_tests:, catalog:, per_function:) and the playwright, playwright_test, and vhs visual_map types — have been removed.

Why: a UI-test-driven recorder turned out to be a fragile, project-specific concern that pulled pytest-playwright, Node Playwright, VHS / ttyd, browser binaries, trace parsing, and a discovery catalog into a generic library. The same goal is now being prototyped in a consumer project (CourseForge tools/courseforge/demogen/) with the “LLM emits a validated automation spec, a deterministic runner translates it to Playwright” pattern. Once that contract stabilises a small portion may be backported into docgen, but docgen itself stays Playwright-free.

If you still need the legacy behaviour, pin a pre-removal commit (pip install docgen @ git+https://github.com/jmjava/documentation-generator.git@<sha>).

What docgen does today

TTS narration — generate MP3 audio from Markdown scripts via OpenAI gpt-4o-mini-tts.
Whisper-aligned timestamps — extract word-level timing from TTS audio so visual cues can wait on real speech.
Manim animations — primary visual surface. Use docgen scene-spec-generate
- scene-compile (or hand-maintained animations/specs/*.scene.yaml) for deterministic diagram layout: rows are auto-paginated when they exceed the frame stack budget, specs that overflow safe width / budget are rejected, and (when timing.json carries Whisper words) each row’s first label is mapped to a wait_word index. Hand-maintained custom Manim classes still live in animations/scenes.py outside the BEGIN/END GENERATED SCENE markers.
ffmpeg composition — combine narration audio and Manim video into final segments, with a freeze-tail guard.
Validation — A/V drift, freeze ratio, OCR error scan, layout, narration lint, Manim scene lint.
GitHub Pages — auto-generate index.html, deploy workflow, LFS rules, .gitignore.
Wizard — local web GUI to bootstrap narration scripts from existing project docs.

No IDE lock-in: maintenance workflows are docgen CLI + YAML + shell/CI (and OpenAI where a command calls the API). The wizard is a local Flask app, not a plugin tied to one editor.

Install

pip install docgen @ git+https://github.com/jmjava/documentation-generator.git

Development setup

git clone https://github.com/jmjava/documentation-generator.git
cd documentation-generator
python3 -m venv .venv
source .venv/bin/activate
pip install -e ".[dev]"
pytest

CI installs ffmpeg and tesseract via apt — see .github/workflows/ci.yml.

Roadmap: milestones/README.md.

Quick start

cd your-project/docs/demos
docgen wizard              # optional: bootstrap narration from project docs
docgen generate-all        # TTS → timestamps → Manim → compose → validate → concat
docgen validate --pre-push

CLI commands

Command	Description
`docgen init [TARGET_DIR] [--defaults] [--segments-file FILE]`	Scaffold a new project: `docgen.yaml`, wrapper scripts, directories
`docgen wizard [--port 8501]`	Launch narration setup wizard (local web GUI)
`docgen tts [--segment 01] [--dry-run]`	Generate TTS audio
`docgen timestamps`	Extract Whisper timestamps from TTS audio → `timing.json`
`docgen manim [--scene StackDAGScene]`	Render Manim animations
`docgen compose [01 02 03] [--ffmpeg-timeout 900]`	Compose segments (audio + video)
`docgen validate [--max-drift 2.75] [--pre-push]`	Run all validation checks
`docgen lint [--segment 01]`	Narration lint only
`docgen concat [--config full-demo]`	Concatenate full demo files
`docgen pages [--force]`	Generate `index.html`, `pages.yml`, `.gitattributes`, `.gitignore`
`docgen generate-all [--skip-tts] [--skip-manim] [--retry-manim]`	Full pipeline
`docgen rebuild-after-audio`	Recompose + validate + concat (skips TTS)
`docgen clean-bundle [-y] [--delete-config] [--keep-narration]`	Remove regenerable outputs under the bundle
`docgen narration-generate --segment 01 [--extra-path REL] [--hint TEXT] [--dry-run] [--force]`	Generate narration `.md` from repo sources + owner hints (OpenAI); see `narration_from_source` in YAML
`docgen yaml-generate [--merge-defaults] [--llm] [--dry-run] [--list-gaps]`	Merge defaults into `docgen.yaml`; optional OpenAI refresh of `tts.instructions` / `wizard.system_prompt` (rewrites the file — review in Git)
`docgen scene-compile SPEC.scene.yaml [--dry-run]`	Compile a declarative scene spec (YAML) into a `_TimedScene` class and inject it into `animations/scenes.py` — deterministic layout (rows of `_box`); applies auto-pagination + Whisper `wait_word`
`docgen scene-spec-generate [--segment 01 \| --all] [--compile] [--print-only] [--output PATH] [--hint …] [--model …]`	Call OpenAI to emit YAML only (same schema as `scene-compile`); rejects specs that exceed the stack budget or safe row width, runs the same auto-paginate + word-alignment, optionally writes `animations/specs/<stem>.scene.yaml` and `--compile`s into `scenes.py`

Configuration

Create a docgen.yaml in your demos directory. Use docgen init to scaffold a fresh layout, then docgen yaml-generate to fill in defaults from the files already on disk. (docgen yaml-generate also keeps manim_scene_generation.segments in step with visual_map.)

The visual_map key is maintainer-owned per-segment wiring. Supported types are manim, mixed, still, and image.

`env_file` and the shell

If docgen.yaml sets env_file (often .env), variables are loaded with shell-first semantics: anything already exported in the process (including your IDE or CI) is not replaced by the file. To make the file win, set DOCGEN_ENV_OVERRIDES=1 so every key from env_file overwrites the environment, or DOCGEN_ENV_OVERRIDES=OPENAI_API_KEY,OTHER_KEY for specific keys only.

When OPENAI_API_KEY is present in both the shell and env_file, docgen prints a one-line hint to stderr so a silent 401 from the wrong key is easier to diagnose.

Narration from source (owner hints)

Under narration_from_source in docgen.yaml, the project owner lists optional hints (strings) that steer the model (audience, terminology, what to avoid). OpenAI generates the narration .md from your repo context (context.paths / context.globs, relative to repo_root) plus those hints; the result is what docgen tts reads. See docgen.narrate_from_source.

narration_from_source:
  model: gpt-4o-mini
  temperature: 0.65
  max_context_bytes: 120000
  hints:
    - "Audience: contributors new to this repo."
    - "Do not mention unreleased product codenames."
  context:
    paths:
      - README.md
    globs:
      - "src/**/*.py"
  segments:
    "01":
      hints:
        - "This segment covers the install wizard only."
      context:
        paths:
          - docs/install.md

Useful pipeline options

validation:
  max_drift_sec: 2.75
  max_freeze_ratio: 0.25     # trailing-frame pad vs narration length (compose freeze guard + validate)

manim:
  quality: 1080p30           # supports 480p15, 720p30, 1080p30, 1080p60, 1440p30, 1440p60, 2160p60
  manim_path: ""             # optional explicit binary path (relative to docgen.yaml or absolute)
  font: "Liberation Sans"
  min_font_size: 14

compose:
  ffmpeg_timeout_sec: 300    # can also be overridden with: docgen compose --ffmpeg-timeout N

System dependencies

ffmpeg — composition and probing
tesseract-ocr — OCR validation
Manim — primary visuals (optional install: pip install docgen[manim])

Milestone spec

See milestone-doc-generator.md for the full design document.

Name		Name	Last commit message	Last commit date
Latest commit History 88 Commits
.cursor/rules		.cursor/rules
.github/workflows		.github/workflows
docs		docs
issues/archive		issues/archive
milestones		milestones
scripts		scripts
session-notes		session-notes
specs-on-hold		specs-on-hold
src/docgen		src/docgen
tests		tests
third_party/plantuml		third_party/plantuml
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
AGENTS.md		AGENTS.md
NOTICE		NOTICE
README.md		README.md
SESSION_NOTES.md		SESSION_NOTES.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

docgen — documentation generator

Suite handbook (Courseforge)

What changed: Playwright is gone

What docgen does today

Install

Development setup

Quick start

CLI commands

Configuration

`env_file` and the shell

Narration from source (owner hints)

Useful pipeline options

System dependencies

Milestone spec

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

docgen — documentation generator

Suite handbook (Courseforge)

What changed: Playwright is gone

What docgen does today

Install

Development setup

Quick start

CLI commands

Configuration

env_file and the shell

Narration from source (owner hints)

Useful pipeline options

System dependencies

Milestone spec

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

`env_file` and the shell

Packages