feat: Add pgschema.toml configuration file support by NFUChen · Pull Request #433 · pgplex/pgschema

NFUChen · 2026-05-14T17:25:20Z

Add `pgschema.toml` configuration file support

Summary

Adds support for a TOML-based configuration file (pgschema.toml) so users can define connection parameters, flags, and per-environment overrides without repeating CLI flags on every invocation.

Before:

pgschema plan --host localhost --port 5432 --db myapp --user postgres --schema public --file schema.sql
pgschema apply --host localhost --port 5432 --db myapp --user postgres --schema public --file schema.sql

After:

# with pgschema.toml in the working directory
pgschema plan
pgschema apply

Features

1. Flat config file

All existing CLI flags can be set in pgschema.toml:

host = "localhost"
port = 5432
db = "myapp"
user = "postgres"
schema = "public"
file = "schema.sql"

2. Named environments (`--env`)

[env.*] blocks define per-environment overrides that inherit from the base level:

schema = "public"
file = "schema.sql"

[env.dev]
host = "localhost"
db = "myapp_dev"
user = "postgres"

[env.prod]
host = "prod-db.internal"
db = "myapp_prod"
user = "app_user"
lock-timeout = "30s"
auto-approve = false

pgschema plan --env dev
pgschema apply --env prod

Environment merging uses TOML metadata (IsDefined) so explicitly setting a boolean to false in an env block correctly overrides a base-level true — zero values are not silently skipped.

3. Multi-tenant schema loop (`[schemas]`)

For multi-tenant setups, a [schemas] block with a SQL query discovers schema names at runtime. plan and apply iterate over all discovered schemas automatically:

host = "localhost"
db = "myapp"
user = "postgres"
file = "tenant.sql"

[schemas]
query = "SELECT schema_name FROM information_schema.schemata WHERE schema_name LIKE 'tenant_%'"

pgschema plan    # plans for each tenant schema
pgschema apply   # applies to each tenant schema

The discovery query runs inside a read-only transaction to prevent accidental data modification (CREATE/DROP/INSERT are rejected by Postgres).

4. Precedence

CLI flags always win: CLI flags > env vars > config env block > config base > defaults.

Config values are applied in PreRunE hooks before env var resolution, so existing PGHOST/PGPORT/etc. behavior is preserved. A flag is only populated from config if cmd.Flags().Changed(flag) returns false.

Global flags added

Flag	Default	Description
`--config`	`pgschema.toml`	Path to config file
`--env`	(none)	Named environment to use

Explicit --config to a missing file → exit with error.
Default pgschema.toml missing → silently proceed (backward compatible).
--env without a config file → exit with error.

Files changed

Code

File	Change
`cmd/config/config.go`	New: TOML parsing, env merging via `IsDefined`, `DiscoverSchemas` (read-only tx, URL-encoded DSN), `Get()`/`SetResolved()` singleton
`cmd/config/config_test.go`	New: unit tests for config loading, env overrides, booleans, schemas section, edge cases
`cmd/config_integration_test.go`	New: integration tests including read-only enforcement on schema discovery
`cmd/root.go`	`--config` / `--env` global flags, `loadConfig()` in `PersistentPreRun`
`cmd/plan/plan.go`	`applyConfigToPlan()` PreRunE, `runPlanMultiSchema()` using top-level `Plan` for combined output, removed `MarkFlagRequired("file")`, unified `processOutput()`
`cmd/plan/output_test.go`	Adjusted (whitespace + final newline; `TestDeriveSchemaOutputTarget` removed since per-schema-file logic was dropped in favor of single combined output)
`cmd/apply/apply.go`	`applyConfigToApply()` PreRunE, `runApplyMultiSchema()`, `applyPlanFile()` iterating over `Plan.Schemas` in sorted order with auto-detection of single vs multi-schema plan files
`cmd/apply/apply_test.go`	New `TestRunApply_PlanFlagSkipsMultiSchema` ensures `--plan` short-circuits the multi-schema path even when `[schemas]` is configured
`cmd/apply/apply_integration_test.go`	Updated call sites: `GeneratePlan` → `GenerateSchemaPlan`; plan files now go through `Plan.AddSchema("public", ...)` and are read back via `Schemas["public"]`
`cmd/dump/dump.go`	`applyConfigToDump()` PreRunE
`cmd/{ignore,migrate}_integration_test.go`, `cmd/plan/external_db_integration_test.go`	Updated to call `GenerateSchemaPlan` and wrap with `plan.NewPlan().AddSchema(...)` for output
`internal/plan/plan.go`	Slimmed down (~1100 lines removed). `Plan` is now a top-level container: `{ version, pgschema_version, created_at, schemas: map[string]*SchemaPlan }`. Adds `NewPlan()`, `AddSchema()`, `SortedSchemaNames()`, `SummaryString()`, multi-schema-aware `HumanColored()` / `ToSQL()` (single-schema renders without header), and `FromJSON()` for the new shape
`internal/plan/schema_plan.go`	New file (~1120 lines): `SchemaPlan` type holds the per-schema `Groups`, `SourceFingerprint`, `SourceDiffs`, plus all the previous `Plan` rendering logic (`HumanColored`, `ToSQL`, `calculateSummaryFromSteps`, table/view/materialized-view detail writers, helpers). All extracted verbatim from the old `plan.go`.
`internal/plan/schema_plan_test.go`	New: covers `SchemaPlan` summary/no-changes, JSON round-trip across `testdata/diff/migrate/v*` (now via top-level `Plan`), debug JSON round-trip with `SourceDiffs`, single-schema header omission
`internal/plan/plan_test.go`	Rewritten for the new top-level `Plan` API: `AddSchema`, `SortedSchemaNames`, `ToJSON`/`FromJSON` round-trip with `schemas` key, `SchemaEntry_ExcludesTopLevelFields`, `HumanColored_MultiSchema`, `ToSQL_MultiSchema`, `SummaryString`, `CreatedAt_UsesTestTime`

Test data — regenerated in this revision

All ~180 testdata/diff/**/plan.json golden files were regenerated to match the new top-level JSON shape. The change is purely structural — no SQL, fingerprints, operations, paths, or step ordering were modified.

Before (single-schema flat):

{
  "version": "1.0.0",
  "pgschema_version": "1.9.0",
  "created_at": "1970-01-01T00:00:00Z",
  "source_fingerprint": { "hash": "..." },
  "groups": [ { "steps": [ ... ] } ]
}

After (schema-keyed):

{
  "version": "1.0.0",
  "pgschema_version": "1.9.0",
  "created_at": "1970-01-01T00:00:00Z",
  "schemas": {
    "public": {
      "source_fingerprint": { "hash": "..." },
      "groups": [ { "steps": [ ... ] } ]
    }
  }
}

Why every file changed:

Plan is now always the multi-schema container; groups and source_fingerprint moved inside schemas.<name>.
Even single-schema runs (the entire existing diff suite) now serialize through the same path used by multi-tenant runs, ensuring one canonical on-disk format.
Top-level version, pgschema_version, and created_at are preserved; per-schema entries deliberately omit them (verified by TestPlan_SchemaEntry_ExcludesTopLevelFields).

The diff per file is mechanical (added wrapping "schemas": { "public": { ... } }, plus 2-space indentation shift), which is why the diffstat shows ~180 files with ±5–6k lines but no semantic changes:

184 files changed, 6511 insertions(+), 5249 deletions(-)

with internal/plan/plan.go shrinking by ~1100 lines as logic moved into internal/plan/schema_plan.go.

Misc

README.md — documentation for config file, named environments, multi-schema loop, and precedence.

Design decisions

TOML over YAML/JSON: pgschema already depends on BurntSushi/toml for ignore config — no new dependency.
Global singleton (config.Get()): Config is loaded once in PersistentPreRun and read by subcommands via config.Get(). Matches the existing pattern where global vars are set in PreRunE hooks.
--file is no longer MarkFlagRequired: When config provides file, requiring --file on the CLI would defeat the purpose. Validation moved to runtime in runPlan ("--file is required (provide via flag, config file, or environment)").
Read-only transaction for schema discovery: The [schemas].query is user-provided SQL executed against the target database. Wrapping in BeginTx(ctx, &sql.TxOptions{ReadOnly: true}) prevents accidental CREATE/DROP/INSERT even if the query is malformed or malicious. Verified by TestDiscoverSchemas_ReadOnlyEnforcement.
URL-encoded DSN in DiscoverSchemas: Built via net/url to avoid injection through host/user/password fields.
Unified plan file format: In multi-schema mode, all schema plans are written to a single file using the Plan JSON format ({"schemas": {"tenant_1": {...}, "tenant_2": {...}}}). apply --plan iterates schemas in sorted order. This eliminates the previous limitation of needing one plan file per tenant.
Single-schema rendering omits headers: Plan.HumanColored() and Plan.ToSQL() detect len(Schemas) == 1 and delegate directly to the underlying SchemaPlan, so single-schema CLI output is unchanged from before.
--plan short-circuits multi-schema: When --plan is provided, RunApply skips the [schemas] discovery path even if config has it set, since the plan file itself dictates which schemas to apply. Covered by TestRunApply_PlanFlagSkipsMultiSchema.

Backward compatibility

No CLI flag or env var semantics changed.
Without pgschema.toml, command behavior is identical to before — except plan JSON output now wraps under "schemas". This is a breaking change for anyone parsing plan JSON externally or replaying plan files produced by older pgschema versions. All golden plan files in testdata/diff/**/plan.json were regenerated accordingly.
All non-golden tests pass unmodified.

Flow diagrams

End-to-end command flow

flowchart TD
	A[Start command plan/apply/dump] --> B[PersistentPreRun loadConfig]
	B --> C{Config file exists?}
	C -->|No and --config explicit| C1[Exit with error]
	C -->|No and --env set| C2[Exit with error]
	C -->|No default file| C3[Continue with nil config]
	C -->|Yes| D[Parse TOML base + env]
	D --> E[Set global resolved config]

	E --> F[Subcommand PreRunE applyConfigToX]
	F --> G[Apply config values only when flag not changed]
	G --> H[Apply env vars and connection defaults]

	H --> I{Has schemas.query and schema not explicitly set?}
	I -->|No| J[Single-schema flow]
	I -->|Yes| K[Discover schemas in read-only transaction]

	K --> L{Command type}
	L -->|plan| M[Loop schemas, GenerateSchemaPlan, AddSchema to combined Plan]
	L -->|apply --file| N[Loop schemas, GenerateSchemaPlan + ApplyMigration each]
	L -->|apply --plan| N2[Load Plan from file, iterate Schemas in sorted order]

	M --> M1[Write combined Plan JSON/SQL/Human to single output]
	M1 --> M2[Progress logs to stderr]

	N --> N3[Progress logs to stderr]
	N2 --> N4[Apply each SchemaPlan with its schema name]

	J --> O[Single-schema behavior preserved, no header in output]
	M2 --> P[Done]
	N3 --> P
	N4 --> P
	O --> P

Precedence and output behavior

Precedence order

Priority	Source	Notes
1 (highest)	CLI flags	Explicit command-line input always wins
2	Environment variables	Applied after config fallback
3	Config env block (`[env.<name>]`)	Overrides base config when key is explicitly defined
4	Config base	Default values from top-level `pgschema.toml`
5 (lowest)	Built-in defaults	Hardcoded defaults in command flags

Multi-schema plan output

All schemas are combined into a single output. JSON wraps per-schema plans in a "schemas" map:

Output format	Behavior
`--output-json plan.json`	Single combined `Plan` JSON file with all schemas
`--output-json stdout`	Combined `Plan` JSON printed to stdout
`--output-human`	Schemas listed with `── Schema: <name> ──` headers (single-schema: no header)
`--output-sql`	Combined SQL with `-- Schema: <name>` comment headers (single-schema: no header)

Plan JSON format

{
  "version": "1.0.0",
  "pgschema_version": "1.9.0",
  "created_at": "2025-01-01T00:00:00Z",
  "schemas": {
    "tenant_1": {
      "source_fingerprint": { "hash": "..." },
      "groups": [...]
    },
    "tenant_2": {
      "source_fingerprint": { "hash": "..." },
      "groups": [...]
    }
  }
}

apply --plan combined.json iterates schemas in sorted order and applies each SchemaPlan against its named schema.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

… flags Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

When a config file defines [schemas] with a SQL query, plan and apply commands discover tenant schemas dynamically and iterate over each one. Dump is excluded since it produces a single template schema. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Tests cover: no config file, explicit config path, env overrides with inheritance, schemas section, plan fields, boolean overrides, and command-level config fallback. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

…tructures

…ulti-tenant schema handling

…very

… matrix strategy

feat: Add `pgschema.toml` configuration file support

…p, and plan commands

…nd plan commands

greptile-apps · 2026-05-14T17:49:06Z

Greptile Summary

This PR adds TOML configuration support for pgschema commands. The main changes are:

Adds pgschema.toml loading with named environment overrides.
Applies config values before existing environment variable handling.
Adds multi-schema discovery and looping for plan and apply.
Adds config fallback handling for dump, plan, and apply flags.
Documents config files, environments, and multi-tenant schema loops.

Confidence Score: 3/5

This is close, but the structured multi-schema output path should be fixed before merging.

Multi-schema planning can still print invalid JSON to stdout when more than one schema is discovered.
The config and apply paths otherwise follow the intended precedence model from the changed code.

cmd/plan/plan.go

Important Files Changed

Filename	Overview
cmd/plan/plan.go	Adds config fallback and multi-schema planning; stdout structured output still needs aggregation.
cmd/apply/apply.go	Adds config fallback and multi-schema apply paths with plan flag guard handling.
cmd/config/config.go	Adds TOML parsing, environment merging, global resolved config, and schema discovery.

_{Reviews (2): Last reviewed commit: "test: add tests for deriveSchemaOutputTa..." | Re-trigger Greptile}

…hema plan

…guard

…ructure

- Renamed `GeneratePlan` to `GenerateSchemaPlan` for clarity. - Updated `runPlan` and `runPlanMultiSchema` to use the new schema plan generation function. - Consolidated the `MultiPlan` and `Plan` structures into a unified `Plan` structure that handles both single and multi-schema operations. - Adjusted methods to work with the new `Plan` structure, including `AddSchema`, `HasAnyChanges`, `ToJSON`, and `ToSQL`. - Updated tests to reflect the changes in the plan structure and ensure proper functionality. - Enhanced JSON serialization and deserialization for the new plan structure.

William-W-Chen and others added 23 commits May 14, 2026 22:17

feat: add config package with TOML parsing and LoadConfig

dc8875d

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

test: add config merge, boolean override, and error case tests

8d14729

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

feat: add --config and --env flags to root command

2213d48

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

feat: plan/apply/dump commands read config values as fallback for CLI…

4c3958f

… flags Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

docs: update comments for ResolvedConfig, envConfig, and fileConfig s…

dd416f9

…tructures

feat: add configuration file support with environment overrides and m…

e6fe1f3

…ulti-tenant schema handling

feat: implement read-only transaction for schema discovery queries

a837a08

feat: add tests for read-only transaction enforcement in schema disco…

5b23197

…very

refactor: streamline CI workflows for unit and integration tests with…

9a57b5a

… matrix strategy

feat: update Docker workflow to use GitHub Container Registry

d4dca47

fix: ensure integration tests depend on unit tests in CI workflow

03e59de

refactor: streamline CI workflows for unit and integration tests with…

f674c42

… matrix strategy

fix: ensure integration tests depend on unit tests in CI workflow

6df6f1a

refactor: consolidate unit and integration tests into a single CI job

3130e92

revert: undo unnecessary trailing newline change in ci-test.yml

413e19a

Merge branch 'ci/matrix-improvements' into feat/config-file

3e2b293

refactor: remove integration test job from CI workflow

0b07702

Merge pull request #1 from NFUChen/feat/config-file

282b531

feat: Add `pgschema.toml` configuration file support

refactor: enhance PreRunE hooks to apply configuration for apply, dum…

9b5d90f

…p, and plan commands

feat: enhance PreRunE hooks to apply configuration for apply, dump, a…

f4e4d43

…nd plan commands

revert: restore changes from upstream base main

a0c9516

NFUChen changed the title ~~Feat/upstream pr config file~~ feat: Add pgschema.toml configuration file support May 14, 2026

refactor: remove applyConfigToDump call from runDump function

d058178

NFUChen marked this pull request as ready for review May 14, 2026 17:47

greptile-apps Bot reviewed May 14, 2026

View reviewed changes

Comment thread cmd/plan/plan.go

Comment thread cmd/apply/apply.go

Comment thread cmd/plan/plan.go Outdated

Comment thread cmd/plan/plan.go Outdated

Comment thread cmd/apply/apply.go

Comment thread cmd/config/config.go Outdated

Comment thread cmd/root.go

William-W-Chen added 2 commits May 15, 2026 09:39

fix: skip multi-schema path when --plan flag is used in apply

b3c5463

fix: use URL-encoded DSN in DiscoverSchemas to prevent injection

2010913

William-W-Chen added 6 commits May 15, 2026 09:40

fix: apply plan DB env vars in runPlanMultiSchema

bd11fcc

fix: apply plan DB env vars in runApplyMultiSchema

56c1b23

fix: redirect multi-schema progress banners to stderr

cfc80b8

fix: use per-schema output filenames to prevent overwrite in multi-sc…

953a0e5

…hema plan

fix: clear resolved config when config file is absent

d5a78ea

test: add tests for deriveSchemaOutputTarget and --plan multi-schema …

268e82c

…guard

NFUChen marked this pull request as draft May 15, 2026 02:38

William-W-Chen added 3 commits May 15, 2026 11:15

feat: implement multi-schema plan handling and output processing

2073159

feat: refactor output processing to use Outputter interface for plans

19c132f

feat: add human-readable preview output for multi-schema plans

49e13fa

NFUChen marked this pull request as ready for review May 15, 2026 04:05

greptile-apps Bot reviewed May 15, 2026

View reviewed changes

Comment thread cmd/plan/plan.go Outdated

William-W-Chen added 4 commits May 15, 2026 13:50

Refactor privilege and schema management plans to standardize JSON st…

3a34820

…ructure

chore: rename go file

51b0176

refactor: rename GeneratePlan to GenerateSchemaPlan for clarity

4e4d7bc

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Add pgschema.toml configuration file support#433

feat: Add pgschema.toml configuration file support#433
NFUChen wants to merge 39 commits into
pgplex:mainfrom
NFUChen:feat/upstream-pr-config-file

NFUChen commented May 14, 2026 •

edited

Loading

Uh oh!

greptile-apps Bot commented May 14, 2026 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

NFUChen commented May 14, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Add pgschema.toml configuration file support

Summary

Features

1. Flat config file

2. Named environments (--env)

3. Multi-tenant schema loop ([schemas])

4. Precedence

Global flags added

Files changed

Code

Test data — regenerated in this revision

Misc

Design decisions

Backward compatibility

Flow diagrams

End-to-end command flow

Precedence and output behavior

Precedence order

Multi-schema plan output

Plan JSON format

Uh oh!

greptile-apps Bot commented May 14, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Greptile Summary

Confidence Score: 3/5

Important Files Changed

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

NFUChen commented May 14, 2026 •

edited

Loading

Add `pgschema.toml` configuration file support

2. Named environments (`--env`)

3. Multi-tenant schema loop (`[schemas]`)

greptile-apps Bot commented May 14, 2026 •

edited

Loading