[FLINK-39622] [postgres] Fix O(N²) JDBC metadata lookups in CustomPostgresSchema d… by ThorneANN · Pull Request #4403 · apache/flink-cdc

ThorneANN · 2026-05-20T06:37:33Z

、 CustomPostgresSchema#readTableSchema invokes jdbcConnection.readSchema with
the full captured-table filter, so a single call already loads metadata for
every captured table. However the cache-population loop only iterates the
requested subset, discarding the rest. As a result, snapshot startup performs
one full pg_catalog scan per split, scaling as O(N²) with the number of
captured tables and causing severe latency on multi-tenant Postgres deployments
that capture hundreds of tables across schemas.

This change caches every table discovered by readSchema into schemasByTableId,
while the returned tableChanges still contains only the originally-requested
subset. Subsequent splits are served entirely from the cache.

Also fixes a related issue where getTableSchema(List) re-fetched
already-cached tables by passing the full tableIds list to readTableSchema
instead of the unmatched subset.

…uring snapshot

[postgres] Fix O(N²) JDBC metadata lookups in CustomPostgresSchema d…

708fb51

…uring snapshot

github-actions Bot added the postgres-cdc-connector label May 20, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FLINK-39622] [postgres] Fix O(N²) JDBC metadata lookups in CustomPostgresSchema d…#4403

[FLINK-39622] [postgres] Fix O(N²) JDBC metadata lookups in CustomPostgresSchema d…#4403
ThorneANN wants to merge 1 commit into
apache:masterfrom
ThorneANN:fix/postgres-schema-cache-n-squared

ThorneANN commented May 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

ThorneANN commented May 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant