perf: speed up LP file writing (2.5-3.9x on large models, no regressions on small) #564

FBumann · 2026-01-31T21:13:58Z

Changes proposed in this Pull Request

Speed up LP file writing by up to 3.9x on large models, with consistent improvements across all problem sizes. Includes a benchmark script for reproducibility.
Added a lp-file benchmarking script, which is a majority of the lines changed

Performance optimizations

Use Polars streaming engine for concat_str + write_csv via new _format_and_write() helper (with automatic fallback + warning)
Replace concat + sort with join for constraint assembly
Extract maybe_group_terms_polars() to skip expensive group_by when terms already reference distinct variables
Reduce per-constraint overhead by applying labels mask directly and using fast sign uniformity check

Bug fixes

Fix missing space in LP file output
Fix IndexError on empty constraint slices in sign_flat check

Benchmark results

Reproduce with python dev-scripts/benchmark_lp_writer.py --model basic -o results.json --label "my run".

Synthetic model (2×N² vars, 2×N² constraints)

No regressions on small models, speedup grows with problem size up to 3.9x at 8M variables.

PyPSA SciGrid-DE (realistic power system model, 24–1000 snapshots)

Consistent 2.5–2.7x speedup across all sizes, reaching 7.0s → 2.7s at 2.5M variables / 6M constraints.

Checklist

Code changes are sufficiently documented; i.e. new functions contain docstrings and further explanations may be given in doc.
Unit tests for new features were added (if applicable).
A note for the release notes doc/release_notes.rst of the upcoming release is included.
I consent to the release of this PR's code under the MIT license.

Extract _format_and_write() helper that uses lazy().collect(engine="streaming") with automatic fallback, replacing 7 instances of df.select(concat_str(...)).write_csv(...).

Replace the vertical concat + sort approach in Constraint.to_polars() with an inner join, so every row has all columns populated. This removes the need for the group_by validation step in constraints_to_file() and simplifies the formatting expressions by eliminating null checks on coeffs/vars columns.

…r short DataFrame - Skip group_terms_polars when _term dim size is 1 (no duplicate vars) - Build the short DataFrame (labels, rhs, sign) directly with numpy instead of going through xarray.broadcast + to_polars - Add sign column via pl.lit when uniform (common case), avoiding costly numpy string array → polars conversion Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

…e vars Check n_unique before running the expensive group_by+sum. When all variable references are unique (common case for objectives), this saves ~31ms per 320k terms. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Replace np.unique with faster numpy equality check for sign uniformity. Eliminate redundant filter_nulls_polars and check_has_nulls_polars on the short DataFrame by applying the labels mask directly during construction. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Guard against IndexError when sign_flat is empty (no valid labels) by checking len(sign_flat) > 0 before accessing sign_flat[0]. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

…for duplicate (labels, vars) pairs before calling group_terms_polars. Use it in both Constraint.to_polars() and LinearExpression.to_polars() to avoid expensive group_by when terms already reference distinct variables

FabianHofmann · 2026-02-02T13:42:58Z

Wonderful @FBumann ! This is very much welcome!

FBumann · 2026-02-02T14:38:44Z

@FabianHofmann Should a fix the codecov stuff?

FBumann and others added 8 commits January 31, 2026 21:06

perf: use Polars streaming engine for LP file writing

86232e8

Extract _format_and_write() helper that uses lazy().collect(engine="streaming") with automatic fallback, replacing 7 instances of df.select(concat_str(...)).write_csv(...).

fix: log warning with traceback when Polars streaming fallback triggers

b1e9864

fix: missing space in lp file

d15ff40

fix: handle empty constraint slices in sign_flat check

0b413dd

Guard against IndexError when sign_flat is empty (no valid labels) by checking len(sign_flat) > 0 before accessing sign_flat[0]. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

FBumann mentioned this pull request Jan 31, 2026

perf: lp write speed #562

Closed

4 tasks

FBumann and others added 5 commits January 31, 2026 22:17

docs: add LP write speed improvement to release notes

9f35550

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

bench: add LP write benchmark script with plotting

1896eee

bench: larger model

68f1adc

Add variance to plot

04c4bea

FBumann changed the title ~~perf: speed up LP file writing (2-2.7x on large models)~~ perf: speed up LP file writing (2.5-3.9x on large models) Jan 31, 2026

FBumann changed the title ~~perf: speed up LP file writing (2.5-3.9x on large models)~~ perf: speed up LP file writing (2.5-3.9x on large models, no regressions on small) Jan 31, 2026

FBumann added 3 commits February 1, 2026 00:43

test: add coverage for streaming fallback and maybe_group_terms_polars

3f52fef

fix: mypy

3d4a815

fix: mypy

0dbe488

FBumann mentioned this pull request Feb 2, 2026

Feat/benchmarks #567

Open

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf: speed up LP file writing (2.5-3.9x on large models, no regressions on small) #564

perf: speed up LP file writing (2.5-3.9x on large models, no regressions on small) #564

Uh oh!

FBumann commented Jan 31, 2026 •

edited

Loading

Uh oh!

FabianHofmann commented Feb 2, 2026

Uh oh!

FBumann commented Feb 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

perf: speed up LP file writing (2.5-3.9x on large models, no regressions on small) #564

Are you sure you want to change the base?

perf: speed up LP file writing (2.5-3.9x on large models, no regressions on small) #564

Uh oh!

Conversation

FBumann commented Jan 31, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changes proposed in this Pull Request

Performance optimizations

Bug fixes

Benchmark results

Synthetic model (2×N² vars, 2×N² constraints)

PyPSA SciGrid-DE (realistic power system model, 24–1000 snapshots)

Checklist

Uh oh!

FabianHofmann commented Feb 2, 2026

Uh oh!

FBumann commented Feb 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

FBumann commented Jan 31, 2026 •

edited

Loading