GPU-friendly truncation implementations #349

lkdvos · 2026-01-08T14:41:19Z

This is an attempt to get rid of the scalar-indexing oriented approach, and instead do more global operations.
Definitely still WIP, and on CPU there are definitely various optimizations that can be applied if needed.
I do wonder about the performance a bit, as I would actually expect that for a large number of sectors this might just be faster.

Some possible optimizations:

for UniqueFusion, finding the nth value is simply partialsortperm(values, n; by, rev), avoiding the need to allocate the full permutation vector
for CPU, cumsum + findlast can be replaced by a loop to avoid some intermediate allocations

codecov · 2026-01-08T16:34:15Z

Codecov Report

✅ All modified and coverable lines are covered by tests.

Files with missing lines	Coverage Δ
src/factorizations/truncation.jl	`54.80% <100.00%> (-33.13%)`	⬇️

... and 31 files with indirect coverage changes

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

try to make truncation GPU-friendly

228fdcf

kshyatt force-pushed the ld-truncation branch from a9bb7f6 to 228fdcf Compare January 8, 2026 19:51

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

GPU-friendly truncation implementations #349

GPU-friendly truncation implementations #349

Uh oh!

lkdvos commented Jan 8, 2026

Uh oh!

codecov bot commented Jan 8, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

GPU-friendly truncation implementations #349

Are you sure you want to change the base?

GPU-friendly truncation implementations #349

Uh oh!

Conversation

lkdvos commented Jan 8, 2026

Uh oh!

codecov bot commented Jan 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

codecov bot commented Jan 8, 2026 •

edited

Loading