Return a BandedMatrix from a view of a BandedBlockBandedMatrix by dlfivefifty · Pull Request #223 · JuliaLinearAlgebra/BlockBandedMatrices.jl

dlfivefifty · 2025-03-26T12:18:10Z

@MikaelSlevinsky This gets rid of the allocations (for lazy axes at least).

Still need to figure out why it's slow.

julia> for n in 20:20:200
                  ax = BlockArrays.BlockedOneTo(ArrayLayouts.RangeCumsum(Base.OneTo(n)))
                  D = BandedBlockBandedMatrix{Float64}(I, (ax,ax), (0, 0), (0, 0))
                  x = BlockVector(randn(sum(1:n)), (ax,))
                  y = zero(x)
                  @time my_special_mul_through_data!(y, D, x);
                  @time mul!(y, D, x);
              end
  0.000002 seconds
  0.000066 seconds (1 allocation: 32 bytes)
  0.000004 seconds
  0.000035 seconds (1 allocation: 32 bytes)
  0.000007 seconds
  0.000045 seconds (1 allocation: 32 bytes)
  0.000011 seconds
  0.000056 seconds (1 allocation: 32 bytes)
  0.000016 seconds
  0.000140 seconds (1 allocation: 32 bytes)
  0.000023 seconds
  0.000172 seconds (1 allocation: 32 bytes)
  0.000057 seconds
  0.000179 seconds (1 allocation: 32 bytes)
  0.000041 seconds
  0.000316 seconds (1 allocation: 32 bytes)
  0.000067 seconds
  0.000180 seconds (1 allocation: 32 bytes)
  0.000062 seconds
  0.000212 seconds (1 allocation: 32 bytes)

codecov · 2025-03-26T12:24:39Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 86.06%. Comparing base (629276d) to head (159d05f).
Report is 1 commits behind head on master.

Additional details and impacted files

@@            Coverage Diff             @@
##           master     #223      +/-   ##
==========================================
- Coverage   88.16%   86.06%   -2.10%     
==========================================
  Files          11       11              
  Lines        1115     1134      +19     
==========================================
- Hits          983      976       -7     
- Misses        132      158      +26

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

dlfivefifty · 2025-04-22T09:46:57Z

@MikaelSlevinsky With this (and the just tagged BlockArrays v1.6.2) we have the following:

julia> using AppleAccelerate # needed so banded matrices aren't insanely slow

julia> for n in 20:20:200
                  ax = BlockArrays.BlockedOneTo(ArrayLayouts.RangeCumsum(Base.OneTo(n)))
                  D = BandedBlockBandedMatrix{Float64}(I, (ax,ax), (0, 0), (0, 0))
                  x = BlockVector(randn(sum(1:n)), (ax,))
                  y = zero(x)
                  @time my_special_mul_through_data!(y, D, x);
                  @time my_special_mul!(y, D, x);
                  @time my_special_mul_with_a_view!(y, D, x);
                  @time mul!(y, D, x);
              end
  0.000003 seconds
  0.000011 seconds (80 allocations: 5.938 KiB)
  0.000003 seconds
  0.000012 seconds (2 allocations: 64 bytes)
  0.000002 seconds
  0.000008 seconds (160 allocations: 18.219 KiB)
  0.000002 seconds
  0.000026 seconds (2 allocations: 64 bytes)
  0.000002 seconds
  0.000012 seconds (240 allocations: 37.188 KiB)
  0.000003 seconds
  0.000007 seconds (2 allocations: 64 bytes)
  0.000003 seconds
  0.000017 seconds (320 allocations: 62.438 KiB)
  0.000006 seconds
  0.000011 seconds (2 allocations: 64 bytes)
  0.000004 seconds
  0.000029 seconds (400 allocations: 94.500 KiB)
  0.000009 seconds
  0.000011 seconds (2 allocations: 64 bytes)
  0.000005 seconds
  0.000034 seconds (480 allocations: 133.469 KiB)
  0.000013 seconds
  0.000015 seconds (2 allocations: 64 bytes)
  0.000007 seconds
  0.000039 seconds (560 allocations: 178.156 KiB)
  0.000013 seconds
  0.000020 seconds (2 allocations: 64 bytes)
  0.000009 seconds
  0.000049 seconds (640 allocations: 229.531 KiB)
  0.000022 seconds
  0.000027 seconds (2 allocations: 64 bytes)
  0.000012 seconds
  0.000052 seconds (720 allocations: 287.469 KiB)
  0.000059 seconds
  0.000034 seconds (2 allocations: 64 bytes)
  0.000017 seconds
  0.000072 seconds (800 allocations: 351.938 KiB)
  0.000029 seconds
  0.000045 seconds (2 allocations: 64 bytes)

So we have got rid of almost all allocations. The remaining speed difference is due to gbmv! being too slow for diagonal matrices:

julia> n = 10_000; D = BandedMatrix(0 => 1:n); x = randn(n); @time D*x;
  0.000042 seconds (3 allocations: 78.188 KiB)

julia> D = Diagonal(1:n); x = randn(n); @time D*x;
  0.000010 seconds (3 allocations: 78.188 KiB)

We could special case diagonal blocks not to call BLAS... actually I suspect the fastest would be to get rid of all calls to banded BLAS and just do naive for loops....

MikaelSlevinsky · 2025-04-22T13:21:10Z

Awesome, thanks!

dlfivefifty · 2025-04-22T15:34:41Z

The reason ApproxFun is broken is that

https://github.com/JuliaApproximation/ApproxFunBase.jl/blob/569ff3c493c3f288214dfc874a1e32be7a9ce566/src/PDE/KroneckerOperator.jl#L321

is assuming that view returns a SubArray that can be reindexed. This is arguably a bad design.

@jishnub thoughts?

Return a BandedMatrix from a view of a BandedBlockBandedMatrix

cfc75eb

MikaelSlevinsky mentioned this pull request Mar 31, 2025

This adds methods to compute bivariate Gram matrices. JuliaApproximation/FastTransforms.jl#261

Open

dlfivefifty added 2 commits April 16, 2025 21:07

Update BandedBlockBandedMatrix.jl

0fc3234

tests pass

159d05f

dlfivefifty mentioned this pull request Jun 19, 2025

LinearSolve of Block-Tridiagonal matrix is slower than Sparse Arrays #225

Open

johnomotani mentioned this pull request Oct 14, 2025

Construct new BlockSkylineMatrix from BlockSkylineMatrix and UnitRanges #229

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Return a BandedMatrix from a view of a BandedBlockBandedMatrix#223

Return a BandedMatrix from a view of a BandedBlockBandedMatrix#223
dlfivefifty wants to merge 3 commits intomasterfrom
viewblock_is_banded

dlfivefifty commented Mar 26, 2025

Uh oh!

codecov bot commented Mar 26, 2025 •

edited

Loading

Uh oh!

dlfivefifty commented Apr 22, 2025

Uh oh!

MikaelSlevinsky commented Apr 22, 2025

Uh oh!

dlfivefifty commented Apr 22, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

dlfivefifty commented Mar 26, 2025

Uh oh!

codecov bot commented Mar 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

dlfivefifty commented Apr 22, 2025

Uh oh!

MikaelSlevinsky commented Apr 22, 2025

Uh oh!

dlfivefifty commented Apr 22, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

codecov bot commented Mar 26, 2025 •

edited

Loading