Introduce VCL, SIMD wrappers, and Vectorized RasterizeSDF(Spheres) by Idclip · Pull Request #2190 · AcademySoftwareFoundation/openvdb

Idclip · 2026-04-08T02:30:24Z

This PR proposes to bring in Agner Fog's vectorclass (VCL) library as an internal but optional dependency on x86/x86_64 architectures. It then introduces some further infrastructure to improve/tidy our x86 intrinsic usage/ISA targeting, along with an additional wrapper header, openvdb/simd/Simd.h, which wraps the VCL Vec containers within an openvdb::simd namespace. This second level of wrapping exists to:

Abstract away the underlying SIMD container types should we want to use a different library in the future (e.g. std::simd)
Provide a namespace for transient selection of other architectures in the future (arm/neon, etc)
Allow us to implement a generic API for non-vectorized or non-x86 builds that instead works on Tuples of values, so that algorithms can be written with a single implementation for both build types.
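To make the idea of the second wrapping level concrete, here is a minimal sketch of how a single alias could switch between a VCL container and a Tuple fallback. Everything in it is an assumption for illustration only: the `sketch_simd` namespace, the `Tuple` type, the `Double4` alias, the `SKETCH_USE_VCL` macro and `mulAdd4` are hypothetical names and are not the contents of openvdb/simd/Simd.h.

```cpp
#include <array>
#include <cassert>
#include <cstddef>

#if defined(SKETCH_USE_VCL) // hypothetical switch, not the PR's CMake option
#include "vectorclass.h"    // Agner Fog's VCL; defines Vec4d in the global namespace
#endif

namespace sketch_simd { // hypothetical namespace, not the PR's openvdb::simd

// Tuple-backed fallback: N lanes of T with element-wise arithmetic.
template <typename T, std::size_t N>
struct Tuple {
    std::array<T, N> lane{};

    // Read N contiguous values from p.
    Tuple& load(const T* p) {
        assert(p != nullptr);
        for (std::size_t i = 0; i < N; ++i) lane[i] = p[i];
        return *this;
    }
    // Write N contiguous values to p.
    void store(T* p) const {
        assert(p != nullptr);
        for (std::size_t i = 0; i < N; ++i) p[i] = lane[i];
    }
};

template <typename T, std::size_t N>
Tuple<T, N> operator+(const Tuple<T, N>& a, const Tuple<T, N>& b) {
    Tuple<T, N> r;
    for (std::size_t i = 0; i < N; ++i) r.lane[i] = a.lane[i] + b.lane[i];
    return r;
}

template <typename T, std::size_t N>
Tuple<T, N> operator*(const Tuple<T, N>& a, const Tuple<T, N>& b) {
    Tuple<T, N> r;
    for (std::size_t i = 0; i < N; ++i) r.lane[i] = a.lane[i] * b.lane[i];
    return r;
}

#if defined(SKETCH_USE_VCL)
using Double4 = ::Vec4d;          // hardware-backed lanes
#else
using Double4 = Tuple<double, 4>; // portable fallback
#endif

// One implementation for both build types: out[i] = a[i] * b[i] + c[i].
inline void mulAdd4(const double* a, const double* b, const double* c, double* out) {
    Double4 va, vb, vc;
    va.load(a);
    vb.load(b);
    vc.load(c);
    (va * vb + vc).store(out);
}

} // namespace sketch_simd
```

The caller only ever names `Double4`, so swapping the backing type (or a future library) would not change algorithm code in this sketch.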

Note that the last point in the list above is crucial: it encourages us to write code that maps well onto SIMD concepts, regardless of whether VCL/ISA targeting is in use. Many tools in VDB are inherently memory bound; that is, lots of data reading, little computation. Even when explicitly disabling compiler vectorization for specific x86 ISAs, there are notable performance improvements to be observed in many methods simply by restructuring inner loops to work on multiple components (i.e. many inner loops vs many outer loops).
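As a rough illustration of that restructuring, the sketch below stamps one sphere into a row of voxel values twice: once with every step applied per element, and once in blocks of `W` values where each step is its own short fixed-length loop. The function names, the row layout and the choice of `W = 4` are assumptions made for this example and are not taken from the PR.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>

constexpr std::size_t W = 4; // assumed block width for this sketch

// Original shape: all steps applied to one element per iteration.
// row[k] holds the current distance at x = k; dy2dz2 is the squared
// offset in y and z, shared by the whole row.
inline void stampRowScalar(double* row, std::size_t n,
                           double cx, double dy2dz2, double r) {
    for (std::size_t k = 0; k < n; ++k) {
        const double dx = static_cast<double>(k) - cx;
        const double d = std::sqrt(dx * dx + dy2dz2) - r;
        row[k] = std::min(row[k], d);
    }
}

// Restructured shape: blocks of W elements, one short loop per step.
// Each lane only touches its own output, so no horizontal reduction occurs.
inline void stampRowBlocked(double* row, std::size_t n,
                            double cx, double dy2dz2, double r) {
    std::size_t k = 0;
    for (; k + W <= n; k += W) {
        double dx[W];
        double d[W];
        for (std::size_t i = 0; i < W; ++i) dx[i] = static_cast<double>(k + i) - cx;
        for (std::size_t i = 0; i < W; ++i) d[i] = std::sqrt(dx[i] * dx[i] + dy2dz2) - r;
        for (std::size_t i = 0; i < W; ++i) row[k + i] = std::min(row[k + i], d[i]);
    }
    // Scalar tail for the remaining n % W elements.
    for (; k < n; ++k) {
        const double dx = static_cast<double>(k) - cx;
        row[k] = std::min(row[k], std::sqrt(dx * dx + dy2dz2) - r);
    }
}
```

Both functions perform the same per-element arithmetic; the blocked form simply presents it as fixed-width steps that map directly onto Tuple or SIMD lanes.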

Finally, to keep this PR small and primarily infrastructural, it contains one vectorized tool port, rasterizeSdf(SphereSettings), which demonstrates how to both migrate from AoS->SoA and port existing scalar code to a templated method for VCL, Tuple and scalar arithmetic types. This implementation works with and without VCL.
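The following sketch shows the general shape of such a port under stated assumptions: an AoS sphere list is converted to SoA, and one templated distance function is instantiated for both `double` and a small tuple type. The names `SphereAoS`, `SpheresSoA`, `Tuple4` and `sphereSdf` are hypothetical and do not come from PointRasterizeSDF.h; the real implementation may be organized quite differently.

```cpp
#include <array>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

namespace sketch { // hypothetical namespace for illustration

struct SphereAoS { double x, y, z, r; };               // one struct per sphere
struct SpheresSoA { std::vector<double> x, y, z, r; }; // one array per component

// AoS -> SoA: gather each component into its own contiguous array.
inline SpheresSoA toSoA(const std::vector<SphereAoS>& in) {
    SpheresSoA out;
    for (const SphereAoS& s : in) {
        out.x.push_back(s.x);
        out.y.push_back(s.y);
        out.z.push_back(s.z);
        out.r.push_back(s.r);
    }
    return out;
}

// Minimal 4-lane tuple standing in for a SIMD container.
struct Tuple4 {
    std::array<double, 4> v{};
    static Tuple4 splat(double s) { Tuple4 t; t.v.fill(s); return t; }
    static Tuple4 load(const double* p) {
        Tuple4 t;
        for (std::size_t i = 0; i < 4; ++i) t.v[i] = p[i];
        return t;
    }
};

inline Tuple4 operator+(const Tuple4& a, const Tuple4& b) {
    Tuple4 r;
    for (std::size_t i = 0; i < 4; ++i) r.v[i] = a.v[i] + b.v[i];
    return r;
}
inline Tuple4 operator-(const Tuple4& a, const Tuple4& b) {
    Tuple4 r;
    for (std::size_t i = 0; i < 4; ++i) r.v[i] = a.v[i] - b.v[i];
    return r;
}
inline Tuple4 operator*(const Tuple4& a, const Tuple4& b) {
    Tuple4 r;
    for (std::size_t i = 0; i < 4; ++i) r.v[i] = a.v[i] * b.v[i];
    return r;
}
inline Tuple4 sqrt(const Tuple4& a) {
    Tuple4 r;
    for (std::size_t i = 0; i < 4; ++i) r.v[i] = std::sqrt(a.v[i]);
    return r;
}

// Signed distance from point p to a sphere (centre c, radius r).
// T may be double (one sphere) or Tuple4 (four spheres at once).
template <typename T>
T sphereSdf(const T& px, const T& py, const T& pz,
            const T& cx, const T& cy, const T& cz, const T& r) {
    using std::sqrt; // scalar case; the tuple overload is found by ADL
    const T dx = px - cx;
    const T dy = py - cy;
    const T dz = pz - cz;
    return sqrt(dx * dx + dy * dy + dz * dz) - r;
}

} // namespace sketch
```

With the SoA layout, four consecutive centres load directly with `Tuple4::load(soa.x.data() + i)`, which is the property a vector load relies on.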

The following table demonstrates the observed speedups with all configurations:

Scalar - That is, no AoS->SoA, no VCL or Tuples with no ISA targeting, with SSE42 and with AVX
Array<2> - Tuples of 2 doubles with no ISA targeting, with SSE42 and with AVX
Array<4> - Tuples of 4 doubles with no ISA targeting, with SSE42 and with AVX
Intrinsics - Using VCL with SSE42 (2 x doubles) and with AVX (4 x doubles)

Note that this particular tool requires no discussion over determinism of horizontal reduction or Intel vs AMD instruction specs - that is, there are no horizontal accumulations and no reciprocal emissions for this case. We can defer this discussion to a future PR.
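For context on why horizontal reductions raise a determinism question at all, here is a small generic sketch (not code from this PR, and with hypothetical function names) showing that summing the same values sequentially and summing them in two lanes followed by a horizontal add can round differently, because floating-point addition is not associative.

```cpp
#include <cassert>
#include <cstddef>

// Sequential accumulation: ((v0 + v1) + v2) + v3 ...
inline double sumSequential(const double* v, std::size_t n) {
    double s = 0.0;
    for (std::size_t i = 0; i < n; ++i) s += v[i];
    return s;
}

// Two-lane accumulation: even indices go to lane 0, odd indices to lane 1,
// followed by a single horizontal add of the two lanes.
inline double sumTwoLanes(const double* v, std::size_t n) {
    assert(n % 2 == 0); // keep the sketch simple: no scalar tail
    double lane[2] = {0.0, 0.0};
    for (std::size_t i = 0; i < n; i += 2) {
        lane[0] += v[i];
        lane[1] += v[i + 1];
    }
    return lane[0] + lane[1];
}
```

On a typical IEEE 754 double build without fast-math, the input {1e16, 1.0, -1e16, 1.0} sums to 1.0 sequentially (the first 1.0 is lost to rounding) and to 2.0 with two lanes. The per-lane minimum used for sphere rasterization has no such step, which is why the question does not arise here.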

I have working implementations of the following which, should this PR be accepted, I can further contribute:

PointRasterizeSDF.h rasterizeSdf(SmoothSpheres)
PointRasterizeSDF.h rasterizeSdf(Ellipsoids)
PointRasterizeTrilinear.h rasterizeTrilinear
PrincipalComponentAnalysis.h pca


Added copy of VCL (version2) v2.02.02 at f4617df57e17efcd754f5bbe0ec87883e0ed9ce6

1891047

Signed-off-by: Nick Avramoussis <4256455+Idclip@users.noreply.github.com>

Idclip requested review from apradhana, danrbailey, jmlait, kmuseth and richhones as code owners April 8, 2026 02:30

Idclip commented Apr 8, 2026

ext/THIRD-PARTY.md (resolved)

Idclip force-pushed the vcl_simd branch from cbc615a to b389520 Compare April 8, 2026 23:50

Idclip added 5 commits April 9, 2026 12:52

Fixed missing inline decls for some methods in vectorfp16e

2a58b4c

Signed-off-by: Nick Avramoussis <4256455+Idclip@users.noreply.github.com>

Added VCL SIMD header wrapper for aliasing between simd, tuple and scalar arithmetic

dcc0612

Signed-off-by: Nick Avramoussis <4256455+Idclip@users.noreply.github.com>

Initial manual vectorization/matching of SphericalTransfer scheme i.e. RasterizeSDF with spheres

0e72d45

Signed-off-by: Nick Avramoussis <4256455+Idclip@users.noreply.github.com>

Moved util/Simd.h to simd/Simd.h and fixed license

9adeb1f

Signed-off-by: Nick Avramoussis <4256455+Idclip@users.noreply.github.com>

Added weekly CI for VCL=ON (OFF by default)

dfb7d81

Signed-off-by: Nick Avramoussis <4256455+Idclip@users.noreply.github.com>

Idclip force-pushed the vcl_simd branch from b389520 to dfb7d81 Compare April 9, 2026 00:52

swahtz mentioned this pull request Apr 10, 2026

simd: Generic-T Simd<T,W> abstraction #2192

Open

Minor fixes for AVX512F/AVX512+

44c59d1

Signed-off-by: Nick Avramoussis <4256455+Idclip@users.noreply.github.com>

Idclip force-pushed the vcl_simd branch from 8b00eae to 44c59d1 Compare April 12, 2026 23:14

Idclip wants to merge 7 commits into AcademySoftwareFoundation:master from Idclip:vcl_simd
