Splinter: A Cooperative Userspace Hypervisor for Inference & Other Semantic Workloads

Local Large Language Model (LLM) inference is currently choking on the "Socket and Lock" tax. Standard IPC tools and databases require heavy context switching, serialization, and kernel interrupts just to synchronize state. When you are generating text or evaluating semantic alignment at token speeds, that overhead isn't just a bottleneck—it's a wall.

Splinter dismantles that wall. It is a lock-free, cooperative userspace hypervisor designed from the ground up for strict mechanical sympathy with modern CPU cache hierarchies (x86_64, ARM, and RISC). It puts your governance, your vector storage, and your inference engine in the exact same physical memory lane.

Think of Splinter as a semantic breadboard. It provides a passive, shared-memory manifold where thousands of context or classification windows for multimodal inference run simultaneously with 100% non-blocking throughput. It provides the following to all processes using it:

Vector storage
Cache & KV Storage
Atomic Integer Operations (similar to Redis)
Ordered sets (Similar to Redis hashes, but designed specifically for Physics)
Persistence or In-memory residence
Eventfd-backed or poll-able pub/sub notifications
Bloomable store with feature flags
Self-elected univocality for shared region use and governance via POSIX advisement primitives
A production-grade CLI/REPL for debugging and operations

All in a size that stays in the CPU hot path, and doesn't require copying between services.

The Core Concept: Cooperative Virtualization

Splinter treats your local workspace not as a database pipeline, but as an execution topology:

The Privilege Topology: When run as root, Splinter is ringless, bypassing standard OS arbitration barriers to align directly with CPU L1/L3 cache lines and pin NUMA nodes. Under an underprivileged user, it acts as a single-ring virtualized environment, managing its own ecosystem of agents and shards without trapping down into Ring 0.
Cooperative Scheduling: Rather than using aggressive, preemptive hardware interrupts that cause CPU thrashing, Splinter coordinates access via an aligned 32-slot bid table. Shards check a shared sovereignty table and voluntarily yield memory regions using a protocol backed by POSIX madvise primitives.

Architectural Features

Zero-Copy Substrate: Multi-dimensional arrays mapped via mmap are treated as raw, continuous memory lanes. Client-side WASM engines (via WASMEdge) or local Lua scripts execute fixed-width SIMD instructions directly over the shared pointers without intermediate serialization.
Mechanical Sympathy: Lock-free, 64-byte aligned architecture utilizing sequence locks (inspired by the Xen hypervisor) ensures readers never block writers, maintaining pristine L3 cache residency.
In-Place Atomics: Execute bitwise and mathematical operations directly on BIGUINT keys within the shared memory pool.
Externalized Hippocampus: Agents can dynamically spin up local semantic scratchpads in .splinter/ to offload raw context retrieval from their expensive API windows, dropping token consumption by up to 45% while driving down hallucination rates.

The Splinter Toolchain

Splinter is accompanied by a minimalist, bare-metal C toolchain designed for production-grade telemetry and control:

splinference: An embedding inference engine that maps directly to the bus with no socket layer, managed natively by systemd.
splainference: A completion and conversational runtime that maps system prompts, generation windows, and active RAG contexts directly to the shared substrate.
splinterctl & splinterpctl: A lightning-fast CLI and REPL that completely isolates administrative interaction from core storage performance.
sidecar: A DevOps monitoring tool providing real-time visibility into the semantic bus, tracking active slots, bid windows, and evProcessor tension metrics.

Building

On Debian/Ubuntu you can run scripts/bigbang.sh to automatically install Splinter with vector storage and llama.cpp from source and install both on your system. This will also download Nomic Text 1.5 for you from Hugging Face, which is all you need to get started with Splinter as a semantic breadboard.

If you want to build advanced things (lua, WASMEdge, NUMA) then run ./configure --help to see how to enable those options. A --install-deb-deps option is provided by the script to automatically install all the Debian packages you need to build Splinter's optional dependencies.

A typical build:

./configure --with-llama --with-vectors --with-lua --with-wasm --with-numa
make
sudo -E make install

Valgrind will be used during make tests if you have it on your system. See splinter_chi_sao for benchmarks.

Performance Under Fire

Tested rigorously under strict hardware constraints (including fanless, low-tier Chromebook development environments):

Multi-Reader, Single-Writer (MRSW): Sustains 3.2 million operations/second with zero data corruption.
Multi-Reader, Multi-Writer (MRMW): Utilizing the disjointed-lane collision resolution protocol (splinter_chi_sao), 32 concurrent writers sustain 15.6 million operations/second with zero data corruption.

Governance and Open Source Commitment

Splinter is open-source because transparent, text-driven governance and systemic bias auditing should be fundamental infrastructure, not a locked enterprise feature.

Once version 1.2.0 ships (Target: Mid-May/Early June 2026), the core repository will move into long-term maintenance mode. It will receive continuous updates, performance optimizations, and community-driven enhancements and fixes to existing features, but the feature set will remain broadly locked. Splinter can't keep its original identity and be a kitchen sink.

Splinter will never be abandoned. By design, memory lifecycle management is kept ruthlessly clean: long-term, high-priority semantic stores are explicitly tracked via Git, while transient working memory scratchpads are purged safely by the operator using standard development tooling like make distclean. Commercial development will focus entirely on the high-level semantic classification layers and application ecosystems built on top of this bedrock, leaving the open substrate pristine, public, and free.

After 1.2.0, improvements would be so specific to the author's use case that very few others would benefit from them. For the most part, even the commercial code will remain open, just no longer part of this project.

Name		Name	Last commit message	Last commit date
Latest commit History 824 Commits
.github		.github
.vscode		.vscode
3rdparty		3rdparty
bindings		bindings
lume-website		lume-website
scripts		scripts
.gitignore		.gitignore
.gitkeep		.gitkeep
BUILDING.md		BUILDING.md
CMakeLists.txt		CMakeLists.txt
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE.txt		LICENSE.txt
Makefile		Makefile
README.md		README.md
README.v4.md		README.v4.md
SECURITY.md		SECURITY.md
build.h		build.h
config.h		config.h
configure		configure
sidecar.c		sidecar.c
splainference.cpp		splainference.cpp
splinference.cpp		splinference.cpp
splinter.c		splinter.c
splinter.h		splinter.h
splinter_chi_sao.c		splinter_chi_sao.c
splinter_cli.h		splinter_cli.h
splinter_cli_cmd_append.c		splinter_cli_cmd_append.c
splinter_cli_cmd_bind.c		splinter_cli_cmd_bind.c
splinter_cli_cmd_bump.c		splinter_cli_cmd_bump.c
splinter_cli_cmd_caps.c		splinter_cli_cmd_caps.c
splinter_cli_cmd_clear.c		splinter_cli_cmd_clear.c
splinter_cli_cmd_config.c		splinter_cli_cmd_config.c
splinter_cli_cmd_export.c		splinter_cli_cmd_export.c
splinter_cli_cmd_get.c		splinter_cli_cmd_get.c
splinter_cli_cmd_head.c		splinter_cli_cmd_head.c
splinter_cli_cmd_help.c		splinter_cli_cmd_help.c
splinter_cli_cmd_hist.c		splinter_cli_cmd_hist.c
splinter_cli_cmd_ingest.c		splinter_cli_cmd_ingest.c
splinter_cli_cmd_init.c		splinter_cli_cmd_init.c
splinter_cli_cmd_label.c		splinter_cli_cmd_label.c
splinter_cli_cmd_list.c		splinter_cli_cmd_list.c
splinter_cli_cmd_lua.c		splinter_cli_cmd_lua.c
splinter_cli_cmd_math.c		splinter_cli_cmd_math.c
splinter_cli_cmd_orders.c		splinter_cli_cmd_orders.c
splinter_cli_cmd_search.c		splinter_cli_cmd_search.c
splinter_cli_cmd_set.c		splinter_cli_cmd_set.c
splinter_cli_cmd_shard.c		splinter_cli_cmd_shard.c
splinter_cli_cmd_type.c		splinter_cli_cmd_type.c
splinter_cli_cmd_unset.c		splinter_cli_cmd_unset.c
splinter_cli_cmd_use.c		splinter_cli_cmd_use.c
splinter_cli_cmd_uuid.c		splinter_cli_cmd_uuid.c
splinter_cli_cmd_wasm.c		splinter_cli_cmd_wasm.c
splinter_cli_cmd_watch.c		splinter_cli_cmd_watch.c
splinter_cli_input.c		splinter_cli_input.c
splinter_cli_main.c		splinter_cli_main.c
splinter_cli_tok.c		splinter_cli_tok.c
splinter_cli_util.c		splinter_cli_util.c
splinter_stress.c		splinter_stress.c
splinter_test.c		splinter_test.c
splinter_thesis.pdf		splinter_thesis.pdf
splinterctl_tests.sh		splinterctl_tests.sh
splinterrc_example		splinterrc_example
test.lua		test.lua
test.wasm		test.wasm

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Splinter: A Cooperative Userspace Hypervisor for Inference & Other Semantic Workloads

The Core Concept: Cooperative Virtualization

Architectural Features

The Splinter Toolchain

Building

Performance Under Fire

Governance and Open Source Commitment

About

Uh oh!

Releases 5

Sponsor this project

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Splinter: A Cooperative Userspace Hypervisor for Inference & Other Semantic Workloads

The Core Concept: Cooperative Virtualization

Architectural Features

The Splinter Toolchain

Building

Performance Under Fire

Governance and Open Source Commitment

About

Topics

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 5

Sponsor this project

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages