Aspasia of Miletus (~470-400 BCE) taught Socrates the art of rhetoric and dialectic. Plato credits her in the Menexenus; Plutarch describes her as a thinker consulted on matters of state. She didn’t cross-examine for sport — she sharpened others' reasoning constructively.
This system is named for her because it corrects the corrector. StatistEase uses Socratic method (LLM routes questions, Julia answers). Aspasia audits that entire process — the teacher checking the student’s work. And like her namesake, Aspasia is not an annoying gadfly. She is a wise advisor who helps you feel more confident in your results, not less.
StatistEase computes statistics using Julia. An LLM routes questions to Julia functions and explains the results. No neural numbers — good.
But three problems remain:

- Did Julia compute correctly? Software has bugs. Numerical libraries have edge cases. A single implementation is a single point of failure.
- Was this the right test? The LLM chose a t-test. Should it have been Mann-Whitney? The LLM doesn’t know — it pattern-matches. It might be right. It might be confidently wrong.
- Is the explanation accurate? The LLM says "large effect size" but Cohen’s d is 0.35. That’s small-to-medium. The number was correct; the interpretation was not.
Aspasia solves all three by providing an independent neurosymbolic audit from a completely separate codebase, in a completely different language, using a completely different reasoning engine.
This is the most important design decision in the project, and it is not overcomplicated — it is the minimum necessary for genuine independence:
| Property | StatistEase (Julia) | Aspasia (GNU Octave) |
|---|---|---|
| Numerical backend | OpenBLAS | LAPACK/BLAS (system) |
| Sorting algorithm | Julia’s QuickSort | Octave’s std::sort |
| Statistical library | StatsBase.jl | Octave statistics package |
| Floating-point path | Julia’s LLVM codegen | GCC/gfortran codegen |
| Reasoning engine | LLM tool calling | Prolog + DeepProbLog |
| Developer community | Scientific computing | Engineering + applied maths |
If both systems produce the same answer via different code paths, that answer is far more trustworthy than either system alone. If they disagree, that disagreement is valuable information — it reveals either a bug or a genuine numerical sensitivity.
Using the same language would mean the same library, the same bugs, the same blind spots. That is not independence. That is redundancy.
This also attracts different developers. Julia people and Octave/MATLAB people come from different disciplines, think differently about numerical problems, and catch different classes of bugs. Two independent communities competing toward the same goal — correctness — makes both systems better.
StatistEase Aspasia
┌────────────────────┐ ┌────────────────────┐
│ Julia computation │ │ Octave recompute │
│ (the numbers) │──── JSON ───►│ (same data, diff │
│ │ transaction│ code path) │
└────────┬───────────┘ └────────┬───────────┘
│ │
│ ┌────────▼───────────┐
│ │ Prolog ontology │
│ │ (was this the │
│ │ RIGHT test?) │
│ └────────┬───────────┘
│ │
│ ┌────────▼───────────┐
│ │ Interpretation │
│ │ audit (does the │
│ │ explanation match │
│ │ the numbers?) │
│ └────────┬───────────┘
│ │
▼ ▼
┌────────────────────────────────────────────────────────┐
│ USER SEES BOTH │
│ Result: t(38) = 2.847, p = .007, d = 0.90 │
│ Audit: VERIFIED — computation, test selection, and │
│ interpretation all check out. │
└────────────────────────────────────────────────────────┘

"Did the computation produce the correct numbers?"
Aspasia independently recomputes every statistical result using GNU Octave. Different language, different BLAS, different floating-point code paths. If the results match within tolerance (1e-10), the numbers are verified.
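The comparison step can be sketched as follows. This is an illustrative Python sketch, not the actual Octave implementation; the function name and the relative-tolerance refinement (so that large statistics are not held to the same absolute bar as small p-values) are assumptions.

```python
# Hypothetical sketch of Aspasia's match-within-tolerance check.
# The real audit runs in Octave; names here are illustrative.

def results_match(statistease_value: float, aspasia_value: float,
                  tol: float = 1e-10) -> bool:
    """Verify two independently computed statistics agree.

    Scaling the tolerance by the magnitude of the values keeps the
    check meaningful whether the statistic is near 0.007 or near 100.
    """
    diff = abs(statistease_value - aspasia_value)
    scale = max(abs(statistease_value), abs(aspasia_value), 1.0)
    return diff <= tol * scale

# Both engines compute the same t-statistic via different BLAS paths:
print(results_match(2.846993, 2.846993))          # True: verified
print(results_match(2.846993, 2.846993 + 1e-6))   # False: flag for resolution
```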
"Was this the right test to run?"
A Prolog knowledge base encodes Stevens' measurement scales, test prerequisites, assumption requirements, and nonparametric alternatives. DeepProbLog extends this with probabilistic confidence.
This is genuinely neurosymbolic — the probabilities can be learned from data (neural) while the logical structure is fixed (symbolic).
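The flavour of rule the ontology encodes can be sketched in Python (the real rules live in Prolog; the rule set, names, and return strings below are illustrative assumptions, not the actual knowledge base):

```python
# Hypothetical sketch of test-selection rules: Stevens scales,
# assumption requirements, and nonparametric fallbacks.

RULES = {
    "t_test_independent": {
        "scales": {"interval", "ratio"},   # needs at least interval data
        "assumptions": {"normality", "equal_variance"},
        "nonparametric_alternative": "mann_whitney_u",
    },
    "mann_whitney_u": {
        "scales": {"ordinal", "interval", "ratio"},
        "assumptions": set(),
        "nonparametric_alternative": None,
    },
}

def audit_test_choice(test: str, scale: str, assumptions_met: set) -> str:
    rule = RULES[test]
    if scale not in rule["scales"]:
        return f"CHALLENGE: {test} requires {sorted(rule['scales'])} data, got {scale}"
    missing = rule["assumptions"] - assumptions_met
    if missing and rule["nonparametric_alternative"]:
        return (f"CHALLENGE: {sorted(missing)} not established; "
                f"consider {rule['nonparametric_alternative']}")
    return "VERIFIED: test choice consistent with the ontology"

# A t-test on ordinal data draws a challenge:
print(audit_test_choice("t_test_independent", "ordinal", {"normality"}))
```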
"Does the LLM’s explanation accurately represent the result?"
Cross-references effect size labels against Cohen’s conventions, checks for p-value misinterpretation (ASA 2016 statement), detects significance inflation with large N, and flags missing assumption discussions.
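The effect-size cross-reference can be sketched like this. The thresholds follow Cohen's (1988) conventions for d (0.2 small, 0.5 medium, 0.8 large); the function names and message format are illustrative assumptions:

```python
# Hypothetical sketch of the interpretation audit for effect-size labels.

def cohens_d_label(d: float) -> str:
    """Conventional label for Cohen's d (Cohen 1988)."""
    d = abs(d)
    if d < 0.2:
        return "negligible"
    if d < 0.5:
        return "small"
    if d < 0.8:
        return "medium"
    return "large"

def audit_effect_size(claimed_label: str, d: float) -> str:
    actual = cohens_d_label(d)
    if claimed_label == actual:
        return f"VERIFIED: d = {d} is conventionally '{actual}'"
    return (f"CHALLENGE: explanation says '{claimed_label}' but "
            f"d = {d} is conventionally '{actual}'")

# The example from earlier: the LLM called d = 0.35 a "large" effect.
print(audit_effect_size("large", 0.35))
```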
Aspasia operates as an AUDITOR, not a gatekeeper:

- It NEVER modifies StatistEase output
- It NEVER prevents computation
- It ALWAYS explains WHY it raises a concern
- It tracks its own accuracy (precision and recall of challenges)
- It learns from user feedback (Logtalk knowledge base)
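Tracking the precision and recall of challenges amounts to scoring each challenge against later user feedback. A minimal sketch, assuming a simple feedback log (the field names and function are illustrative, not the Logtalk knowledge base's actual schema):

```python
# Hypothetical sketch of challenge-accuracy tracking from user feedback.

def challenge_accuracy(log: list) -> dict:
    """Precision and recall of Aspasia's challenges.

    Each entry records whether Aspasia challenged a result and
    whether the user confirmed a real problem existed.
    """
    tp = sum(1 for e in log if e["challenged"] and e["real_problem"])
    fp = sum(1 for e in log if e["challenged"] and not e["real_problem"])
    fn = sum(1 for e in log if not e["challenged"] and e["real_problem"])
    precision = tp / (tp + fp) if tp + fp else 1.0
    recall = tp / (tp + fn) if tp + fn else 1.0
    return {"precision": precision, "recall": recall}

feedback = [
    {"challenged": True,  "real_problem": True},   # justified challenge
    {"challenged": True,  "real_problem": False},  # false alarm
    {"challenged": False, "real_problem": True},   # missed problem
    {"challenged": True,  "real_problem": True},   # justified challenge
]
print(challenge_accuracy(feedback))  # precision 2/3, recall 2/3
```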
A disagreement between two independent systems is not a failure — it is information. The magnitude, location, and nature of the disagreement tells you something about your data that neither system alone could reveal.
Before asking a human, Aspasia runs a systematic resolution protocol:
| Step | Method | Confidence |
|---|---|---|
| 1 | NIST StRD reference values — certified answers to 15+ digits (McCullough & Wilson 1999) | Definitive |
| 2 | Arbitrary precision recomputation — Neumaier compensated summation at extended precision | High |
| 3 | Interval arithmetic — guaranteed enclosures; if both values fall inside, they’re compatible | High |
| 4 | Perturbation analysis — jitter inputs by 1 ULP; if output swings wildly, the problem is ill-conditioned and neither answer is reliable | Diagnostic |
| 5 | Symbolic verification — compute exact answer via sorted summation or CAS (Maxima) | Definitive |
| 6 | Escalate to human — with FULL evidence from steps 1-5 and both systems' working | Last resort |
Most disagreements resolve at steps 1-3. Step 4 is particularly valuable: when it triggers, it means the data itself doesn’t support the precision being claimed — that’s a finding about the research, not a bug in the software.
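Step 4 can be sketched concretely. This is an illustrative Python sketch (the real ladder runs elsewhere); it jitters each input by one ULP, using the standard library's `math.ulp`, and measures how far a statistic moves:

```python
import math
import statistics

# Hypothetical sketch of step 4 (perturbation analysis).
# The statistic here is a simple mean; the real system would
# perturb whatever statistic is under dispute.

def perturbation_spread(data: list, stat=statistics.mean) -> float:
    """Max deviation of the statistic when each input is jittered by 1 ULP."""
    baseline = stat(data)
    spread = 0.0
    for i, x in enumerate(data):
        for jittered in (x + math.ulp(x), x - math.ulp(x)):
            perturbed = list(data)
            perturbed[i] = jittered
            spread = max(spread, abs(stat(perturbed) - baseline))
    return spread

# Well-conditioned data: the mean barely moves under 1-ULP jitter,
# so answers agreeing to 1e-10 are meaningful. A spread comparable to
# the disagreement itself would mean the problem is ill-conditioned.
print(perturbation_spread([1.2, 3.4, 5.6, 7.8]) < 1e-12)
```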
When the resolution ladder exhausts automated methods:

1. StatistEase computes (Julia)
2. Aspasia audits (Octave + Prolog)
3. echidna arbitrates (formal proofs via GraphQL)
If echidna cannot resolve the dispute (because it’s a judgment call, not a mathematical fact), the system escalates to the human with full evidence from all three systems. It says:
"We tried our best but we are coming up against conflicts. Here is everything we checked, everything we found, and what each system thinks. You need to decide."
This is honest. It is more useful than silently picking one answer.
- GNU Octave 8+ with the statistics package
- SWI-Prolog 9+ (for ontological reasoning)
- Logtalk 3+ (for knowledge base management)
- Optional: DeepProbLog (for probabilistic logic)
octave --eval "pkg install -forge statistics"
octave --path src/verification:src/audit:src/interface \
  --eval "audit_from_json('/path/to/transaction.json')"

PMPL-1.0-or-later (Palimpsest License)
Copyright (c) 2026 Jonathan D.A. Jewell (hyperpolymath) <j.d.a.jewell@open.ac.uk>