Input all categorical data via YAML files by ScriptRaccoon · Pull Request #164 · ScriptRaccoon/CatDat

ScriptRaccoon · 2026-05-07T15:39:33Z

All categorical data is now authored in YAML files. Contributors no longer need to touch SQL seed files anymore. YAML is more readable, structured, and independent of the database layer.

To support this change, existing seed data has been converted (once) into YAML. From now on, YAML is the single source of truth for seeded data. The SQL seed files have been deleted, but the SQLite database is kept as before. It is still used for storage, querying, and for running deduction scripts. The schema is still defined in SQL files.

There is a major improvement: each category now has its own YAML file. These files no longer contain only the basic metadata, but also tags, related categories, satisfied and unsatisfied properties of the category, and more (see the example below). This information is no longer distributed across many SQL files, making it much easier to add new categories and maintain existing ones.

The same changes have been made for functors. In addition, each property (of a category or a functor) now has its own YAML file. The implications are still grouped by topic, but the topics have been refined, so each file is now more focused.

Previously, SQL seed files were doing two things at once: defining the data and also implementing the logic to insert and normalize it into multiple tables. This PR separates those concerns. The insertion logic is now handled explicitly in the new seed script, while YAML only defines the data itself. This does introduce an extra layer (YAML parsing + import step), but this layer already existed implicitly before in the SQL seed files.

A challenge was finding a good and consistent way to format long strings used as YAML values. This will likely need further improvement in the future. It was also tested whether Markdown could be used inside fields, but this caused more problems than it solved.

This PR resolves #163 (but with YAML instead of markdown).

Example

Here is the current version Ab.yaml. All the information about this category is in one file.

id: Ab
name: category of abelian groups
notation: $\Ab$
objects: abelian groups
morphisms: group homomorphisms
description: This is the prototype of an abelian category.
nlab_link: https://ncatlab.org/nlab/show/Ab

tags:
  - algebra

related_categories:
  - Ab_fg
  - CMon
  - FinAb
  - FreeAb
  - Grp
  - R-Mod
  - TorsAb
  - TorsFreeAb

satisfied_properties:
  - property_id: locally small
    reason: There is a forgetful functor $\Ab \to \Set$ and $\Set$ is locally small.

  - property_id: abelian
    reason: This is standard, see <a href="https://ncatlab.org/nlab/show/Categories+for+the+Working+Mathematician" target="_blank">Mac Lane</a>, Ch. VIII.

  - property_id: finitary algebraic
    reason: Take the algebraic theory of a commutative group.

unsatisfied_properties:
  - property_id: split abelian
    reason: The short exact sequence $0 \xrightarrow{} \IZ \xrightarrow{p} \IZ \xrightarrow{} \IZ/p \xrightarrow{} 0$ does not split.

  - property_id: skeletal
    reason: This is trivial.

  - property_id: CSP
    reason: The canonical homomorphism $\bigoplus_{n \geq 0} \IZ \to \prod_{n \geq 0} \IZ$ is not surjective, hence no epimorphism.

special_objects:
  initial object:
    description: trivial group
  terminal object:
    description: trivial group
  coproducts:
    description: direct sums
  products:
    description: direct products with pointwise operations

special_morphisms:
  isomorphisms:
    description: bijective homomorphisms
    reason: This characterization holds in every algebraic category.
  monomorphisms:
    description: injective homomorphisms
    reason: 'This holds in every finitary algebraic category: the forgetful functor to $\Set$ is faithful, hence reflects monomorphisms, and it is continuous (even representable), hence preserves monomorphisms.'
  epimorphisms:
    description: surjective homomorphisms
    reason: 'For the non-trivial direction, if $f : A \to B$ is an epimorphism, then $p \circ f = 0$ for the projection  $p : B \to B/f(A)$ implies that $p = 0$, so that $B = f(A)$.'
  regular monomorphisms:
    description: same as monomorphisms
    reason: This is because the category is mono-regular.
  regular epimorphisms:
    description: same as epimorphisms
    reason: This is because the category is epi-regular.

as for error logging, unfortunately, due to deferred FKs, it is currently not possible to log the exact file where the FK violation happened

fix typo

…allowed

ScriptRaccoon mentioned this pull request May 7, 2026

Use IDs to order tags, special objects, and special morphisms #165

Merged

ScriptRaccoon force-pushed the yaml-transformation branch from 02a836c to 0615d32 Compare May 7, 2026 16:09

ScriptRaccoon changed the title ~~Yaml transformation (WIP)~~ Input all categorical data via YAML files (WIP) May 7, 2026

ScriptRaccoon changed the title ~~Input all categorical data via YAML files (WIP)~~ Input all categorical data via YAML files May 8, 2026

ScriptRaccoon marked this pull request as ready for review May 8, 2026 18:03

ScriptRaccoon added infra feature labels May 8, 2026

ScriptRaccoon force-pushed the yaml-transformation branch 2 times, most recently from 77744ba to 07ca96d Compare May 9, 2026 17:06

ScriptRaccoon mentioned this pull request May 9, 2026

Add category of compact Hausdorff spaces #160

Open

ScriptRaccoon force-pushed the yaml-transformation branch 4 times, most recently from 1ff7bdc to ddb213b Compare May 10, 2026 12:42

ScriptRaccoon added 16 commits May 11, 2026 17:22

generate yaml files for all categories

055bd7a

generate yaml files for all properties of categories

107e3d8

generate yaml files for all lemmas

2f46b42

generate yaml files for all implications by topic

32ef843

generate yaml files for functors, their properties and implications

3cf8b91

save comments from deprecated sql files

79cd25c

seed all data from yaml files

89c6fb3

as for error logging, unfortunately, due to deferred FKs, it is currently not possible to log the exact file where the FK violation happened

delete old SQL files and old seed script

1969077

bring back schema diff check

0e01cf7

refactor seed script for yaml files

5eb3ead

format yaml files: replace " with ', unescape backslashes

c165fc7

remove |- from yaml seed files, put string values into one line for now

92ce641

italics for functor property defs

3fe856c

replace <br> in yaml files with proper line breaks + render them as <br>

f97eca7

improve list formatting in yaml files

952bfd1

Ab_fg is related to Ab

3c5d816

ScriptRaccoon added 6 commits May 11, 2026 17:30

mention YAML files in documentation

27bc039

use separate config file for cspell, fix typos

7d73a21

fix typo

define macros in yaml file

1a1b92b

put $$ formulas in separate lines in all yaml files

1cc5ac1

rename file

6a2c170

always pass nlab_link and description in category yaml file, null is …

89e925d

…allowed

ScriptRaccoon force-pushed the yaml-transformation branch from cda6e49 to 89e925d Compare May 11, 2026 15:40

ScriptRaccoon merged commit fa0c3c4 into main May 11, 2026
1 check passed

ScriptRaccoon deleted the yaml-transformation branch May 11, 2026 15:48

ScriptRaccoon mentioned this pull request May 12, 2026

Add the category of countable groups #171

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Input all categorical data via YAML files#164

Input all categorical data via YAML files#164
ScriptRaccoon merged 22 commits into
mainfrom
yaml-transformation

ScriptRaccoon commented May 7, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

ScriptRaccoon commented May 7, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Example

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

ScriptRaccoon commented May 7, 2026 •

edited

Loading