Goal
Add a tiny downstream analysis recipe so people can actually use the dataset without writing boilerplate.
Input
Use one of:
Suggested output
A short example under examples/ that shows how to:
- load the compressed JSONL
- count top styles/subjects/model families
- sample records by label
- print prompt + source URL for inspection
DuckDB, Polars, or plain Python are all fine.
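As a starting point, the four steps above could look something like the plain-Python sketch below. It uses only the standard library. Several things in it are assumptions, since this issue does not specify them: that the file is gzip-compressed (`.jsonl.gz`), and that records carry fields named `style`, `subject`, `model_family`, `label`, `prompt`, and `source_url`. Those names are hypothetical and should be swapped for the dataset's real schema. The sketch only prints URLs and never fetches media, but the media-rights caveat should still accompany any example that surfaces them.

```python
import gzip
import json
import random
from collections import Counter


def load_records(path):
    """Read a gzip-compressed JSONL file into a list of dicts (gzip is an assumption)."""
    records = []
    with gzip.open(path, "rt", encoding="utf-8") as fh:
        for line in fh:
            line = line.strip()
            if line:  # skip blank lines
                records.append(json.loads(line))
    return records


def top_counts(records, field, n=10):
    """Return the n most common values of `field`; list values are counted per item."""
    counter = Counter()
    for rec in records:
        value = rec.get(field)
        if value is None:
            continue
        if isinstance(value, list):
            counter.update(value)
        else:
            counter[value] += 1
    return counter.most_common(n)


def sample_by_label(records, label, k=5, label_field="label", seed=0):
    """Return up to k records whose `label_field` equals `label`, sampled reproducibly."""
    matches = [r for r in records if r.get(label_field) == label]
    rng = random.Random(seed)
    return rng.sample(matches, min(k, len(matches)))


def format_record(rec, prompt_field="prompt", url_field="source_url"):
    """Format the prompt and source URL for manual inspection (URL is printed, not fetched)."""
    prompt = rec.get(prompt_field, "<no prompt>")
    url = rec.get(url_field, "<no url>")
    return f"{prompt}\n  source: {url}"


def main(path, label):
    """Run the whole recipe on one file; field names below are hypothetical."""
    records = load_records(path)
    for field in ("style", "subject", "model_family"):
        print(field, top_counts(records, field, n=5))
    for rec in sample_by_label(records, label, k=3):
        print(format_record(rec))
```

Calling `main(path, label)` with the path to the compressed file and a label value that exists in the data prints the top five values per field, followed by three sampled records. The fixed `seed` keeps the sample stable between runs, which makes it easier to compare inspection notes.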
Guardrails
- No paid APIs.
- No secrets.
- Include the media-rights caveat if touching image URLs.