GitHub - rafacm/ragtime: RAGtime ("Retrieval in the Key of Jazz") is a Django application for ingesting jazz-related podcast episodes and powering Scott, a jazz-focused conversational agent.

RAGtime -- Retrieval Augmented Generation (RAG) in the Key of Jazz

Image generated with Nano Banana from the cover of E.L. Doctorow's novel "Ragtime".

What is RAGtime?

RAGtime is a Django application for ingesting jazz-related podcast episodes. It extracts metadata, transcribes audio, identifies jazz entities, and powers Scott — a jazz-focused AI agent that answers questions strictly from ingested episode content, with references to specific episodes and timestamps.

Scott answering questions about Django Reinhardt

Features

🎙️ Episode Ingestion — Add podcast episodes by URL. RAGtime scrapes metadata (title, description, date, image), downloads audio, and processes it through the pipeline.
📝 Multilingual Transcription — Transcribes episodes using configurable backends (Whisper API by default) with segment and word-level timestamps. Supports multiple languages (English, Spanish, German, Swedish, etc.).
🔍 Entity Extraction — Identifies jazz entities: musicians, musical groups, albums, music venues, recording sessions, record labels, years. Entities are resolved against existing records using LLM-based matching.
📇 Episode Indexing — Splits transcripts into segments and generates multilingual embeddings stored in Qdrant. Enables cross-language semantic search so Scott can retrieve relevant content regardless of the question's language.
🎷 Scott — Your Jazz AI — A conversational agent that answers questions strictly from ingested episode content. Scott responds in the user's language and provides references to specific episodes and timestamps. Responses stream in real-time.
📊 AI Evaluation — Measures pipeline and Scott quality using RAGAS (faithfulness, answer relevancy, context precision/recall) with scores tracked in Langfuse.
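
To make the indexing and retrieval idea above concrete, here is a minimal sketch in plain Python. It is not RAGtime's code: the `Chunk` type, the toy three-dimensional vectors, and `format_reference` are hypothetical names chosen for illustration. A real deployment would delegate the similarity search to Qdrant and obtain the vectors from a multilingual embedding model, which is what lets a question in one language match a transcript in another.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class Chunk:
    """Hypothetical record for one indexed transcript chunk."""
    episode: str
    start_seconds: float
    text: str
    vector: tuple[float, ...]


def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def top_k(query_vector, chunks, k=2):
    """Return the k chunks whose vectors are most similar to the query vector."""
    ranked = sorted(chunks, key=lambda c: cosine(query_vector, c.vector), reverse=True)
    return ranked[:k]


def format_reference(chunk):
    """Render an episode-and-timestamp reference such as 'Episode 12 @ 12:34'."""
    minutes, seconds = divmod(int(chunk.start_seconds), 60)
    return f"{chunk.episode} @ {minutes:02d}:{seconds:02d}"
```

Keeping the episode and start time next to each vector is what makes the timestamped references possible: whatever the search returns already carries the data needed for the citation.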

Status

RAGtime is under active development.

What's already implemented

Episode ingestion: submit episodes by URL, metadata scraping, audio download, transcription, summarization, chunking, entity extraction and resolution with Wikidata integration, and multilingual embeddings into Qdrant.
Episode management UI: Django admin interface to view episode status and metadata and browse extracted entities.
Configuration wizard: interactive manage.py configure command for all RAGTIME_* env vars.
Telemetry: OpenTelemetry-based tracing for pipeline steps and LLM calls with optional collectors: console, Jaeger, and Langfuse.
Agent-based recovery: Pydantic AI agent with Playwright browser automation recovers from scraping and downloading failures automatically.
Scott chatbot: strict-RAG conversational agent that answers questions only from ingested episode content, with citations and real-time streaming via AG-UI. The React frontend is built with assistant-ui, and conversation history is persisted in Django.

See CHANGELOG.md for the full list of implemented features, fixes, implementation plans, feature documentation and session transcripts.

What's coming

AI evaluation: measure pipeline and Scott quality using RAGAS (faithfulness, answer relevancy, context precision/recall) with scores tracked in Langfuse. Enables regression testing across prompt and model changes.

Processing Pipeline

Each step updates the episode's status field. A post_save signal starts a DBOS durable workflow that sequences all steps with PostgreSQL-backed checkpointing — on crash or restart, the workflow resumes from the last completed step. Failures trigger the recovery layer.

#	Step	Status	Description
1	📥 Submit	`pending`	User submits an episode URL
2	🕷️ Scrape	`scraping`	Extract metadata and detect language
3	⬇️ Download	`downloading`	Download audio and extract duration
4	🎙️ Transcribe	`transcribing`	Whisper API transcription with timestamps
5	📋 Summarize	`summarizing`	LLM-generated episode summary
6	✂️ Chunk	`chunking`	Split transcript into ~150-word chunks
7	🔍 Extract	`extracting`	Named entity recognition per chunk
8	🧩 Resolve	`resolving`	Entity linking and deduplication via Wikidata
9	📐 Embed	`embedding`	Multilingual embeddings into Qdrant
10	✅ Ready	`ready`	Episode available for Scott to query
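
The resume-from-last-completed-step behaviour described above can be illustrated with a small sketch. This is plain Python, not DBOS: the `run_pipeline` function, the `handlers` mapping, and the in-memory `checkpoints` set are assumptions made for the example, standing in for the PostgreSQL-backed checkpointing that the durable workflow provides. Only the status names are taken from the table.

```python
from typing import Callable

# Status values copied from the pipeline table, in order.
# "pending" is the initial state and "ready" is the final one.
STEPS = [
    "scraping", "downloading", "transcribing", "summarizing",
    "chunking", "extracting", "resolving", "embedding",
]


def run_pipeline(
    episode: dict,
    handlers: dict[str, Callable[[dict], None]],
    checkpoints: set[str],
) -> str:
    """Run every step that has no checkpoint yet, updating the episode status.

    If a handler raises, the exception propagates and the status stays at the
    failing step. Calling run_pipeline again with the same checkpoints set
    skips the steps that already completed.
    """
    for status in STEPS:
        if status in checkpoints:
            continue  # completed in an earlier run, do not repeat it
        episode["status"] = status
        handlers[status](episode)  # may raise; the checkpoint is then not recorded
        checkpoints.add(status)
    episode["status"] = "ready"
    return episode["status"]
```

In this sketch a failed step simply raises. In RAGtime, according to the description above, a failure would instead trigger the recovery layer before the workflow continues.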

See the full pipeline documentation for per-step details, entity types, and the recovery layer.
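
As a rough illustration of the chunking step (step 6), the sketch below groups timestamped words into fixed windows of 150 words. It is an assumption-laden simplification: `chunk_words` is a hypothetical helper, the input format of `(word, start_seconds)` pairs is invented for the example, and the real splitter may well respect segment or sentence boundaries, which is presumably why the table says "~150" rather than exactly 150.

```python
def chunk_words(words: list[tuple[str, float]], target: int = 150) -> list[dict]:
    """Group (word, start_seconds) pairs into chunks of at most `target` words.

    Each chunk keeps the start time of its first word so that a later
    citation can point back to a position in the audio.
    """
    chunks = []
    for i in range(0, len(words), target):
        window = words[i:i + target]
        chunks.append({
            "text": " ".join(word for word, _ in window),
            "start": window[0][1],
            "word_count": len(window),
        })
    return chunks
```

For example, a 320-word transcript yields three chunks of 150, 150, and 20 words under this scheme.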

Documentation

Detailed documentation lives in the doc/ directory:

Full pipeline documentation — per-step details, entity types, recovery layer
How Scott works — RAG architecture and query flow
Telemetry (OpenTelemetry) — tracing setup, collectors (console, Jaeger, Langfuse)
Architecture diagrams — processing pipeline diagram
Feature documentation — per-feature docs with problem, changes, and verification
Plans — implementation plans
Session transcripts — planning and implementation session logs

Getting Started

Prerequisites

Python 3.13+
uv
Node.js (for the frontend dev server and build)
Docker (for PostgreSQL and Qdrant)
ffmpeg (for audio downsampling)
wget (for audio downloading)

Installation

git clone <repo-url>
cd ragtime
uv sync                           # Install dependencies

Optional dependency group:

Extra	Install command	Description
`langfuse`	`uv sync --extra langfuse`	Langfuse collector for telemetry

Configuration

Launch the interactive setup wizard for all RAGTIME_* env vars:

uv run python manage.py configure

Alternatively, copy .env.sample to .env and fill in your values.

The service variables are read by docker-compose.yml when the containers start, so the values you set here flow straight through:

RAGTIME_DB_NAME, RAGTIME_DB_USER, RAGTIME_DB_PASSWORD, RAGTIME_DB_PORT → Postgres (defaults: ragtime / port 5432).
RAGTIME_QDRANT_PORT → Qdrant published HTTP port (default: 6333).

Defaults are used if the variables are unset, so a fresh clone runs with zero configuration.
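
The fallback behaviour can be mirrored on the Python side with a few lines. This is a sketch, not the project's settings code: `service_setting` and `DEFAULTS` are hypothetical names, only the two port defaults stated above are included, and treating an empty value like an unset one assumes the compose file uses the `${VAR:-default}` form of interpolation.

```python
import os

# Only the defaults documented above; other variables are left out on purpose.
DEFAULTS = {
    "RAGTIME_DB_PORT": "5432",
    "RAGTIME_QDRANT_PORT": "6333",
}


def service_setting(name, environ=None):
    """Return the configured value for `name`, or its documented default.

    An empty string counts as unset, matching ${VAR:-default} semantics.
    Raises KeyError when the variable is unset and has no known default.
    """
    env = os.environ if environ is None else environ
    value = env.get(name, "")
    return value or DEFAULTS[name]
```

Passing `environ` explicitly keeps the function easy to test without touching the real process environment.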

Running the services

Start PostgreSQL and Qdrant, apply migrations, create an admin account, and seed the entity types:

docker compose up -d                      # Start PostgreSQL and Qdrant (both read ports/creds from .env)
uv run python manage.py migrate
uv run python manage.py createsuperuser   # Create an admin user for the Django admin UI
uv run python manage.py load_entity_types # Seed initial entity types

Application server (ASGI)

uv run uvicorn ragtime.asgi:application --host 127.0.0.1 --port 8000

The application runs under ASGI via Uvicorn. This is required because Scott's chat endpoint (/chat/agent/) uses HTTP+SSE streaming through an ASGI sub-app mounted in ragtime/asgi.py. All other routes (admin, episodes, pages) are served by the same process through Django's standard ASGI handler.

Note: manage.py runserver still works for non-Scott development (admin, episodes, ingestion pipeline) but does not load the ASGI dispatcher, so the chat endpoint will not function.
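
For readers unfamiliar with ASGI dispatching, the routing described above can be sketched as follows. This is not the content of ragtime/asgi.py: `make_dispatcher`, `make_stub`, and `call` are hypothetical helpers, and the stub apps stand in for the chat sub-app and Django's ASGI handler. The sketch only shows the general technique of choosing an application by path prefix.

```python
import asyncio


def make_dispatcher(chat_app, django_app, prefix="/chat/agent/"):
    """Return an ASGI app that sends chat requests to chat_app, the rest to django_app."""
    async def dispatcher(scope, receive, send):
        # Non-HTTP scopes (for example lifespan) carry no path, so check the type first.
        if scope["type"] == "http" and scope["path"].startswith(prefix):
            await chat_app(scope, receive, send)
        else:
            await django_app(scope, receive, send)
    return dispatcher


def make_stub(name):
    """Build a tiny ASGI app that answers every request with its own name."""
    async def app(scope, receive, send):
        await send({"type": "http.response.start", "status": 200, "headers": []})
        await send({"type": "http.response.body", "body": name.encode()})
    return app


async def call(app, path):
    """Drive an ASGI app with a minimal HTTP scope and return the response body."""
    sent = []

    async def receive():
        return {"type": "http.request", "body": b"", "more_body": False}

    async def send(message):
        sent.append(message)

    await app({"type": "http", "path": path}, receive, send)
    return sent[-1]["body"]
```

With `dispatcher = make_dispatcher(make_stub("chat"), make_stub("django"))`, a request for `/chat/agent/` reaches the chat stub and a request for `/admin/` reaches the Django stub. This also suggests why `runserver` cannot serve the chat endpoint: without the dispatcher, nothing routes that path to the sub-app.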

Frontend dev server (Vite)

cd frontend
npm install   # First time only
npm run dev   # Vite dev server with HMR on port 5173

The Scott chat UI is a React application (assistant-ui + AG-UI) built with Vite. During development, Vite serves the frontend with hot module replacement. In production, run npm run build and the compiled assets are served by Django via django-vite.

The frontend communicates with the ASGI server over HTTP+SSE (AG-UI protocol), so both the Uvicorn server and the Vite dev server must be running to develop the chat UI.
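
The SSE half of that transport is simple enough to sketch. The framing below follows the standard server-sent events format (a `data:` line followed by a blank line); the event payloads, including the `text_delta` type, are made up for illustration and are not the AG-UI event schema. `encode_sse` and `decode_sse` are hypothetical helper names.

```python
import json


def encode_sse(event: dict) -> str:
    """Serialize one event as a server-sent events frame."""
    # json.dumps emits no raw newlines by default, so one data line is enough.
    return f"data: {json.dumps(event)}\n\n"


def decode_sse(stream: str) -> list[dict]:
    """Parse a string of SSE frames back into a list of JSON events."""
    events = []
    for frame in stream.split("\n\n"):
        data_lines = []
        for line in frame.split("\n"):
            if line.startswith("data:"):
                value = line[len("data:"):]
                # The SSE format strips a single leading space after the colon.
                data_lines.append(value[1:] if value.startswith(" ") else value)
        if data_lines:
            events.append(json.loads("\n".join(data_lines)))
    return events
```

Because each frame is self-delimiting, the browser can render tokens as they arrive instead of waiting for the whole answer, which is how the chat UI streams responses.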

Telemetry (optional)

RAGtime uses OpenTelemetry to trace pipeline steps and LLM calls. The quickest local setup is Jaeger:

docker run -d --name jaeger -p 4318:4318 -p 16686:16686 jaegertracing/all-in-one:latest

Then set RAGTIME_OTEL_COLLECTORS=jaeger in .env. Traces are viewable at http://localhost:16686. See Telemetry (OpenTelemetry) for all collector options (console, Jaeger, Langfuse).

Resetting the database

To drop all data and start fresh:

uv run python manage.py dbreset            # Drop PostgreSQL DB (incl. DBOS tables) + Qdrant collection
uv run python manage.py migrate            # Recreate tables
uv run python manage.py load_entity_types  # Seed entity types
uv run python manage.py createsuperuser    # Recreate the admin account (interactive)

Or non-interactively:

DJANGO_SUPERUSER_PASSWORD=admin uv run python manage.py createsuperuser --username admin --email admin@example.com --noinput

Tech Stack

Runtime: Python 3.13
Framework: Django 5.2
Database: PostgreSQL 17 (via Docker Compose)
Vector Store: Qdrant (via Docker Compose)
Durable Workflows: DBOS Transact (PostgreSQL-backed durable execution)
AI Agents: Pydantic AI (recovery agent)
Transcription: Configurable — Whisper API (default), local Whisper, etc.
LLM: Configurable — Claude (Anthropic), GPT (OpenAI), etc.
Embeddings: Configurable — must support multilingual models for cross-language retrieval
AI Evaluation: RAGAS + Langfuse
Frontend: React 19 + assistant-ui + Tailwind CSS 4 via Vite + django-vite (Scott chat UI); Django templates + HTMX (other pages)
Package Manager: uv

License

This project is licensed under the MIT License — see the LICENSE file for details.
