Factor Telemetry

Welcome to Factor Telemetry, a comprehensive repository of ready-to-use telemetry integrations and dashboard templates for monitoring Apache Kafka and Apache Flink environments. These configurations are designed to visualize the rich, Prometheus-compatible metrics emitted by Factor House products, including Kpow and Flex.

While observability is often associated with Grafana, this project is built to be platform-agnostic. The metrics exposed by Kpow and Flex can be seamlessly integrated into a wide variety of modern monitoring and alerting platforms, including Grafana, Datadog, New Relic, and more.

🗂️ Organization

Inside this repository, you will find platform-specific configuration files and templates. Currently, our Grafana JSON models are located within the grafana-templates directory. To keep things cleanly organized, these dashboards are further divided into dedicated subfolders based on the target product:

The kpow folder contains all of our Kafka-focused dashboards (covering environments, topics, consumer groups, and Kafka Connect).
The flex folder contains dashboards dedicated to Flink cluster monitoring.

As we expand support for other platforms like Datadog, additional directories will be added to help you quickly deploy world-class observability into your tool of choice.

📊 Dashboards Overview

Quality Gap of Raw JMX Data vs. High-Fidelity Metrics

The standard approach of routing raw Kafka JMX metrics into Prometheus often leaves teams with noisy dashboards and fragile alerts. Attempting to compute meaningful business metrics, like exact consumer lag or active throughput, from raw JMX offsets using PromQL is notoriously difficult.

These templates take a different approach. They are built on top of Kpow, which acts as a high-fidelity metrics engine. Instead of relying on JMX sidecars, Kpow directly observes your cluster and exposes pre-calculated, actionable metrics (such as exact group_offset_lag and topic_end_delta) ready for immediate visualization.

📖 Read the architectural deep-dive: Beyond JMX: Supercharging Grafana Dashboards with High-Fidelity Metrics

Dashboard Templates

1. Kafka Environment Health

Designed for Platform Teams, this dashboard provides a high-level macro view of overall cluster stability and capacity.

Rather than relying on raw byte counts, it surfaces derived operational health indicators. It tracks total online brokers, overall data on disk, total topics, and total consumer groups. It also visualizes cluster-wide production and consumption rates, and provides a detailed breakdown of topic activity and consumer group health (Stable, Rebalancing, Empty) to give you an instant read on the environment's status.

🔗 Quick Links:

📥 JSON Template: kafka-environment.json
🌐 Grafana Gallery: View and Import Dashboard (ID: 25103)

2. Kafka Topic Diagnostics

Designed for data engineers and platform administrators, this dashboard provides granular visibility into the data layer.

It tracks aggregate metrics like total topics, total replica disk usage, cluster-wide read/write throughput, and non-preferred leaders. Most importantly, it visualizes per-topic production and consumption rates over time, topic size growth, and isolates the exact topics experiencing consumer lag or Under Replicated Partitions (URPs) through detailed diagnostic tables.

🔗 Quick Links:

📥 JSON Template: kafka-topic.json
🌐 Grafana Gallery: View and Import Dashboard (ID: 25104)

3. Kafka Consumer Group Deep Dive

Designed for Application Teams, this dashboard focuses on micro-level Service Level Agreement (SLA) monitoring.

Instead of generic host metrics, it visualizes the exact state of your data consumption. Key metrics include precise total lag (group_offset_lag) and real-time consumption rates (group_offset_delta). It details total assigned members and hosts, and features a clear status table tracking the exact state of every consumer group to help engineers spot stalling applications before downstream users are impacted.

🔗 Quick Links:

📥 JSON Template: kafka-consumer-group.json
🌐 Grafana Gallery: View and Import Dashboard (ID: 25105)

4. Kafka Connect Operations

Data pipeline reliability depends heavily on integration health. This dashboard targets Kafka Connect deployments, replacing tedious API queries with instant visual feedback.

It tracks aggregate summary statistics alongside individual Connector and Task states. By mapping state labels directly to distinct visual alerts (RUNNING, PAUSED, FAILED, UNASSIGNED, UNREACHABLE), teams can immediately detect stalled integrations and isolate whether the failure exists at the connector or task level.

🔗 Quick Links:

📥 JSON Template: kafka-connect.json
🌐 Grafana Gallery: View and Import Dashboard (ID: 25106)

🚀 Getting Started with Grafana Cloud

These instructions illustrate how to wire up Grafana Cloud's agentless Metrics Endpoint integration to scrape Kpow directly, without needing to manage a local Prometheus instance.

📋 Prerequisites: Enable Kpow Telemetry

Before configuring Grafana Cloud, ensure that your Kpow instance is configured to expose its Prometheus metrics and that the endpoints are secured with Basic Authentication (which is strictly required by Grafana Cloud's agentless scraper).

🔗 Read the official guide: Enabling Kpow's Prometheus Integration

Step 1: Configure Metrics Endpoints (Scrape Jobs)

Grafana Cloud can scrape Kpow directly over the internet. You will need to create a scrape job for each of Kpow's metric endpoints.

Log in to your Grafana Cloud portal.
Navigate to Connections > Add new connection.
Search for and select Metrics Endpoint.
Click Add new scrape job and create three separate jobs using the following URLs:
- https://<your-kpow-domain>/metrics/v1 (replace with your Kpow domain)
- https://<your-kpow-domain>/offsets/v1
- https://<your-kpow-domain>/group-offsets/v1
❗ Authentication: The endpoints should be secured, which is strictly required by the Metrics Endpoint integration. You can select either Basic or Bearer (OAuth) authentication.
Click Test Connection and Save Scrape Job for each job. Grafana will immediately start polling these endpoints and storing the data in your built-in Prometheus database.

Step 2: Check Metrics are Flowing

Before importing the dashboards, verify that Grafana Cloud is successfully receiving the data:

In Grafana, go to the left-hand menu and click Explore (the compass icon).
Ensure your default Prometheus data source is selected in the top-left dropdown (usually named grafanacloud-<your-stack>-prom).
In the query bar, type a metric like topic_count or broker_count and run the query.
If you see a graph or data table populate, your connection is working perfectly!

Step 3: Create Dashboards from Templates

With the data flowing, you can now import the JSON templates provided in this repository.

Download the .json files from the grafana-templates/kpow directory in this repo.
In Grafana, navigate to Dashboards > New > Import.
Upload the .json file (or paste the raw JSON text into the provided box) and click Load.
At the bottom of the import options screen, you will be prompted to select a Prometheus data source. Select your Grafana Cloud Prometheus data source from the dropdown.
Click Import.

Your dashboard will instantly load and populate with live metrics! Repeat this process for the remaining dashboards.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
grafana-templates		grafana-templates
images		images
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Factor Telemetry

🗂️ Organization

📊 Dashboards Overview

Dashboard Templates

1. Kafka Environment Health

2. Kafka Topic Diagnostics

3. Kafka Consumer Group Deep Dive

4. Kafka Connect Operations

🚀 Getting Started with Grafana Cloud

📋 Prerequisites: Enable Kpow Telemetry

Step 1: Configure Metrics Endpoints (Scrape Jobs)

Step 2: Check Metrics are Flowing

Step 3: Create Dashboards from Templates

About

Uh oh!

Contributors

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Factor Telemetry

🗂️ Organization

📊 Dashboards Overview

Dashboard Templates

1. Kafka Environment Health

2. Kafka Topic Diagnostics

3. Kafka Consumer Group Deep Dive

4. Kafka Connect Operations

🚀 Getting Started with Grafana Cloud

📋 Prerequisites: Enable Kpow Telemetry

Step 1: Configure Metrics Endpoints (Scrape Jobs)

Step 2: Check Metrics are Flowing

Step 3: Create Dashboards from Templates

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Contributors

Uh oh!