Skip to main content

Prometheus metrics collector for FastAPI event loop lag and thread pool utilization

Project description

fastapi-runtime-details-collector

Prometheus metrics collector for FastAPI applications that exposes event loop health and thread pool utilization in real time.

Why

FastAPI runs on an asyncio event loop with a thread pool for synchronous route handlers. Neither is observable out of the box. This library adds two key signals:

  • Event loop lag — how long the loop was blocked before it could process the next iteration. Sustained lag means CPU-bound or blocking work is starving your async handlers.
  • Thread pool saturation — how many of anyio's sync-handler threads are currently active versus the total capacity.

Installation

pip install fastapi-runtime-details-collector

Requires Python 3.10+ and a FastAPI application already using prometheus_client to expose a /metrics endpoint.

Metrics

Metric Type Description
fastapi_eventloop_lag_seconds Gauge Drift between a scheduled asyncio.sleep and its actual wake-up, in seconds
fastapi_eventloop_tasks_total Gauge Number of asyncio tasks currently pending in the event loop
fastapi_threadpool_capacity_tokens Gauge Total token capacity of anyio's default thread limiter (max concurrent sync handlers)
fastapi_threadpool_active_threads Gauge Number of sync handler threads currently executing (anyio borrowed tokens)
fastapi_metrics_last_collection_timestamp_seconds Gauge Unix timestamp of the last successful metrics collection

Metrics are collected every second by a background task. No metrics are exposed until the first successful collection.

Setup

1. Expose a /metrics endpoint

If you don't already have one:

pip install prometheus-client
from prometheus_client import make_asgi_app

metrics_app = make_asgi_app()
app.mount("/metrics", metrics_app)

2. Register the collector in your lifespan

from contextlib import asynccontextmanager
from fastapi import FastAPI
from fastapi_runtime_details_collector import FastAPIRuntimeCollector, setup_lag_monitor

@asynccontextmanager
async def lifespan(app: FastAPI):
    collector = FastAPIRuntimeCollector()
    task = setup_lag_monitor(collector)
    yield
    task.cancel()

app = FastAPI(lifespan=lifespan)

setup_lag_monitor must be called from within a running anyio event loop, so the FastAPI lifespan startup is the right place.

Custom registry

If you use a custom Prometheus registry instead of the global default:

from prometheus_client import CollectorRegistry
from prometheus_client import make_asgi_app
from fastapi_runtime_details_collector import FastAPIRuntimeCollector, setup_lag_monitor

registry = CollectorRegistry()

@asynccontextmanager
async def lifespan(app: FastAPI):
    collector = FastAPIRuntimeCollector(registry=registry)
    task = setup_lag_monitor(collector)
    yield
    task.cancel()

metrics_app = make_asgi_app(registry=registry)
app.mount("/metrics", metrics_app)

Full example

from contextlib import asynccontextmanager
from fastapi import FastAPI
from prometheus_client import make_asgi_app
from fastapi_runtime_details_collector import FastAPIRuntimeCollector, setup_lag_monitor


@asynccontextmanager
async def lifespan(app: FastAPI):
    collector = FastAPIRuntimeCollector()
    task = setup_lag_monitor(collector)
    yield
    task.cancel()


app = FastAPI(lifespan=lifespan)
app.mount("/metrics", make_asgi_app())


@app.get("/")
async def root():
    return {"status": "ok"}

Run with uvicorn main:app and scrape http://localhost:8000/metrics.

Grafana dashboard

A ready-to-import dashboard is available in grafana/fastapi-runtime-details.json.

Panels:

  • Event loop lag, pending tasks, active threads, seconds since last collection (stat panels with color thresholds)
  • Event loop lag over time (time series)
  • Pending asyncio tasks over time (time series)
  • Thread pool active vs capacity over time (time series)
  • Thread pool saturation % (gauge)

Import steps:

  1. In Grafana, go to Dashboards → Import
  2. Click Upload dashboard JSON file and select grafana/fastapi-runtime-details.json
  3. Select your Prometheus data source
  4. Pick the job label that matches your app's scrape config

The dashboard auto-refreshes every 10 seconds and exposes a $job variable to filter by service.

Alerting

Example Prometheus alert rules are provided in prometheus/alerts.yml.

Alert Severity Condition For
FastAPIEventLoopLagHigh warning lag > 50 ms 1 min
FastAPIEventLoopLagCritical critical lag > 200 ms 30 s
FastAPIThreadPoolSaturationHigh warning active / capacity > 80 % 2 min
FastAPIThreadPoolSaturationCritical critical active / capacity > 95 % 1 min

Threshold rationale:

  • 50 ms lag is the point where human-perceptible async latency degradation begins; 200 ms indicates serious blocking.
  • 80 % thread pool utilisation leaves a safety margin for bursts; 95 % means the pool is effectively full and requests will queue for a thread.

Load the rules into Prometheus by adding the file path under rule_files in your prometheus.yml:

rule_files:
  - prometheus/alerts.yml

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

fastapi_runtime_details_collector-1.0.1.tar.gz (9.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

File details

Details for the file fastapi_runtime_details_collector-1.0.1.tar.gz.

File metadata

File hashes

Hashes for fastapi_runtime_details_collector-1.0.1.tar.gz
Algorithm Hash digest
SHA256 9226bbb976e91ee02f2570228b292349745af16e8ba948087077dea639535e36
MD5 07fdd56139a810fb1050daec6735ca39
BLAKE2b-256 821e26c85b242d9d7090dd05516459768d4aae5a75bc395e2e9f33b5572334f9

See more details on using hashes here.

Provenance

The following attestation bundles were made for fastapi_runtime_details_collector-1.0.1.tar.gz:

Publisher: publish-pypi.yml on SIB-rennes/fastapi_runtime_details_collector

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file fastapi_runtime_details_collector-1.0.1-py3-none-any.whl.

File metadata

File hashes

Hashes for fastapi_runtime_details_collector-1.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 e0a5050230b847e702f325f5289f2883be42f6b49a111514c2fac8e2b14b78f0
MD5 dd3590f0c5e62463a3ef9c936483db1a
BLAKE2b-256 aad5574bb497e0784b17374fa835cd913495be95c7f57b53347c7ee989ad67ed

See more details on using hashes here.

Provenance

The following attestation bundles were made for fastapi_runtime_details_collector-1.0.1-py3-none-any.whl:

Publisher: publish-pypi.yml on SIB-rennes/fastapi_runtime_details_collector

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page