Prometheus metrics collector for FastAPI event loop lag and thread pool utilization
Project description
fastapi-runtime-details-collector
Prometheus metrics collector for FastAPI applications that exposes event loop health and thread pool utilization in real time.
Why
FastAPI runs on an asyncio event loop with a thread pool for synchronous route handlers. Neither is observable out of the box. This library adds two key signals:
- Event loop lag — how long the loop was blocked before it could process the next iteration. Sustained lag means CPU-bound or blocking work is starving your async handlers.
- Thread pool saturation — how many of anyio's sync-handler threads are currently active versus the total capacity.
Installation
pip install fastapi-runtime-details-collector
Requires Python 3.10+ and a FastAPI application already using prometheus_client to expose a /metrics endpoint.
Metrics
| Metric | Type | Description |
|---|---|---|
fastapi_eventloop_lag_seconds |
Gauge | Drift between a scheduled asyncio.sleep and its actual wake-up, in seconds |
fastapi_eventloop_tasks_total |
Gauge | Number of asyncio tasks currently pending in the event loop |
fastapi_threadpool_capacity_tokens |
Gauge | Total token capacity of anyio's default thread limiter (max concurrent sync handlers) |
fastapi_threadpool_active_threads |
Gauge | Number of sync handler threads currently executing (anyio borrowed tokens) |
fastapi_metrics_last_collection_timestamp_seconds |
Gauge | Unix timestamp of the last successful metrics collection |
Metrics are collected every second by a background task. No metrics are exposed until the first successful collection.
Setup
1. Expose a /metrics endpoint
If you don't already have one:
pip install prometheus-client
from prometheus_client import make_asgi_app
metrics_app = make_asgi_app()
app.mount("/metrics", metrics_app)
2. Register the collector in your lifespan
from contextlib import asynccontextmanager
from fastapi import FastAPI
from fastapi_runtime_details_collector import FastAPIRuntimeCollector, setup_lag_monitor
@asynccontextmanager
async def lifespan(app: FastAPI):
collector = FastAPIRuntimeCollector()
task = setup_lag_monitor(collector)
yield
task.cancel()
app = FastAPI(lifespan=lifespan)
setup_lag_monitor must be called from within a running anyio event loop, so the FastAPI lifespan startup is the right place.
Custom registry
If you use a custom Prometheus registry instead of the global default:
from prometheus_client import CollectorRegistry
from prometheus_client import make_asgi_app
from fastapi_runtime_details_collector import FastAPIRuntimeCollector, setup_lag_monitor
registry = CollectorRegistry()
@asynccontextmanager
async def lifespan(app: FastAPI):
collector = FastAPIRuntimeCollector(registry=registry)
task = setup_lag_monitor(collector)
yield
task.cancel()
metrics_app = make_asgi_app(registry=registry)
app.mount("/metrics", metrics_app)
Full example
from contextlib import asynccontextmanager
from fastapi import FastAPI
from prometheus_client import make_asgi_app
from fastapi_runtime_details_collector import FastAPIRuntimeCollector, setup_lag_monitor
@asynccontextmanager
async def lifespan(app: FastAPI):
collector = FastAPIRuntimeCollector()
task = setup_lag_monitor(collector)
yield
task.cancel()
app = FastAPI(lifespan=lifespan)
app.mount("/metrics", make_asgi_app())
@app.get("/")
async def root():
return {"status": "ok"}
Run with uvicorn main:app and scrape http://localhost:8000/metrics.
Grafana dashboard
A ready-to-import dashboard is available in grafana/fastapi-runtime-details.json.
Panels:
- Event loop lag, pending tasks, active threads, seconds since last collection (stat panels with color thresholds)
- Event loop lag over time (time series)
- Pending asyncio tasks over time (time series)
- Thread pool active vs capacity over time (time series)
- Thread pool saturation % (gauge)
Import steps:
- In Grafana, go to Dashboards → Import
- Click Upload dashboard JSON file and select
grafana/fastapi-runtime-details.json - Select your Prometheus data source
- Pick the
joblabel that matches your app's scrape config
The dashboard auto-refreshes every 10 seconds and exposes a $job variable to filter by service.
Alerting
Example Prometheus alert rules are provided in prometheus/alerts.yml.
| Alert | Severity | Condition | For |
|---|---|---|---|
FastAPIEventLoopLagHigh |
warning | lag > 50 ms | 1 min |
FastAPIEventLoopLagCritical |
critical | lag > 200 ms | 30 s |
FastAPIThreadPoolSaturationHigh |
warning | active / capacity > 80 % | 2 min |
FastAPIThreadPoolSaturationCritical |
critical | active / capacity > 95 % | 1 min |
Threshold rationale:
- 50 ms lag is the point where human-perceptible async latency degradation begins; 200 ms indicates serious blocking.
- 80 % thread pool utilisation leaves a safety margin for bursts; 95 % means the pool is effectively full and requests will queue for a thread.
Load the rules into Prometheus by adding the file path under rule_files in your prometheus.yml:
rule_files:
- prometheus/alerts.yml
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file fastapi_runtime_details_collector-1.0.1.tar.gz.
File metadata
- Download URL: fastapi_runtime_details_collector-1.0.1.tar.gz
- Upload date:
- Size: 9.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
9226bbb976e91ee02f2570228b292349745af16e8ba948087077dea639535e36
|
|
| MD5 |
07fdd56139a810fb1050daec6735ca39
|
|
| BLAKE2b-256 |
821e26c85b242d9d7090dd05516459768d4aae5a75bc395e2e9f33b5572334f9
|
Provenance
The following attestation bundles were made for fastapi_runtime_details_collector-1.0.1.tar.gz:
Publisher:
publish-pypi.yml on SIB-rennes/fastapi_runtime_details_collector
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
fastapi_runtime_details_collector-1.0.1.tar.gz -
Subject digest:
9226bbb976e91ee02f2570228b292349745af16e8ba948087077dea639535e36 - Sigstore transparency entry: 1703944557
- Sigstore integration time:
-
Permalink:
SIB-rennes/fastapi_runtime_details_collector@8a5e97515cf90e74b8fb5c8581491c4fa3d2ca5e -
Branch / Tag:
refs/tags/v1.0.1 - Owner: https://github.com/SIB-rennes
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish-pypi.yml@8a5e97515cf90e74b8fb5c8581491c4fa3d2ca5e -
Trigger Event:
push
-
Statement type:
File details
Details for the file fastapi_runtime_details_collector-1.0.1-py3-none-any.whl.
File metadata
- Download URL: fastapi_runtime_details_collector-1.0.1-py3-none-any.whl
- Upload date:
- Size: 5.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
e0a5050230b847e702f325f5289f2883be42f6b49a111514c2fac8e2b14b78f0
|
|
| MD5 |
dd3590f0c5e62463a3ef9c936483db1a
|
|
| BLAKE2b-256 |
aad5574bb497e0784b17374fa835cd913495be95c7f57b53347c7ee989ad67ed
|
Provenance
The following attestation bundles were made for fastapi_runtime_details_collector-1.0.1-py3-none-any.whl:
Publisher:
publish-pypi.yml on SIB-rennes/fastapi_runtime_details_collector
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
fastapi_runtime_details_collector-1.0.1-py3-none-any.whl -
Subject digest:
e0a5050230b847e702f325f5289f2883be42f6b49a111514c2fac8e2b14b78f0 - Sigstore transparency entry: 1703944587
- Sigstore integration time:
-
Permalink:
SIB-rennes/fastapi_runtime_details_collector@8a5e97515cf90e74b8fb5c8581491c4fa3d2ca5e -
Branch / Tag:
refs/tags/v1.0.1 - Owner: https://github.com/SIB-rennes
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish-pypi.yml@8a5e97515cf90e74b8fb5c8581491c4fa3d2ca5e -
Trigger Event:
push
-
Statement type: