Ingestion gateway: WebSocket + presigned blob URLs, publishes to Kafka
Project description
Matyan Frontier
Ingestion gateway between training clients and the rest of the Matyan stack. Clients connect via WebSocket for metrics, params, and logs, and via REST for presigned URLs (S3, GCS, or Azure Private SAS); the frontier publishes to Kafka. The UI and backend do not talk to the frontier. Part of the Matyan experiment-tracking stack (fork of Aim).
Layout
src/matyan_frontier/— Python package: FastAPI app, WebSocket handler, REST presign endpoint, Kafka producer, config, health, metrics.- Entrypoints:
app.py(lifespan: start Kafka producer, create blob storage clients, ensure bucket; shutdown: flush producer, close clients). - Routes:
GET /api/v1/ws/runs/{run_id}(WebSocket),POST /api/v1/rest/artifacts/presign,GET /health/ready/,GET /health/live/,GET /metrics/(Prometheus).
Prerequisites
- Python 3.12+. The package uses
uvin the repo:uv run matyan-frontieror install thenmatyan-frontierCLI. - Runtime dependencies: Kafka (bootstrap reachable) and an blob store (e.g. RustFS in dev, AWS S3/GCS/Azure Blob Store in prod). The smoke test and local dev assume Kafka and S3 are up (e.g. via docker-compose).
Run (production-like)
From the frontier package directory: uv run matyan-frontier start (or matyan-frontier start if installed).
Options: --host, --port (defaults: 0.0.0.0, 53801). The CLI uses these option defaults; config also defines host/port for other entry points.
Configuration (environment variables)
| Variable | Default | Purpose |
|---|---|---|
MATYAN_ENVIRONMENT / ENVIRONMENT |
development |
When production, S3/Kafka must be non-dev (validated at startup). |
LOG_LEVEL |
INFO |
Log level (loguru + uvicorn). |
HOST |
0.0.0.0 |
Bind address. |
PORT |
53801 |
Bind port. |
KAFKA_BOOTSTRAP_SERVERS |
localhost:9092 |
Kafka broker list. |
KAFKA_DATA_INGESTION_TOPIC |
data-ingestion |
Topic for ingestion messages. |
KAFKA_SECURITY_PROTOCOL / KAFKA_SASL_* |
(empty) | Optional Kafka SASL. |
BLOB_BACKEND_TYPE |
s3 |
Storage backend: s3, gcs, or azure. |
S3_ENDPOINT |
http://localhost:9000 |
S3 API endpoint (e.g. RustFS). |
S3_PUBLIC_ENDPOINT |
"" |
Optional; used for presigned URLs if different from S3_ENDPOINT. |
S3_ACCESS_KEY / S3_SECRET_KEY |
(dev defaults) | S3 credentials. |
S3_BUCKET |
matyan-artifacts |
Bucket for artifacts (S3). |
S3_REGION |
us-east-1 |
S3 region (default: us-east-1). |
GCS_BUCKET |
matyan-artifacts |
Bucket for artifacts (GCS). |
AZURE_CONTAINER |
matyan-artifacts |
Container for artifacts (Azure). |
AZURE_CONN_STR |
"" |
Azure connection string. |
AZURE_ACCOUNT_URL |
"" |
Azure account URL (for SAS user delegation key). |
S3_PRESIGN_EXPIRY |
3600 |
URL/SAS expiry (seconds). |
SHUTDOWN_FLUSH_TIMEOUT |
5.0 |
Seconds to wait for Kafka flush on shutdown. |
METRICS_ENABLED |
true |
Expose Prometheus /metrics/. |
CORS_ORIGINS |
(localhost list) | Allowed origins (comma-separated or repeated). |
Source of truth: config.py.
Development and smoke test
- Development: Run Kafka and S3 (e.g.
docker compose up kafka kafka-init rustfsor equivalent), thenuv run matyan-frontier start. Clients (e.g. matyan-client) point at the frontier URL for WebSocket and presign. - Smoke test: From
extra/matyan-frontier, runuv run python scripts/smoke_test.py. Prerequisites: Kafka, S3, and frontier running. The script covers WebSocket (create_run, log_metric, log_hparams, finish_run, etc.) and REST presign, and optionally verifies Kafka consumption.
See scripts/smoke_test.py for the exact command and what it checks.
Deployment
- Docker: Use Dockerfile.dev or Dockerfile.prod (context from repo root as needed; align with how the repo builds frontier images).
The UI talks only to the backend, not to the frontier; the frontier is for training clients.
Related
- Backend: matyan-backend serves the REST API and consumes from Kafka (ingestion + control workers); it does not receive client traffic from the frontier directly.
- Client: matyan-client sends tracking data to the frontier (WebSocket + presign REST).
- Monorepo: This package lives under
extra/matyan-frontierin the matyan-core repo.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file matyan_frontier-0.3.0.tar.gz.
File metadata
- Download URL: matyan_frontier-0.3.0.tar.gz
- Upload date:
- Size: 149.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
f85c2c6b25d8f2f2049debd712e9c3dc327485b2ca6ca13f5ad10826b389eecd
|
|
| MD5 |
16c7b12c7a48a15938b5e36406f82e1d
|
|
| BLAKE2b-256 |
d0c12206f0cabe27463928f5aeb4ebb6f24158599d1084124c2eec8227a8189d
|
Provenance
The following attestation bundles were made for matyan_frontier-0.3.0.tar.gz:
Publisher:
release.yml on 4gt-104/matyan-frontier
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
matyan_frontier-0.3.0.tar.gz -
Subject digest:
f85c2c6b25d8f2f2049debd712e9c3dc327485b2ca6ca13f5ad10826b389eecd - Sigstore transparency entry: 1191481068
- Sigstore integration time:
-
Permalink:
4gt-104/matyan-frontier@21bd36b03eb5c600346ed3eb3010b5d78db5191e -
Branch / Tag:
refs/tags/0.3.0 - Owner: https://github.com/4gt-104
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yml@21bd36b03eb5c600346ed3eb3010b5d78db5191e -
Trigger Event:
push
-
Statement type:
File details
Details for the file matyan_frontier-0.3.0-py3-none-any.whl.
File metadata
- Download URL: matyan_frontier-0.3.0-py3-none-any.whl
- Upload date:
- Size: 22.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d7cdf50b7d9c2825aa4f12a0b756f9b29cecd1173130c4237b5a05713acf92e1
|
|
| MD5 |
33f21f7c044608437a0a8cf38b6af85d
|
|
| BLAKE2b-256 |
129be29b5d5ddcdedf9e10da9b2c23bf298dd97ed634f591a39b8af5fa7e3116
|
Provenance
The following attestation bundles were made for matyan_frontier-0.3.0-py3-none-any.whl:
Publisher:
release.yml on 4gt-104/matyan-frontier
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
matyan_frontier-0.3.0-py3-none-any.whl -
Subject digest:
d7cdf50b7d9c2825aa4f12a0b756f9b29cecd1173130c4237b5a05713acf92e1 - Sigstore transparency entry: 1191481071
- Sigstore integration time:
-
Permalink:
4gt-104/matyan-frontier@21bd36b03eb5c600346ed3eb3010b5d78db5191e -
Branch / Tag:
refs/tags/0.3.0 - Owner: https://github.com/4gt-104
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yml@21bd36b03eb5c600346ed3eb3010b5d78db5191e -
Trigger Event:
push
-
Statement type: