Skip to main content

Lakehouse platform

Project description

Phlo

Modern data lakehouse platform. Plugin-driven. Storage-agnostic.

CI PyPI Python 3.11+

Features

  • Decorator-driven development@phlo.ingestion and @phlo.quality replace hundreds of lines of boilerplate
  • Write-Audit-Publish pattern — Git-like branching with automatic quality gates and promotion
  • Type-safe data quality — Pandera schemas enforce validation at ingestion time
  • Plugin architecture — 12 plugin types: sources, quality, ingestion, transforms, services, hooks, catalogs, assets, resources, orchestrators, and CLI commands
  • Storage-agnostic — Iceberg, Delta, or bring-your-own via table-format plugins
  • Observatory UI — Web-based data exploration, lineage, and monitoring
  • Observability — OpenTelemetry traces, metrics, and logs via phlo-otel; Grafana/Prometheus/Loki stack
  • Production-ready — Auto-publishing, configurable merge strategies, freshness policies, data migrations

What It Looks Like

import phlo

@phlo.ingestion(
    table_name="events",
    unique_key="id",
    validation_schema=EventSchema,
    group="api",
    cron="0 */1 * * *",
    freshness_hours=(1, 24),
)
def api_events(partition_date: str):
    return rest_api(...)  # Any DLT source


@phlo.quality(
    table="bronze.events",
    checks=[
        NullCheck(columns=["id", "timestamp"]),
        RangeCheck(column="value", min_value=0, max_value=100),
        UniqueCheck(columns=["id"]),
        FreshnessCheck(column="timestamp", max_age_hours=24),
    ],
)
def events_quality():
    pass

Prerequisites

  • uv — Python package manager
  • Docker — Container runtime

Quick Start

# Install with default plugins
uv pip install phlo[defaults]

# Initialize a new project
phlo init my-project
cd my-project

# Start services and materialize
phlo services start
phlo materialize --select "dlt_glucose_entries+"

Documentation

Full documentation at docs/index.md:

Development

uv pip install -e .    # Install Phlo in dev mode
make check             # Lint, format, typecheck, and test (parallel)

# Services
phlo services start    # Start infrastructure
phlo services stop     # Stop services
phlo services logs -f  # View logs

# Individual gates
uv run ruff check .    # Lint
uv run ruff format .   # Format
uv run ty check        # Typecheck
uv run pytest          # Test

Architecture

Phlo is a monorepo of composable packages — install only what you need:

Layer Packages
Orchestration phlo-dagster
Ingestion phlo-dlt
Quality phlo-pandera
Transforms phlo-dbt
Table formats phlo-iceberg, phlo-delta, phlo-clickhouse
Infrastructure phlo-traefik, phlo-postgres
Storage phlo-minio
Catalog phlo-nessie, phlo-openmetadata
Query phlo-trino
Observability phlo-otel, phlo-clickstack, phlo-grafana, phlo-prometheus, phlo-loki, phlo-alloy
UI phlo-observatory, phlo-pgweb, phlo-superset
API phlo-api, phlo-hasura, phlo-postgrest
Dev/Test phlo-testing

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

phlo-0.7.0.tar.gz (173.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

phlo-0.7.0-py3-none-any.whl (235.4 kB view details)

Uploaded Python 3

File details

Details for the file phlo-0.7.0.tar.gz.

File metadata

  • Download URL: phlo-0.7.0.tar.gz
  • Upload date:
  • Size: 173.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for phlo-0.7.0.tar.gz
Algorithm Hash digest
SHA256 08f6521e270cc1b5b71bfeb7fe8a017f12a8ace33eb9e278952b461f94bcb4bb
MD5 a5a1c42e97c536504f54b58c96108bc9
BLAKE2b-256 59f50697099ef574f359fd7bf8b318cc4359d252e28ede3cdaaeb518f8dc4ab0

See more details on using hashes here.

File details

Details for the file phlo-0.7.0-py3-none-any.whl.

File metadata

  • Download URL: phlo-0.7.0-py3-none-any.whl
  • Upload date:
  • Size: 235.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for phlo-0.7.0-py3-none-any.whl
Algorithm Hash digest
SHA256 6d8b734e714e2bcf48d9071d4db8fd8d58fdfbb9d5fc59acd0cf15b324a88c7f
MD5 8575066e51f9a921340d72ad313f517f
BLAKE2b-256 12cc8f4d0512ebd1876d3b194db583523bc03e8ae39190c53bd5fc3e4b2d9a8a

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page