Skip to main content

DataEngineX - Core framework for data engineering projects

Project description

dataenginex

dataenginex is the core DataEngineX framework package for building observable, production-ready data and API services.

It provides:

  • FastAPI application primitives and API extensions
  • Middleware for structured logging, metrics, and tracing
  • Data quality and validation utilities
  • Lakehouse and warehouse building blocks (S3, GCS, BigQuery, Parquet)
  • Reusable ML support modules for model-serving workflows

Install

# Core (no web framework dependencies)
pip install dataenginex

# With FastAPI, middleware, auth, health checks
pip install dataenginex[api]

# With cloud storage backends
pip install dataenginex[s3]        # AWS S3 via boto3
pip install dataenginex[gcs]       # Google Cloud Storage
pip install dataenginex[bq]        # Google BigQuery
pip install dataenginex[cloud]     # All cloud storage (S3 + GCS)

# Everything
pip install dataenginex[all]

Package Scope

dataenginex is the core library from the DEX monorepo. It is the only published package — applications and examples are built on top of it.

Submodules

Module Requires Extra Description
dataenginex.core Medallion architecture, schemas, quality gates, validators
dataenginex.data Schema registry, data contracts, catalog
dataenginex.lakehouse optional [s3] [gcs] [bq] Storage backends (JSON, Parquet, S3, GCS, BigQuery), catalog, partitioning
dataenginex.warehouse Warehouse layers, lineage tracking
dataenginex.ml Model registry, vectorstore, LLM adapters, drift detection
dataenginex.api [api] Auth (JWT), health checks, error handling, pagination, rate limiting
dataenginex.middleware [api] Structured logging, Prometheus metrics, OpenTelemetry tracing

Quick Usage

# Core — always available
from dataenginex.core import MedallionArchitecture, QualityGate
from dataenginex.data import SchemaRegistry
from dataenginex.ml import ModelRegistry

# API — requires pip install dataenginex[api]
from dataenginex.api import HealthChecker, AuthMiddleware, paginate
from dataenginex.middleware import configure_logging, configure_tracing

# Storage — requires the relevant extra
from dataenginex.lakehouse import JsonStorage, get_storage
storage = get_storage("file://./data")       # always works
storage = get_storage("s3://my-bucket")      # requires [s3]
storage = get_storage("gs://my-bucket")      # requires [gcs]
storage = get_storage("bq://my-project/ds")  # requires [bq]

Source and Docs

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dataenginex-0.8.2.tar.gz (410.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

dataenginex-0.8.2-py3-none-any.whl (97.5 kB view details)

Uploaded Python 3

File details

Details for the file dataenginex-0.8.2.tar.gz.

File metadata

  • Download URL: dataenginex-0.8.2.tar.gz
  • Upload date:
  • Size: 410.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for dataenginex-0.8.2.tar.gz
Algorithm Hash digest
SHA256 c46ce691e0cd02b255ef1d9023032a83ae4975713022f490c26c5e8d56096061
MD5 8869bf1088ed6d8cb3c16d1a2c2a9af0
BLAKE2b-256 b3176b4d6444e0ece185a34170823793402b323853cf94dbce3de69e84568aa9

See more details on using hashes here.

Provenance

The following attestation bundles were made for dataenginex-0.8.2.tar.gz:

Publisher: pypi-publish.yml on TheDataEngineX/dex

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file dataenginex-0.8.2-py3-none-any.whl.

File metadata

  • Download URL: dataenginex-0.8.2-py3-none-any.whl
  • Upload date:
  • Size: 97.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for dataenginex-0.8.2-py3-none-any.whl
Algorithm Hash digest
SHA256 2b6c78f1359f967c5d7a1b7e3fe8dc2b468b3b149559e1d2b0b97578c5d4e530
MD5 127af84e0f99bc4e5e957c57d57f7adc
BLAKE2b-256 890bfe883d3e7e5c90338eb6b8c7ffdfbab17584136dd42e5cab53c141085bbb

See more details on using hashes here.

Provenance

The following attestation bundles were made for dataenginex-0.8.2-py3-none-any.whl:

Publisher: pypi-publish.yml on TheDataEngineX/dex

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page