Skip to main content

A pandas extension built for OpenLineage

Project description

pandas-lineage

BEWARE: This project is in very early stages (as of 2022-09-12)

pandas-lineage is intended to extend the functionality of I/O and standard transform operations on a pandas dataframe to emit OpenLineage RunEvents. I am starting just with read/write operations emiting RunEvents with schema facets.

Badges:

python-package

Installation

pip install pandas-lineage

Development Documentation

Examples:

  • marquez-examples
    • contains getting started code and a script for running Marquez locally in Docker
  • mock-api-example
    • contains getting started code and a simple Flask API for sending lineage events to which will just always return a 200 status code

References:

Contributing:

Issues

I have not created any sort of contribution guide yet, but I don't want that to stop anyone! If you are interested in contributing, fork this repository and open a PR. As this becomes more feature-rich/useful, we will establish a contributors workflow. For now, please just use the pre-commit hooks.

Notes:

  • The pandas-lineage directory structure (for now) will mirror the directory structure of pandas for the components that it is extending.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pandas-lineage-0.0.2.tar.gz (8.3 kB view details)

Uploaded Source

Built Distribution

pandas_lineage-0.0.2-py3-none-any.whl (12.9 kB view details)

Uploaded Python 3

File details

Details for the file pandas-lineage-0.0.2.tar.gz.

File metadata

  • Download URL: pandas-lineage-0.0.2.tar.gz
  • Upload date:
  • Size: 8.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.9.14

File hashes

Hashes for pandas-lineage-0.0.2.tar.gz
Algorithm Hash digest
SHA256 27b361bd573eebbb07e48d4caa6eccb43568b770047224a649b6f4753d29be09
MD5 081a58b707cd49ed9d559fad620d135c
BLAKE2b-256 3413537c4ae0566187d15b816b25858697718e442ca9fac05f9023937321e5fa

See more details on using hashes here.

File details

Details for the file pandas_lineage-0.0.2-py3-none-any.whl.

File metadata

File hashes

Hashes for pandas_lineage-0.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 83abf8040ac574014ca8ebdee1fb5b01c7c4939dfd777b0a9eeb4fdceb964bdd
MD5 8dd8b4f1777ebf66577ee7e21b05a19a
BLAKE2b-256 4df39bc8cd9859562e13c55954548219f3bffe22e25aab051cfb2dc4c449ec65

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page