PySpark helper utilities for Spark data provenance
Project description
pyspark-data-provenance
Python package in this monorepo for enabling Spark data provenance features in PySpark jobs.
This project is managed with uv.
Quick Start
From the repository root:
cd pyspark-data-provenance
make init-uv-python
make init
make check
make test
make run
Project Layout
pyspark-data-provenance/
src/pyspark_data_provenance/
tests/
main.py
pyproject.toml
Makefile
Common Commands
make init- Create/refresh lock file and sync dependenciesmake update- Upgrade dependencies and sync environmentmake check- Run lint and type checksmake test- Run unit testsmake build- Build package wheelmake publish- Publish package (PyPI)
PyPI Package
The Python package is published on PyPI:
https://pypi.org/project/pyspark-data-provenance/
You can install it with pip:
pip install pyspark-data-provenance
Before publishing to production PyPI with make publish, set your token:
export UV_PUBLISH_TOKEN=<your-pypi-token>
For details, see:
https://docs.astral.sh/uv/guides/package/#publishing-your-package
CI/CD
Repository-level workflows are provided for:
- CI: lint, type-check, and tests on pushes/PRs affecting this package
- Publish: build and publish on release
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file pyspark_data_provenance-0.0.2-py3-none-any.whl.
File metadata
- Download URL: pyspark_data_provenance-0.0.2-py3-none-any.whl
- Upload date:
- Size: 2.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.10.9 {"installer":{"name":"uv","version":"0.10.9","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
942ab194ef3158de30b37489e2a5569dbdd4c3dd4ef01607bc2a92cafc31aac1
|
|
| MD5 |
606a29c19325c1726cb8d20e1429d76e
|
|
| BLAKE2b-256 |
45a3c9ce70eb7828e5bdf83a85988f98dafa428274095b316d98013903de7eb0
|