Skip to main content

A Python package that provides helpers for cleaning, deduplication, enrichment, etc. in Spark

Project description

Spark ETL Python

https://img.shields.io/pypi/v/spark_etl_python.svg https://img.shields.io/travis/flavianh/spark_etl_python.svg Documentation Status Updates

A Python package that provides helpers for cleaning, deduplication, enrichment, etc. in Spark

Features

  • TODO

Develop

In order to be able to develop on this package:

  1. Create a virtual environment
  2. Install pip-tools: pip install pip-tools
  3. Run pip-sync requirements_dev.txt requirements.txt

To update dependencies, add them to requirements.in (if they are needed to run the package) or requirements_dev.in. Then run pip-compile requirements.in or pip-compile requirements_dev.in.

Credits

This package was created with Cookiecutter and the audreyr/cookiecutter-pypackage project template.

History

0.1.0 (2018-10-19)

  • First release on PyPI.

Project details


Release history Release notifications

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Filename, size & hash SHA256 hash help File type Python version Upload date
spark_etl_python-0.1.5-py2.py3-none-any.whl (4.1 kB) Copy SHA256 hash SHA256 Wheel py2.py3
spark_etl_python-0.1.5.tar.gz (9.2 kB) Copy SHA256 hash SHA256 Source None

Supported by

Elastic Elastic Search Pingdom Pingdom Monitoring Google Google BigQuery Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN SignalFx SignalFx Supporter DigiCert DigiCert EV certificate StatusPage StatusPage Status page