Data Science oriented tools, mostly for Apache Spark
Project description
- A pipeline for using Python ML models together with Apache Spark
- Command-line tools (see readme)
- demo: usage demos in the form of Jupyter notebooks
  - model inference on a cluster: demo/score-sklearn.ipynb
  - quick detection of dataset distribution changes: demo/datadiff.ipynb
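To make the idea of "dataset distribution change detection" concrete, here is a minimal sketch of one common technique, the Population Stability Index (PSI), in plain Python. It is not taken from this package or from demo/datadiff.ipynb; the function name `population_stability_index` and its parameters are hypothetical and chosen only for illustration.

```python
import math
from typing import List, Sequence


def population_stability_index(
    reference: Sequence[float],
    current: Sequence[float],
    bins: int = 10,
    eps: float = 1e-6,
) -> float:
    """Compare two numeric samples with the Population Stability Index.

    Hypothetical helper, not part of spark-pipeline. Bin edges are
    equal-width and derived from the reference sample only.
    """
    if not reference or not current:
        raise ValueError("both samples must be non-empty")
    lo, hi = min(reference), max(reference)
    if hi == lo:
        raise ValueError("reference sample has zero range")
    width = (hi - lo) / bins

    def bin_shares(values: Sequence[float]) -> List[float]:
        counts = [0] * bins
        for v in values:
            # Values outside the reference range are clipped into the edge bins.
            idx = int((v - lo) / width)
            counts[min(max(idx, 0), bins - 1)] += 1
        # Floor empty bins at eps so the logarithm below stays defined.
        return [max(c / len(values), eps) for c in counts]

    ref_shares = bin_shares(reference)
    cur_shares = bin_shares(current)
    return sum(
        (c - r) * math.log(c / r) for r, c in zip(ref_shares, cur_shares)
    )


# Illustrative data only: a uniform grid and the same grid shifted by 3.
reference_sample = [i / 100 for i in range(1000)]
shifted_sample = [v + 3 for v in reference_sample]

psi_same = population_stability_index(reference_sample, reference_sample)
psi_shifted = population_stability_index(reference_sample, shifted_sample)
print(psi_same, psi_shifted)
```

A frequently quoted rule of thumb treats a PSI above roughly 0.25 as a substantial shift, but that threshold is a convention rather than something this project specifies. On a cluster, the per-bin counts would typically be computed with a Spark aggregation instead of a Python loop.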
Download files
Source Distribution
spark-pipeline-0.0.4.tar.gz (8.8 kB)
Built Distribution
Hashes for spark_pipeline-0.0.4-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | d8827c2e114bccc14ae3b556b3e40768ae140eecd90ff6e29d085853314263e1
MD5 | 623f4efc2a7ecfb13deb769e2965a319
BLAKE2b-256 | 86f71882fa4cd334ab1d6813986a3be04d962904feb1238a883bd2b45192369e
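The published digests can be used to check a downloaded file before installing it. The following is a minimal sketch using Python's standard `hashlib`; the helper names `sha256_of` and `matches_published` are hypothetical, and the local file path in the usage note is an assumption about where the wheel was saved.

```python
import hashlib


def sha256_of(path: str, chunk_size: int = 65536) -> str:
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published(path: str, expected_hex: str) -> bool:
    """Compare a file's SHA256 with a published hex digest (case-insensitive)."""
    return sha256_of(path) == expected_hex.strip().lower()
```

Assuming the wheel was saved in the current directory, the check would be `matches_published("spark_pipeline-0.0.4-py3-none-any.whl", "d8827c2e114bccc14ae3b556b3e40768ae140eecd90ff6e29d085853314263e1")`. Alternatively, pip can enforce digests itself when a requirements file pins them and `--require-hashes` is used.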