Machine learning data flow for reproducible data science
Project description
IDEAL HACKDUCK PROJECT
Run model from with a REST app (MLflow):
- save a github folder for each project
- can easely have predition on a bunch of data
FEATURES:
- seed for reproducibility
- map arguments to loop over a list
- mlflow integration (automatic logs parameters, can log metrics or artifacts)
- all prefect avantages
- handle subflows
- task bank to do basic operations
- unit test handle by ward
TODO:
- map over subflows ?
- create a script to run it with HackDuck file.yaml --argsname argvalue ...
- run it in a docker
- save version for all requirements (needed to rerun the flow)
- save python files inside mlruns/... and git them and save git commit
- being able to rerun a previous flow (save args and kwargs and output ref)
- put to prod thanks to travis CI that create the MLflow git repo
- generate examples for people to use
use it
from HackDuck import run_flow
config = yaml.load(open('/home/alex/awesome/HackDuck/iris/flows/iris_classif_with_sub.yaml', 'r'), Loader=yaml.FullLoader)
run_flow(config, {})
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
HackDuck-0.1.5.tar.gz
(3.3 kB
view details)
Built Distribution
File details
Details for the file HackDuck-0.1.5.tar.gz
.
File metadata
- Download URL: HackDuck-0.1.5.tar.gz
- Upload date:
- Size: 3.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.42.0 CPython/3.7.6
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 904e433d9db1265af498bb21a4b69dc438afaa561e6f349f4703a726b4d2dc86 |
|
MD5 | 95d1c6cb8a0e19782a2eab6610f8a0e4 |
|
BLAKE2b-256 | 10fb67e964b2757fdb821fa0bbbee4fbd38d49559ae96d948712c738eb27847e |
File details
Details for the file HackDuck-0.1.5-py3-none-any.whl
.
File metadata
- Download URL: HackDuck-0.1.5-py3-none-any.whl
- Upload date:
- Size: 4.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.42.0 CPython/3.7.6
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | d9cced7c5380bfe564593b9a08c514cac2962e1171ae1b63e014d236d9bb6be0 |
|
MD5 | 40bf466a55ef5abad4b9b4352fa8321d |
|
BLAKE2b-256 | e3062db161a39ccf0e8782833c8875b6fcb96169f0942620e2baf6a0c47e7c88 |