ML & data helper code!
Project description
ML & data helper code!
Jakub Langr (c) 2021
This is a CLI utility for speeding up basic Data Engineering/Data Science tasks
Installation
$ pip install data-toolkit
Development
This project includes a number of helpers in the Makefile
to streamline common development tasks.
Environment Setup
The following demonstrates setting up and working with a development environment:
### create a virtualenv for development
$ make virtualenv
$ source env/bin/activate
### run dt cli application
$ dt --help
### run pytest / coverage
$ make test
TODO: Central Data Repository
As a PM, I want to be able to quickly look at all the data we have on S3 with sample images.
* Needs a DB
* Needs a way of sampling
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
data-toolkit-0.8.0.tar.gz
(29.8 kB
view hashes)
Built Distribution
Close
Hashes for data_toolkit-0.8.0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 50f4eb6ec77bac50b61d9b13c1149afa55512ce4f0a2a25681ce24302a97a5f2 |
|
MD5 | 5869cddcd3fc0438c546b36acd755522 |
|
BLAKE2b-256 | 6ad1f9c8ecf31b4bb11eb8af2b0b69ecca975544512d2a6165d3f319261c3254 |