Skip to main content

ML & data helper code!

Project description

ML & data helper code!

Jakub Langr (c) 2021

This is a CLI utility for speeding up basic Data Engineering/Data Science tasks

Installation

$ pip install data-toolkit

Development

This project includes a number of helpers in the Makefile to streamline common development tasks.

Environment Setup

The following demonstrates setting up and working with a development environment:

### create a virtualenv for development

$ make virtualenv

$ source env/bin/activate


### run dt cli application

$ dt --help


### run pytest / coverage

$ make test

TODO: Central Data Repository

As a PM, I want to be able to quickly look at all the data we have on S3 with sample images.

* Needs a DB
* Needs a way of sampling

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for data-toolkit, version 0.10.4
Filename, size File type Python version Upload date Hashes
Filename, size data_toolkit-0.10.4-py3-none-any.whl (40.7 kB) File type Wheel Python version py3 Upload date Hashes View
Filename, size data-toolkit-0.10.4.tar.gz (32.1 kB) File type Source Python version None Upload date Hashes View

Supported by

AWS AWS Cloud computing Datadog Datadog Monitoring Facebook / Instagram Facebook / Instagram PSF Sponsor Fastly Fastly CDN Google Google Object Storage and Download Analytics Huawei Huawei PSF Sponsor Microsoft Microsoft PSF Sponsor NVIDIA NVIDIA PSF Sponsor Pingdom Pingdom Monitoring Salesforce Salesforce PSF Sponsor Sentry Sentry Error logging StatusPage StatusPage Status page