A docker tool that helps put machine learning in places that empower ml developers

Project description

MLDOCK

What is MLDOCK?

MLDOCK wraps the conveniences and power of Docker around the core machine learning tasks involved in taking models to production.

As a tool, this means MLDOCK's goals are:

  • Provide tooling to improve the ML development workflow. ✅
  • Enable portability of ML code between platforms and vendors (Sagemaker, AI Platform, Kubernetes, other container services). ✅
  • Lower the barrier to entry by developing containers from templates. ✅
  • Be ready out of the box, using templates to get you started quickly. Bring only your code. ✅
  • Work with any ML framework, in any orchestrator and on any cloud (as long as it integrates with Docker). ✅

What it is not:

  • Service orchestrator ❌
  • Training Scheduler ❌
  • Hyperparameter tuner ❌
  • Experiment Tracking ❌

Inspired by Sagify, Sagemaker Training Toolkit and Amazon Sagemaker.

Getting Started

Set up your environment

  1. (Optional) Use a virtual environment to manage dependencies.
  2. Install dotenv to easily configure your environment.
pip install --user python-dotenv[cli]

note: dotenv lets you configure the environment through a .env file. MLDOCK reads environment variables to find your DOCKER_HOST, DockerHub credentials and even AWS/GCP credentials.

  3. Create a .env file with the following:
# Keep only the DOCKER_HOST line that matches your platform.

# For Windows, and if you are using WSL1
DOCKER_HOST=tcp://127.0.0.1

# For WSL2 and Linux (this is the default and should work out of the box),
# but for consistency, set DOCKER_HOST explicitly
DOCKER_HOST=unix://var/run/docker.sock

note: To switch environments, run mldock through dotenv with the relevant .env file:

dotenv -f "/path/to/.env" run mldock local build --dir <my-project-path>
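
The environment switch above can be wrapped in a small shell helper. This is a minimal sketch, not part of MLDOCK: the function name mldock_with_env and the DRY_RUN variable are hypothetical, and it assumes the dotenv CLI installed earlier is on your PATH.

```shell
# Hypothetical helper (not part of MLDOCK): run an mldock command with a chosen .env file.
# Usage: mldock_with_env /path/to/.env local build --dir <my-project-path>
mldock_with_env() {
    env_file="$1"
    shift
    # Fail early with a clear message if the .env file is missing.
    if [ ! -f "$env_file" ]; then
        echo "env file not found: $env_file" >&2
        return 1
    fi
    if [ "${DRY_RUN:-0}" = "1" ]; then
        # Print the command instead of running it (handy for checking paths).
        echo "dotenv -f $env_file run mldock $*"
    else
        dotenv -f "$env_file" run mldock "$@"
    fi
}
```

Keeping one .env file per environment and passing its path to the helper makes the switch explicit on every call. Setting DRY_RUN=1 prints the command without running it.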

Overview of MLDOCK command line

The MLDOCK command line utility provides a set of commands to streamline the machine learning container image development process. The commands are grouped into three functional sets:

| Command Group | Description |
| --- | --- |
| container | A set of commands for creating, initializing and updating containers. Also provides commands for creating new MLDOCK-supported templates from previously built container images. |
| local | A set of commands to use during the development phase: creating your trainer and prediction scripts, and debugging their execution. |
| registry | A set of tools to help you push, pull and interact with image registries. |

Create your first container image project

  1. Install MLDOCK

pip is the only supported package manager at present. Using an environment manager is recommended; either virtualenv or conda will work.

pip install mldock[cli]
  2. Set up local config for the mldock CLI

This command sets up the mldock CLI with some nice-to-have defaults. It may also prompt you for some setup details.

mldock configure init
  3. Initialize or create your first container

You will see a series of prompts to set up the container.

mldock container init --dir my_ml_container

note:

  • Just hit Return/Enter to accept all the defaults.
  4. Build your container image locally
mldock local build --dir my_ml_container
  5. Run your training locally
mldock local train --dir my_ml_container
  6. Deploy your model locally
mldock local deploy --dir my_ml_container
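
The local build, train and deploy commands above can be chained so that a failure stops the run early. The following is a minimal sketch under stated assumptions: mldock_local_cycle and DRY_RUN are hypothetical names, not part of MLDOCK, and the only mldock commands used are the three shown above.

```shell
# Hypothetical helper (not part of MLDOCK): build, train and deploy in order,
# stopping at the first step that fails.
mldock_local_cycle() {
    project_dir="${1:-my_ml_container}"
    for step in build train deploy; do
        if [ "${DRY_RUN:-0}" = "1" ]; then
            # Print the command instead of running it.
            echo "mldock local $step --dir $project_dir"
        else
            mldock local "$step" --dir "$project_dir" || {
                echo "mldock local $step failed" >&2
                return 1
            }
        fi
    done
}
```

Calling mldock_local_cycle my_ml_container runs the three steps against the project created earlier. With DRY_RUN=1 it only prints the commands it would run.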

Putting your model in the cloud

Push to Dockerhub

  1. Add the following to .env
DOCKERHUB_USERNAME=<your/user/name>
DOCKERHUB_PASSWORD=<your/dockerhub/password>
DOCKERHUB_REGISTRY=https://index.docker.io/v1/
DOCKERHUB_REPO=<your/user/repo/name>
  2. Push your container to DockerHub
mldock registry push --dir my_ml_container --provider dockerhub --build

note: The flags let you change configuration directly in the command. --build builds the image before pushing; this is required initially, since the DockerHub registry will prefix your container name. --provider tells MLDOCK to authenticate to DockerHub and push the container there.
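
Since the push depends on the four DOCKERHUB_* variables listed above, a quick pre-flight check can catch a missing value before the build starts. This is a minimal sketch, assuming the variables are already present in the current shell; check_dockerhub_env is a hypothetical name and not part of MLDOCK.

```shell
# Hypothetical pre-flight check (not part of MLDOCK): confirm the DockerHub
# variables from the .env file are set before pushing.
check_dockerhub_env() {
    missing=0
    for name in DOCKERHUB_USERNAME DOCKERHUB_PASSWORD DOCKERHUB_REGISTRY DOCKERHUB_REPO; do
        # Indirect lookup of the variable called $name, portable across POSIX shells.
        eval "value=\${$name:-}"
        if [ -z "$value" ]; then
            echo "missing: $name" >&2
            missing=1
        fi
    done
    return "$missing"
}
```

In a shell where the variables are set, check_dockerhub_env && mldock registry push --dir my_ml_container --provider dockerhub --build only pushes when all four values are present.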

hint: In addition to DockerHub, both AWS ECR and GCP GCR are also supported.
