Skip to main content

A lightweight tool to perform reproducible machine learning experiment using Dask.

Project description

https://img.shields.io/pypi/v/daskperiment.svg Latest Docs https://travis-ci.org/sinhrks/daskperiment.svg?branch=master https://codecov.io/gh/sinhrks/daskperiment/branch/master/graph/badge.svg

Overview

daskperiment is a tool to perform reproducible machine learning experiment. It allows users to define and manage the history of trials (given parameters, results and execution environment).

The package is built on Dask, a package for parallel computing with task scheduling. Each experiment trial is internally expressed as Dask computation graph, and can be executed in parallel.

Benefits

  • Compatibility with standard Python/Jupyter environment (and optionally with standard KVS).

    • No need to set up server applications

    • No need to registrate on any cloud services

    • Run on standard / customized Python shells

  • Intuitive user interface

    • Few modifications on existing codes are needed

    • Trial histories are logged automatically (no need to write additional codes for logging)

    • Dask compatible API

    • Easily accessible experiments history (with pandas basic operations)

    • Less managiment works on Git (no need to make branch per trials)

    • (Experimental) Web dashboard to manage trial history

  • Traceability of experiment related information

    • Trial result and its (hyper) parameters.

    • Code contexts

    • Environment information

      • Device information

      • OS information

      • Python version

      • Installed Python packages and its version

      • Git information

  • Reproducibility

    • Check function purity (each step should return the same output for the same inputs)

    • Automatic random seeding

  • Auto saving and loading of previous experiment history

  • Parallel execution of experiment steps

  • Experiment sharing

    • Redis backend

    • MongoDB backend

Future Scope

  • More efficient execution.

    • Omit execution if depending parameters are the same

    • Distributed execution

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

daskperiment-0.5.0.tar.gz (45.7 kB view details)

Uploaded Source

Built Distribution

daskperiment-0.5.0-py3-none-any.whl (77.0 kB view details)

Uploaded Python 3

File details

Details for the file daskperiment-0.5.0.tar.gz.

File metadata

  • Download URL: daskperiment-0.5.0.tar.gz
  • Upload date:
  • Size: 45.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.6.3 requests-toolbelt/0.8.0 tqdm/4.19.5 CPython/3.6.8

File hashes

Hashes for daskperiment-0.5.0.tar.gz
Algorithm Hash digest
SHA256 dcdbb4181c397933c7912161d758fba00c099f0c320bf6a79b60425dbe5d4f6b
MD5 3e3b12f682eac6faca838923d30c1daf
BLAKE2b-256 cca23bf266f43f149c7b55e4a8005d784a9e3763e2613849d5b0d6183ada217d

See more details on using hashes here.

File details

Details for the file daskperiment-0.5.0-py3-none-any.whl.

File metadata

  • Download URL: daskperiment-0.5.0-py3-none-any.whl
  • Upload date:
  • Size: 77.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.6.3 requests-toolbelt/0.8.0 tqdm/4.19.5 CPython/3.6.8

File hashes

Hashes for daskperiment-0.5.0-py3-none-any.whl
Algorithm Hash digest
SHA256 6da0bad36c346c57dcef66e005ca50bb68ae9c98ae6bebfb69ff917fa8274c5b
MD5 931c20e6664856aaf1abc4de0662ff6d
BLAKE2b-256 86bc18f2ba991d45ad915b9d705659c2514d28f78f12549971bdfb15e4eda4bd

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page