Skip to main content

A lightweight tool to perform reproducible machine learning experiment using Dask.

Project description

https://img.shields.io/pypi/v/daskperiment.svg Latest Docs https://travis-ci.org/sinhrks/daskperiment.svg?branch=master https://codecov.io/gh/sinhrks/daskperiment/branch/master/graph/badge.svg

Overview

daskperiment is a tool to perform reproducible machine learning experiment. It allows users to define and manage the history of trials (given parameters, results and execution environment).

The package is built on Dask, a package for parallel computing with task scheduling. Each experiment trial is internally expressed as Dask computation graph, and can be executed in parallel.

Benefits

  • Usable in standard Python/Jupyter environment (and optionally with standard KVS).

    • No need to setting up server applications.

    • No registration to cloud services.

    • Not to be constrained by slightly customized Python shells.

  • User-intuitive.

    • Minimizing modifications of existing codes.

    • Performing experiments using Dask compatible API.

    • Easily handle experiments history (with pandas basic operations).

    • Requires less work to manage with Git (no need to make branch per trials).

    • (Experimental) Web dashboard to manage trial history.

  • Tracking experiment related information

    • Trial result and its (hyper) parameters.

    • Code context.

    • Environment information.

      • Device information

      • OS information

      • Python version

      • Installed Python packages and its version

      • Git information

  • Reproducibility

    • Check function purity (each step should return the same output for the same inputs)

    • Automatic random seeding

  • Auto saving and loading previous experiment history.

  • Parallel execution of experiment steps.

  • Sharing experiments.

    • Redis backend

Future Scope

  • More efficient execution.

    • Omit execution if depending parameters are the same

    • Distributed execution

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

daskperiment-0.3.0.tar.gz (31.9 kB view details)

Uploaded Source

Built Distribution

daskperiment-0.3.0-py3-none-any.whl (56.3 kB view details)

Uploaded Python 3

File details

Details for the file daskperiment-0.3.0.tar.gz.

File metadata

  • Download URL: daskperiment-0.3.0.tar.gz
  • Upload date:
  • Size: 31.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.6.3 requests-toolbelt/0.8.0 tqdm/4.19.5 CPython/3.6.6

File hashes

Hashes for daskperiment-0.3.0.tar.gz
Algorithm Hash digest
SHA256 b44656ab92a0413e358e2c386080dcdea08f3f8f770b72f9286c3bb22809853d
MD5 925e9190d7fa4ff8f016eaf8e68dfda4
BLAKE2b-256 819680b85a80816dcb598f73a3564cf880c3f5c7bdf7bf1d5abe99fb9a7609a0

See more details on using hashes here.

File details

Details for the file daskperiment-0.3.0-py3-none-any.whl.

File metadata

  • Download URL: daskperiment-0.3.0-py3-none-any.whl
  • Upload date:
  • Size: 56.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.6.3 requests-toolbelt/0.8.0 tqdm/4.19.5 CPython/3.6.6

File hashes

Hashes for daskperiment-0.3.0-py3-none-any.whl
Algorithm Hash digest
SHA256 13bb0b2c4dd1d3f2f98ef8292c0e32588f150f8334f8ecbb7a0452e93055df50
MD5 fe8c158dd3e9778496e0622c6d236aed
BLAKE2b-256 232cef94ef4886d37e23b7ec2df266fdc3821207cbd5ade721f6cbf5d6ea8da1

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page