Skip to main content

Tools that make working with scikit-learn and pandas easier.

Project description

NOTE: This project is on life support. That means there are probably not any new features being added, but there will be regular updates to support upcoming versions of sklearn and pandas.

This repository contains tools that make working with scikit-learn and pandas easier.

Build Status

What is this?

dstoolbox is not one big tool but rather an amalgamation of small re-usable tools. They are intended to work well with scikit-learn and pandas make the integration of those libraries easier.

The best way to get started is to have a look at the notebooks folder, especially at the showcase notebook.

The tools included here are used by us at Otto Group BI for our production services, as well as by individual members for machine learning related things, such as participating in Kaggle competitions.

Installation instructions

Using pip:

pip install dstoolbox

There is a conda recipe for those who want to build their own conda package.

Contributing

Pull requests are welcome. Here are some directions:

Tests

To run the tests, you need to install the dev requirements using pip:

pip install -r requirements-dev.txt

or conda:

conda install --file requirements-dev.txt

Next you should check that all unit tests and all static code checks pass:

py.test
pylint dstoolbox

Guidelines

  • Python 3 only.

  • Code should be re-usable and succinct.

  • Where applicable, it should be compatible with scikit-learn, pandas, and Palladium.

  • It should be documented and unit-tested using pytest (100% code coverage desired).

  • It should conform to the coding standards prescribed by pylint (where it makes sense).

  • There should be usage examples that cover the most common use cases (the best place would be an IPython/Jupyter notebook).

  • Don’t add dependencies unless absolutely necessary.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dstoolbox-0.14.0.tar.gz (297.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

dstoolbox-0.14.0-py3-none-any.whl (48.1 kB view details)

Uploaded Python 3

File details

Details for the file dstoolbox-0.14.0.tar.gz.

File metadata

  • Download URL: dstoolbox-0.14.0.tar.gz
  • Upload date:
  • Size: 297.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for dstoolbox-0.14.0.tar.gz
Algorithm Hash digest
SHA256 fbc0edd1e3f9da2d8fa13f854d1216328043816a2704d6ded4bad2c295240b1c
MD5 07d9db6b379a785a716e7ec05facd2db
BLAKE2b-256 f1608dbb5fe8f947371dcfb6d491bcf844a0ae3e7e0422d28047feb90491b22f

See more details on using hashes here.

Provenance

The following attestation bundles were made for dstoolbox-0.14.0.tar.gz:

Publisher: deploy_to_pypi.yml on ottogroup/dstoolbox

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file dstoolbox-0.14.0-py3-none-any.whl.

File metadata

  • Download URL: dstoolbox-0.14.0-py3-none-any.whl
  • Upload date:
  • Size: 48.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for dstoolbox-0.14.0-py3-none-any.whl
Algorithm Hash digest
SHA256 2426bffd995e410906abca95bf626032aa9ba9d842793ca10b2968e85c4ea55f
MD5 7c1417051fa7b75386594b9bdf5cad24
BLAKE2b-256 22372ccbea3cf6b232929080ebdc71b4c011b1c201fefddab96ecb708b991975

See more details on using hashes here.

Provenance

The following attestation bundles were made for dstoolbox-0.14.0-py3-none-any.whl:

Publisher: deploy_to_pypi.yml on ottogroup/dstoolbox

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page