[![Build Status](https://travis-ci.org/Nikolay-Lysenko/dsawl.svg?branch=master)](https://travis-ci.org/Nikolay-Lysenko/dsawl) [![codecov](https://codecov.io/gh/Nikolay-Lysenko/dsawl/branch/master/graph/badge.svg)](https://codecov.io/gh/Nikolay-Lysenko/dsawl) [![Maintainability](https://api.codeclimate.com/v1/badges/98fc23b8b51fb20f2920/maintainability)](https://codeclimate.com/github/Nikolay-Lysenko/dsawl/maintainability) [![PyPI version](https://badge.fury.io/py/dsawl.svg)](https://badge.fury.io/py/dsawl)

# dsawl

## What is it?

This is a set of tools for machine learning. The utilities provided by the package are described in the table below:

| Subject | Description | Docs |
| :-----: | :---------: | :--: |
| Active Learning | Highly modular system that recommends which previously unlabelled examples should be labelled in order to increase model quality quickly and significantly. Special features: various options for both exploitation and exploration. | [Read more](https://github.com/Nikolay-Lysenko/dsawl/blob/master/docs/active_learning_demo.ipynb) |
| Stacking | A method that applies a machine learning algorithm to out-of-fold predictions or transformations made by other machine learning models. Special features: support for any sklearn-compatible estimators (in particular, pipelines). | [Read more](https://github.com/Nikolay-Lysenko/dsawl/blob/master/docs/stacking_demo.ipynb) |
| Target Encoding | An alternative to one-hot encoding and the hashing trick that aims to combine memory efficiency with retention of all useful information from the initial features. Special features: sklearn-compatible wrapper that can transform data out-of-fold and apply an estimator to the result. | [Read more](https://github.com/Nikolay-Lysenko/dsawl/blob/master/docs/target_encoding_demo.ipynb) |
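The out-of-fold idea behind the stacking and target encoding entries can be illustrated with a minimal sketch. It is not the dsawl API: the function name `out_of_fold_target_encode`, the interleaved fold assignment, and the fallback to the training-fold mean are all assumptions made for illustration, and only the standard library is used. Each row's category is replaced by the mean target computed on the other folds, so a row never sees its own target.

```python
from collections import defaultdict


def out_of_fold_target_encode(categories, targets, n_folds=3):
    """Encode each category by the mean target of rows in the other folds.

    Hypothetical helper for illustration; it does not mirror dsawl's interface.
    """
    if n_folds < 2:
        raise ValueError("n_folds must be at least 2")
    n = len(categories)
    encoded = [0.0] * n
    for fold in range(n_folds):
        sums = defaultdict(float)
        counts = defaultdict(int)
        train_total = 0.0
        train_count = 0
        # Rows with index % n_folds == fold are held out; the rest are training rows.
        for i in range(n):
            if i % n_folds != fold:
                sums[categories[i]] += targets[i]
                counts[categories[i]] += 1
                train_total += targets[i]
                train_count += 1
        train_mean = train_total / train_count
        for i in range(fold, n, n_folds):
            cat = categories[i]
            # Categories absent from the training rows fall back to the training mean.
            encoded[i] = sums[cat] / counts[cat] if counts[cat] else train_mean
    return encoded


categories = ["a", "b", "a", "b", "a", "b"]
targets = [1, 0, 1, 0, 0, 1]
encoded = out_of_fold_target_encode(categories, targets, n_folds=3)
print(encoded)  # [0.5, 0.5, 0.5, 0.5, 1.0, 0.0]
```

Stacking follows the same pattern, with a fitted model's predictions on the held-out fold taking the place of the per-category means.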

The repository name is a combination of three words: DS, saw, and awl. DS is an abbreviation for Data Science, and the latter two words represent useful tools.

## How to install the package?

The package is compatible with Python 3.5 or newer. A virtual environment in which the package is guaranteed to work can be created from the [requirements.txt](https://github.com/Nikolay-Lysenko/dsawl/blob/master/requirements.txt) file.

To install a stable release of the package, run this command: `pip install dsawl`

To install the latest version from source, run these commands in your terminal, in order:

1. `cd path/to/your/destination`
2. `git clone https://github.com/Nikolay-Lysenko/dsawl`
3. `cd dsawl`
4. `pip install -e .`
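After either installation route, a quick sanity check can confirm that the interpreter meets the stated Python 3.5 minimum and that the package can be found. The sketch below uses only the standard library. The helper names are hypothetical, and it assumes the import name matches the distribution name `dsawl`.

```python
import importlib.util
import sys


def meets_minimum_python(version_info=None, minimum=(3, 5)):
    """Return True if the (major, minor) version is at least `minimum`."""
    if version_info is None:
        version_info = sys.version_info
    return tuple(version_info[:2]) >= minimum


def is_importable(module_name):
    """Return True if a top-level module with this name can be located."""
    return importlib.util.find_spec(module_name) is not None


print("Python version OK:", meets_minimum_python())
# Assumption: the package is imported under the name "dsawl".
print("dsawl importable:", is_importable("dsawl"))
```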

If you run into any trouble with installation, questions are welcome.

Source Distribution

dsawl-0.1.1.tar.gz (18.5 kB)

File metadata

  • Download URL: dsawl-0.1.1.tar.gz
  • Size: 18.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for dsawl-0.1.1.tar.gz

| Algorithm | Hash digest |
| :-------: | :---------: |
| SHA256 | c73fef800f781b86d93455ce11404e3eb34eed7026ff2d4a9c4edbc2008439b5 |
| MD5 | 22e93f493792b665797101b71ae27f67 |
| BLAKE2b-256 | f5aa7222d731b0a5c1fbbcb48696fff7aca65897fd14c3e38adf749c7cf2ecab |
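A downloaded archive can be checked against the SHA256 digest listed above. The following is a minimal sketch using only `hashlib` from the standard library. The function names are hypothetical, and the local file path is whatever location the archive was saved to.

```python
import hashlib

# SHA256 digest published for dsawl-0.1.1.tar.gz (copied from the hashes above).
EXPECTED_SHA256 = "c73fef800f781b86d93455ce11404e3eb34eed7026ff2d4a9c4edbc2008439b5"


def sha256_of_file(path, chunk_size=65536):
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_SHA256):
    """Return True if the file's SHA256 digest equals the expected one."""
    return sha256_of_file(path) == expected.lower()
```

For example, `matches_expected("dsawl-0.1.1.tar.gz")` should return `True` for an intact download saved in the current directory.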
