[![Build Status](https://travis-ci.org/Nikolay-Lysenko/dsawl.svg?branch=master)](https://travis-ci.org/Nikolay-Lysenko/dsawl) [![codecov](https://codecov.io/gh/Nikolay-Lysenko/dsawl/branch/master/graph/badge.svg)](https://codecov.io/gh/Nikolay-Lysenko/dsawl) [![Maintainability](https://api.codeclimate.com/v1/badges/98fc23b8b51fb20f2920/maintainability)](https://codeclimate.com/github/Nikolay-Lysenko/dsawl/maintainability)

# dsawl

## What is it?

This is a set of tools for machine learning. It currently provides the following utilities:

| Subject | Description | Docs |
| :---: | :---: | :---: |
| Active Learning | Highly modular system that recommends which previously unlabelled examples should be labelled in order to increase model quality quickly and significantly. Special features: various options for both exploitation and exploration. | [Read more](https://github.com/Nikolay-Lysenko/dsawl/blob/master/docs/active_learning_demo.ipynb) |
| Stacking | A method that applies a machine learning algorithm to out-of-fold predictions or transformations made by other machine learning models. Special features: support for any sklearn-compatible estimators (in particular, pipelines). | [Read more](https://github.com/Nikolay-Lysenko/dsawl/blob/master/docs/stacking_demo.ipynb) |
| Target Encoding | An alternative to one-hot encoding and the hashing trick that aims to be memory-efficient while still incorporating all useful information from the initial features. Special features: sklearn-compatible wrapper that can transform data out-of-fold and apply an estimator to the result. | [Read more](https://github.com/Nikolay-Lysenko/dsawl/blob/master/docs/target_encoding_demo.ipynb) |
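Both the stacking and target encoding entries rely on out-of-fold computation: the value assigned to a row is derived only from rows in other folds, so the row's own target never leaks into its feature. The sketch below illustrates that idea for target encoding using only the standard library. It is not the dsawl API; the function name `out_of_fold_target_encode`, its parameters, the unshuffled round-robin fold assignment, and the fallback for unseen categories are all assumptions made for this illustration.

```python
from collections import defaultdict


def out_of_fold_target_encode(categories, targets, n_folds=3):
    """Replace each category with the mean target observed in the other folds.

    Hypothetical helper for illustration only; it is not part of dsawl.
    """
    n = len(categories)
    if n != len(targets):
        raise ValueError("categories and targets must have the same length")
    if not 2 <= n_folds <= n:
        raise ValueError("n_folds must be between 2 and the number of rows")

    encoded = [0.0] * n
    for fold in range(n_folds):
        # Row i belongs to fold (i % n_folds); real pipelines usually shuffle first.
        sums = defaultdict(float)
        counts = defaultdict(int)
        for i in range(n):
            if i % n_folds != fold:
                sums[categories[i]] += targets[i]
                counts[categories[i]] += 1
        # Assumed fallback for categories absent from the other folds:
        # the mean target over all rows outside the current fold.
        fallback = sum(sums.values()) / sum(counts.values())
        for i in range(fold, n, n_folds):
            key = categories[i]
            seen = counts.get(key, 0)
            encoded[i] = sums[key] / seen if seen else fallback
    return encoded


categories = ["a", "a", "b", "b", "a", "b"]
targets = [1, 0, 1, 1, 1, 0]
print(out_of_fold_target_encode(categories, targets, n_folds=2))
# prints [0.0, 1.0, 0.5, 1.0, 0.0, 1.0]
```

Note that row 0 has target 1 but is encoded as 0.0: its own target is excluded, and the only other "a" row outside its fold has target 0. For the actual classes and their options, see the linked demo notebooks.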

The repository name combines three words: DS, saw, and awl. DS is an abbreviation for Data Science, and the latter two words name useful tools.

## How to install the package?

The package is compatible with Python 3.5 or newer. A virtual environment in which the package is guaranteed to work can be created from the [requirements.txt](https://github.com/Nikolay-Lysenko/dsawl/blob/master/requirements.txt) file.

To install a stable release of the package, run this command:
```
pip install dsawl
```

To install the latest version from sources, execute this from your terminal:
```
cd path/to/your/destination
git clone https://github.com/Nikolay-Lysenko/dsawl
cd dsawl
pip install -e .
```
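After installing, a quick sanity check can confirm that the interpreter meets the stated Python 3.5 minimum and that the package can be found. This is a minimal sketch using only the standard library; it assumes the importable module is named `dsawl` like the repository, and the helper `check_environment` is a hypothetical name introduced here.

```python
import importlib.util
import sys

MIN_PYTHON = (3, 5)  # minimum version stated in this README


def check_environment(package_name="dsawl"):
    """Return (python_ok, package_found) for the current interpreter.

    Hypothetical helper; the default module name "dsawl" is an assumption.
    """
    python_ok = sys.version_info[:2] >= MIN_PYTHON
    # find_spec returns None when a top-level module cannot be located.
    package_found = importlib.util.find_spec(package_name) is not None
    return python_ok, package_found


if __name__ == "__main__":
    python_ok, package_found = check_environment()
    print("Python version OK:", python_ok)
    print("dsawl importable:", package_found)
```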

If you run into any trouble with installation, questions are welcome.


Source Distribution

dsawl-0.0.1.tar.gz (18.5 kB)

  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for dsawl-0.0.1.tar.gz

| Algorithm | Hash digest |
| :---: | :---: |
| SHA256 | `92ee2384c90d817a26f5a1b038d147a646e3357a74da4c53562fd2722481e4a4` |
| MD5 | `37394366e46f0d11b8f457aec471c6af` |
| BLAKE2b-256 | `012cfd09a4f0164c03b50364427bbfaa552b20eed197c920d598ddc4ce2581d6` |
