No project description provided
Project description
niacin
A Python library for replacing the missing variation in your data.
Why should I use this?
Data collected for model training necessarily undersamples the likely variance in the input space. This library is a collection of tools for inserting typical kinds of perturbations to better approximate population variance; and, for creating similar-but-incorrect examples to aid in reducing the total size of the hypothesis space. These are commonly known as ENRICHMENT and NEGATIVE SAMPLING, respectively.
How do I use this?
Functions in niacin are separated into submodules for specific data types. Functions expose a similar API, with two input arguments: the data to be transformed, and the probability of applying a specific transformation.
enrichment:
data = "This is the song that never ends and it goes on and on my friends"
print(text.add_misspelling(data, p=1.0))
This is teh song tath never ends adn it goes on anbd on my firends
negative sampling:
data = "This is the song that never ends and it goes on and on my friends"
print(text.add_hypernyms(data, p=1.0))
This is the musical composition that never extremity and it exit on and on my person
How do I install this?
with pip:
pip install niacin
from source:
git clone git@github.com:deniederhut/niacin.git && cd niacin && python setup.py install
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distributions
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file niacin-0.1.1.tar.gz.
File metadata
- Download URL: niacin-0.1.1.tar.gz
- Upload date:
- Size: 1.2 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.20.1 setuptools/40.4.3 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/3.6.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
0d3fb337d7435e50a1bec7363d73dbca24d0bc959523db95e108edbcd21e794e
|
|
| MD5 |
083c1419b97fff01895b6181fc3b625c
|
|
| BLAKE2b-256 |
cf03a4002d60a75a12b907bd1c39a6bb08506e387e0b0fe59bd438d8e37094ec
|
File details
Details for the file niacin-0.1.1-py3.6.egg.
File metadata
- Download URL: niacin-0.1.1-py3.6.egg
- Upload date:
- Size: 1.2 MB
- Tags: Egg
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.20.1 setuptools/40.4.3 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/3.6.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
f870c18dfd928c3cec1eb07e367231f22e45130389878978fe3185fffd90da03
|
|
| MD5 |
1f54853a05d974d2ce4b87c6e31e1812
|
|
| BLAKE2b-256 |
738fb734d6df3c38d479320d602bfb6cd871f13dda8ffa8a21b0e8cf235a93d7
|
File details
Details for the file niacin-0.1.1-py3-none-any.whl.
File metadata
- Download URL: niacin-0.1.1-py3-none-any.whl
- Upload date:
- Size: 1.2 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.20.1 setuptools/40.4.3 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/3.6.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
4678443fa57e9b22dbcf34e86b296a5c36562b343b330027b3b50d7e87cbee1d
|
|
| MD5 |
4a43c8b4b191763b08c77d3f7e4f1c6d
|
|
| BLAKE2b-256 |
ef7c5c1e13f67b2cc725fcd4e0f53ee59b471947ec7f036d161098388bf1d2f6
|