Skip to main content

Data Preparation Toolkit Transforms using Ray

Project description

DPK Python Transforms

installation

The transforms are delivered as a standard pyton library available on pypi and can be installed using pip install:

python -m pip install data-prep-toolkit-transforms[all] or python -m pip install data-prep-toolkit-transforms[ray, all] or python -m pip install data-prep-toolkit-transforms[language]

installing the python transforms will also install data-prep-toolkit

installing the ray transforms will also install data-prep-toolkit[ray]

List of Transforms in current package

Note: This list includes the transforms that were part of the release starting with data-prep-toolkit-transforms:0.2.1. This list may not always reflect up to date information. Users are encourage to raise an issue in git when they discover missing components or packages that are listed below but not in the current release they get from pypi.

Release notes:

1.0.1.dev1

Added Gneissweb transforms
fdedup fix for windows

1.0.1.dev0

PR #979 (code_profiler)

1.0.0.a6

Added Profiler
Added Resize

1.0.0.a5

Added Pii Redactor
Relax fasttext requirement >= 0.9.2

1.0.0.a4

Added missing ray implementation for lang_id, doc_quality, tokenization and filter
Added ray notebooks for lang id, Doc Quality, tokenization, and Filter

1.0.0.a3

Added code_profiler

1.0.0.a2

Relax dependencies on pandas (use latest or whatever is installed by application) Relax dependencies on requests (use latest or whatever is installed by application)

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

File details

Details for the file data_prep_toolkit_transforms-1.1.1.dev0-py3-none-any.whl.

File metadata

File hashes

Hashes for data_prep_toolkit_transforms-1.1.1.dev0-py3-none-any.whl
Algorithm Hash digest
SHA256 ced6a060a5f5fd545140a97c713c24f00d8a56c2cb05ba01201f8f3107aa8e23
MD5 c5913d931dbaa74b95e2ec634abf930e
BLAKE2b-256 7ee6f525ed2fa4159264e186ed5742b38bfd0b6daf7e342724eec3d6ce6fab69

See more details on using hashes here.

File details

Details for the file data_prep_toolkit_transforms-1.1.1.dev0-3-py3-none-any.whl.

File metadata

File hashes

Hashes for data_prep_toolkit_transforms-1.1.1.dev0-3-py3-none-any.whl
Algorithm Hash digest
SHA256 c624dc5fa504d7c680fe2dacf9f064c6558b9369c67924848fc80236025880e4
MD5 157f1384e81bff42b58a82c59b7b200a
BLAKE2b-256 b5926248ccefbbb365dca6ddf3707d0904b81c1949e275bdfccfe200084aef59

See more details on using hashes here.

File details

Details for the file data_prep_toolkit_transforms-1.1.1.dev0-2-py3-none-any.whl.

File metadata

File hashes

Hashes for data_prep_toolkit_transforms-1.1.1.dev0-2-py3-none-any.whl
Algorithm Hash digest
SHA256 e3eb2b7226fc2d5d5102176dd0b6b949843a94934a448f47ed42ddd45704bee8
MD5 3205d18c362c13eabe2fa3cb72a4cd3d
BLAKE2b-256 c7f63e1a4bc4e91745e59c7f004cb4931f96f4137f66c1a07ce56e28da1bfbcf

See more details on using hashes here.

File details

Details for the file data_prep_toolkit_transforms-1.1.1.dev0-1-py3-none-any.whl.

File metadata

File hashes

Hashes for data_prep_toolkit_transforms-1.1.1.dev0-1-py3-none-any.whl
Algorithm Hash digest
SHA256 13882189eaf0784e56177d0043c6315f96af29ba384863871d1e8aed22fdd2c9
MD5 79777259fe335c0b17c39851fbb87aeb
BLAKE2b-256 07d20cdbdb5b87e8f08e8dc8d2ab2ff520411ae05ef27f9a7daf48f95a87a023

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page