Data Preparation Toolkit Transforms using Ray

Project description

DPK Python Transforms

Installation

The transforms are delivered as a standard Python library available on PyPI and can be installed with pip:

python -m pip install data-prep-toolkit-transforms[all]

or, for the Ray transforms:

python -m pip install data-prep-toolkit-transforms[ray,all]

Installing the Python transforms also installs data-prep-toolkit.

Installing the Ray transforms also installs data-prep-toolkit[ray].
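After installation, one way to confirm which distributions pip actually resolved is to query the package metadata from Python. The following is a minimal sketch using only the standard library's importlib.metadata; the distribution names are taken from the notes above, and the helper name installed_versions is hypothetical, not part of the toolkit.

```python
from importlib import metadata


def installed_versions(names):
    """Map each distribution name to its installed version, or None if absent."""
    versions = {}
    for name in names:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            # The distribution is not installed in this environment.
            versions[name] = None
    return versions


if __name__ == "__main__":
    # Distribution names as given in the installation notes above.
    found = installed_versions(
        ["data-prep-toolkit-transforms", "data-prep-toolkit"]
    )
    for name, version in found.items():
        print(f"{name}: {version or 'not installed'}")
```

If either name is reported as not installed, the pip command likely ran against a different interpreter than the one executing this script; using python -m pip, as above, helps avoid that mismatch.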

List of Transforms in current package

Note: This list includes the transforms that have been part of the release since data-prep-toolkit-transforms:0.2.1. It may not always be up to date. Users are encouraged to raise an issue in git when they discover missing components, or packages that are listed below but are not in the release they get from PyPI.

Release notes:

1.0.0.a5

Added PII Redactor

1.0.0.a4

Added missing Ray implementations for lang_id, doc_quality, tokenization, and filter
Added Ray notebooks for lang_id, doc_quality, tokenization, and filter

1.0.0.a3

Added code_profiler

1.0.0.a2

Relax dependencies on pandas (use latest or whatever is installed by application)
Relax dependencies on requests (use latest or whatever is installed by application)

Download files

Source Distributions

No source distribution files are available for this release.

Built Distributions

data_prep_toolkit_transforms-1.0.0a5-py3-none-any.whl (27.5 MB view details)

Uploaded Python 3

File details

Details for the file data_prep_toolkit_transforms-1.0.0a5-py3-none-any.whl.

File hashes

Hashes for data_prep_toolkit_transforms-1.0.0a5-py3-none-any.whl
Algorithm Hash digest
SHA256 67cfc4e0e27e90957b3d62bc0f85e6688518be30eb43352547f3b4924040f45d
MD5 0b00ad30bf8d69ca39debcecd338eb29
BLAKE2b-256 c3951012b8202216e99a9b7b3b4255a27dc7fda93abee79dc9d9c76815a7fb5d
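A downloaded wheel can be checked against the SHA256 digest listed above before it is installed. The sketch below uses only the standard library's hashlib and hmac modules; the helper names and the assumption that the wheel sits in the current directory are illustrative, not part of the toolkit.

```python
import hashlib
import hmac
from pathlib import Path


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_digest(path, expected_hex):
    """Compare a file's SHA256 with a published hex digest (case-insensitive)."""
    actual = sha256_of_file(path)
    return hmac.compare_digest(actual, expected_hex.strip().lower())


if __name__ == "__main__":
    # SHA256 listed above for data_prep_toolkit_transforms-1.0.0a5-py3-none-any.whl.
    expected = "67cfc4e0e27e90957b3d62bc0f85e6688518be30eb43352547f3b4924040f45d"
    # Assumption: the wheel was downloaded into the current directory.
    wheel = Path("data_prep_toolkit_transforms-1.0.0a5-py3-none-any.whl")
    if wheel.exists():
        ok = matches_published_digest(wheel, expected)
        print("digest OK" if ok else "digest MISMATCH")
    else:
        print(f"{wheel} not found; download it first")
```

Reading in chunks keeps memory use flat for a file of this size (27.5 MB); the same helper works for the second wheel by swapping in its file name and SHA256 value.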

File details

Details for the file data_prep_toolkit_transforms-1.0.0a5-1-py3-none-any.whl.

File hashes

Hashes for data_prep_toolkit_transforms-1.0.0a5-1-py3-none-any.whl
Algorithm Hash digest
SHA256 2c0cec116d6dda095d5bcbaf040bc8717a46a1e6403805d7ae4b168c39e90d61
MD5 a5fda8efb2a57662c78e1b0d659f02a6
BLAKE2b-256 be4576aac4f7a98fb3ae973a4958a41701c2d0b6addbfc8006f650b6aab30726
