Data Preparation Toolkit Transforms
Project description
DPK Python Transforms
installation
The transforms are delivered as a standard pyton library available on pypi and can be installed using pip install:
python -m pip install data-prep-toolkit-transforms
installing the python transforms will also install data-prep-toolkit
List of Transforms in current package
- code
- code2parquet
- header_cleanser (Not available on MacOS)
- code_quality
- proglang_select
- language
- doc_chunk
- *doc_quality
- lang_id
- pdf2parquet
- text_encoder
- universal
- ededup
- filter
- resize
- tokenization
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Close
Hashes for data_prep_toolkit_transforms-0.2.1.dev1.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | 412ba90d3f4e16fa2ed0cb5172096ffab008e8d2742271770762ed328b3fee3b |
|
MD5 | 2ea0d1bf53cb7757edd27adb93cf656c |
|
BLAKE2b-256 | 571b1254ad1177080ec46c655cb955d6673601aa4d544fab839ee03581ab4845 |
Close
Hashes for data_prep_toolkit_transforms-0.2.1.dev1-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | da2637e315972f7e2db2ec9de1a25d7bcf48890f3eb29b8f1cbbd615115e0093 |
|
MD5 | 2ebe02b85cba38127378f69b18793a23 |
|
BLAKE2b-256 | d13f87889f3fcde6968d48942ab5e11b50c836e1ca3903d2a7cfb37b434eac22 |