Python bindings for rust-data-processing: schema-first CSV/JSON/Parquet/Excel ingestion into an in-memory DataSet.

rust-data-processing

Phase 2 scope: the Phase 1 baseline plus export, privacy, Arrow interop, and incremental ETL, all available from Python. JVM bindings are planned for Phase 3.

Python bindings for the rust-data-processing crate. The package provides schema-first ingestion from CSV, JSON, Parquet, and Excel into an in-memory DataSet, with profiling, validation, Polars-backed pipelines, and SQL. Phase 2 adds JSONL export, privacy transforms and summaries, median, Arrow interop, and incremental ingest helpers.

This page is the PyPI project description (Python-only). Clone the repository for developer setup, Rust sources, and the full monorepo README.

Install

pip install rust-data-processing

Requires Python 3.10+.
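If you are unsure whether an environment meets the version requirement, a check like the one below can be run before installing. This is a minimal sketch using only the standard library. The helper name `meets_minimum` is hypothetical and is not part of this package.

```python
import sys

# Minimum interpreter version stated in this README.
MIN_PYTHON = (3, 10)


def meets_minimum(version_info=sys.version_info, minimum=MIN_PYTHON) -> bool:
    """Return True when the (major, minor) pair is at least `minimum`."""
    return tuple(version_info[:2]) >= tuple(minimum)


if __name__ == "__main__":
    status = "ok" if meets_minimum() else "too old"
    print(f"Python {sys.version_info.major}.{sys.version_info.minor}: {status}")
```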

Quick start

import rust_data_processing as rdp

# Declare the expected columns up front (schema-first ingestion).
schema = [
    {"name": "id", "data_type": "int64"},
    {"name": "name", "data_type": "utf8"},
]
# Read the CSV file into an in-memory DataSet using that schema.
ds = rdp.ingest_from_path("path/to/data.csv", schema, {"format": "csv"})
print("rows", ds.row_count())

# Profile the dataset; the result is indexed like a dict.
report = rdp.profile_dataset(ds, {"head_rows": 50, "quantiles": [0.5]})
print("profile rows sampled", report["row_count"])

# Run a not_null check on the id column, reported at error severity.
validation = rdp.validate_dataset(
    ds,
    {"checks": [{"kind": "not_null", "column": "id", "severity": "error"}]},
)
print("checks", validation["summary"]["total_checks"])
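The quick start expects a CSV file at `path/to/data.csv`. To try it without real data, a small file matching the schema can be generated first. The sketch below is an illustration under assumptions: `write_sample_csv` and `try_ingest` are hypothetical helpers, the sample rows are made up, and it assumes the CSV reader expects a header row whose names match the schema. Only `ingest_from_path` and its argument shape come from the quick start.

```python
import csv
import tempfile
from pathlib import Path

# Same schema shape as the quick start.
SCHEMA = [
    {"name": "id", "data_type": "int64"},
    {"name": "name", "data_type": "utf8"},
]


def write_sample_csv(directory: Path) -> Path:
    """Write a tiny CSV whose header matches SCHEMA (hypothetical helper)."""
    path = directory / "sample.csv"
    with path.open("w", newline="", encoding="utf-8") as handle:
        writer = csv.writer(handle)
        # Assumption: the header row carries the schema's column names.
        writer.writerow([column["name"] for column in SCHEMA])
        writer.writerows([[1, "ada"], [2, "grace"], [3, "linus"]])
    return path


def try_ingest(path: Path):
    """Ingest the file if rust_data_processing is installed, else return None."""
    try:
        import rust_data_processing as rdp
    except ImportError:
        return None
    return rdp.ingest_from_path(str(path), SCHEMA, {"format": "csv"})
```

Calling `try_ingest(write_sample_csv(Path(tmp)))` inside a `tempfile.TemporaryDirectory()` block should give a DataSet to experiment with when the package is installed, and `None` otherwise.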

Phase 2 (export, privacy, JSONL, median, Delta handoff)

For copy-paste snippets, see "Phase 2 Python examples" (Markdown, in the repository). The same APIs are summarized in API.md, in the section "Export, privacy summaries, truncation (Phase 2)".
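For readers new to the formats named in this heading, the sketch below shows what JSONL (one JSON object per line) and a column median look like, using only the Python standard library. It does not use or describe this package's export API: `to_jsonl` and `from_jsonl` are hypothetical helpers and the rows are made-up sample data. The real function names are listed in API.md.

```python
import json
import statistics


def to_jsonl(rows):
    """Serialize an iterable of dicts as JSON Lines: one JSON object per line."""
    return "".join(json.dumps(row, ensure_ascii=False) + "\n" for row in rows)


def from_jsonl(text):
    """Parse JSON Lines text back into a list of dicts, skipping blank lines."""
    return [json.loads(line) for line in text.splitlines() if line.strip()]


# Made-up sample rows, shaped like the quick-start schema.
rows = [
    {"id": 1, "name": "ada"},
    {"id": 2, "name": "grace"},
    {"id": 3, "name": "linus"},
]

payload = to_jsonl(rows)
median_id = statistics.median(row["id"] for row in rows)
```

Because each line is an independent JSON document, a JSONL file can be appended to and read back incrementally, which is one reason the format is common in ETL handoffs.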

Documentation

Resource | Link
This package on PyPI | pypi.org/project/rust-data-processing
Python examples (HTML, pdoc) | GitHub Pages (examples)
Python API (HTML, pdoc) | GitHub Pages (Python)
Python API (Markdown) | API.md in the repository
Combined site (landing + Rust rustdoc) | GitHub Pages (home)
Rust crate API | docs.rs/rust-data-processing
Repository | github.com/vihangdesai2018-png/rust-data-processing

License

MIT OR Apache-2.0 - see LICENSE-MIT and LICENSE-APACHE in the repository.

