Skip to main content

Python bindings for rust-data-processing: schema-first CSV/JSON/Parquet/Excel ingestion into an in-memory DataSet.

Project description

rust-data-processing

Phase 2 scope: Phase 1 baseline plus export, privacy, Arrow, incremental ETL → Python; JVM planned

Python bindings for the rust-data-processing crate: schema-first ingestion from CSV, JSON, Parquet, and Excel into an in-memory DataSet, with profiling, validation, Polars-backed pipelines, SQL, and Phase 2 JSONL export, privacy transforms and summaries, median, Arrow interop, and incremental ingest helpers.

Infographic: Phase 2 — Phase 1 flow plus export, privacy, median, Arrow, incremental ETL; JVM planned Phase 3.

This page is the PyPI project description (Python-only). Clone the repository for developer setup, Rust sources, and the full monorepo README.

Install

pip install rust-data-processing

Requires Python 3.10+.

Quick start

import rust_data_processing as rdp

schema = [
    {"name": "id", "data_type": "int64"},
    {"name": "name", "data_type": "utf8"},
]
ds = rdp.ingest_from_path("path/to/data.csv", schema, {"format": "csv"})
print("rows", ds.row_count())

report = rdp.profile_dataset(ds, {"head_rows": 50, "quantiles": [0.5]})
print("profile rows sampled", report["row_count"])

validation = rdp.validate_dataset(
    ds,
    {"checks": [{"kind": "not_null", "column": "id", "severity": "error"}]},
)
print("checks", validation["summary"]["total_checks"])

Phase 2 (export, privacy, JSONL, median, Delta handoff)

Copy-paste snippets: Phase 2 Python examples (Markdown in repo). These APIs are also summarized in API.md (section Export, privacy summaries, truncation (Phase 2)).

Documentation

Link
This package on PyPI pypi.org/project/rust-data-processing
Python examples (HTML, pdoc) GitHub Pages — examples
Python API (HTML, pdoc) GitHub Pages — Python
Python API (markdown) API.md in the repository
Combined site (landing + Rust rustdoc) GitHub Pages — home
Rust crate API docs.rs/rust-data-processing
Repository github.com/rust-data-processing/rust-data-processing

License

MIT OR Apache-2.0 - see LICENSE-MIT and LICENSE-APACHE in the repository.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

rust_data_processing-0.3.0.tar.gz (4.0 MB view details)

Uploaded Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

rust_data_processing-0.3.0-cp314-cp314-win_amd64.whl (29.5 MB view details)

Uploaded CPython 3.14Windows x86-64

rust_data_processing-0.3.0-cp314-cp314-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (32.2 MB view details)

Uploaded CPython 3.14manylinux: glibc 2.17+ x86-64

rust_data_processing-0.3.0-cp314-cp314-macosx_11_0_arm64.whl (29.3 MB view details)

Uploaded CPython 3.14macOS 11.0+ ARM64

rust_data_processing-0.3.0-cp313-cp313-win_amd64.whl (29.5 MB view details)

Uploaded CPython 3.13Windows x86-64

rust_data_processing-0.3.0-cp313-cp313-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (32.2 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.17+ x86-64

rust_data_processing-0.3.0-cp313-cp313-macosx_11_0_arm64.whl (29.3 MB view details)

Uploaded CPython 3.13macOS 11.0+ ARM64

rust_data_processing-0.3.0-cp312-cp312-win_amd64.whl (29.5 MB view details)

Uploaded CPython 3.12Windows x86-64

rust_data_processing-0.3.0-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (32.2 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.17+ x86-64

rust_data_processing-0.3.0-cp312-cp312-macosx_11_0_arm64.whl (29.3 MB view details)

Uploaded CPython 3.12macOS 11.0+ ARM64

rust_data_processing-0.3.0-cp311-cp311-win_amd64.whl (29.5 MB view details)

Uploaded CPython 3.11Windows x86-64

rust_data_processing-0.3.0-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (32.2 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.17+ x86-64

rust_data_processing-0.3.0-cp311-cp311-macosx_11_0_arm64.whl (29.3 MB view details)

Uploaded CPython 3.11macOS 11.0+ ARM64

rust_data_processing-0.3.0-cp310-cp310-win_amd64.whl (29.5 MB view details)

Uploaded CPython 3.10Windows x86-64

rust_data_processing-0.3.0-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (32.2 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.17+ x86-64

File details

Details for the file rust_data_processing-0.3.0.tar.gz.

File metadata

  • Download URL: rust_data_processing-0.3.0.tar.gz
  • Upload date:
  • Size: 4.0 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for rust_data_processing-0.3.0.tar.gz
Algorithm Hash digest
SHA256 dfbfed15b03b6287caefe7ef3fa9529bc719dfa82d1a2007daae57ec6f899ae7
MD5 a8002e0b34622fb31c7e757428b3fb13
BLAKE2b-256 f76bbe9fa82836baaf7f1c70e6ca6a22a0f29e340dbf19a0d692b83972529e38

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.3.0-pp311-pypy311_pp73-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.3.0-pp311-pypy311_pp73-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 e954c2ade0452f47560e897e96da8059111ae0663b18a8e50cba973743df9e88
MD5 7cc96b8c0d5893b32bfd87deff36880b
BLAKE2b-256 9c713d9486ecd4bc544ac09760e6286bb518d2bb098f7ffc1d228757482155f7

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.3.0-cp314-cp314-win_amd64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.3.0-cp314-cp314-win_amd64.whl
Algorithm Hash digest
SHA256 0fd57f4b889b59316bd5de66788c947bcdabc612fc0397c1a2dd7d46f582b0e9
MD5 41229fde85126124978be9a86ef5c224
BLAKE2b-256 fa61b8e0dcf6ded1763b0aa152d654a124c0c3cb6cc73888389b27dd881f39dc

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.3.0-cp314-cp314-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.3.0-cp314-cp314-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 85834d3b4794c704d76cf5dd7e8af9793d902d45986769aa8f3b038c1853b558
MD5 3fce7fcffcf5dd3b42a0097fc2978f90
BLAKE2b-256 534b4d8681d2d40c4c2591785cefb3f61962cf28a98fb6f81914618fdc19582e

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.3.0-cp314-cp314-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.3.0-cp314-cp314-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 a86506f94e672cac38c958094cd212c05d249a0fc861e147083d9eabf13c750e
MD5 6d9739e0c6e1764303aef6e40ab576e7
BLAKE2b-256 1875ae6a44c18260dfd824acdd0d1293a1cdd9fcf1f5bf6a9ec47df117c31717

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.3.0-cp313-cp313-win_amd64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.3.0-cp313-cp313-win_amd64.whl
Algorithm Hash digest
SHA256 a59be06175253fb1223b52dfb2499a19cb82aa346a26ed3b33e9e20a18bca22f
MD5 35b26f6d3a6a1e9ba92939ae66ed2ade
BLAKE2b-256 184f42831eee68a0de2ccc92c7bf6890d72a99a910e3459f27860af28438e0be

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.3.0-cp313-cp313-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.3.0-cp313-cp313-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 7ed2e9df09ce7c0681723ffbce313c54e91d8fbf71c0732a06ea48920bbebf7f
MD5 b627b422d670ebea88ef7899091de4dd
BLAKE2b-256 af9ec258b1440492bd998a2591da737c4814c86564e966079630ab90e60ee7a5

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.3.0-cp313-cp313-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.3.0-cp313-cp313-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 c550066f3830a2b4a1814196270ff075d622ce40b6d8a9ef1f6ccdd1e34a912f
MD5 cf130896eef5665d1dd0b9e0e0cd6255
BLAKE2b-256 3a1afab58d394fb6e39564066cc7603f3f1fd1a4c40a42e3cb40e769da46a928

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.3.0-cp312-cp312-win_amd64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.3.0-cp312-cp312-win_amd64.whl
Algorithm Hash digest
SHA256 e7ba0128bae64a58933855d9dee1fc52797f6f7bc05a5e08e73445e079563fd0
MD5 501db53833b085aaedb76d11ac2637f0
BLAKE2b-256 e3766e63d20fb0dae7cdb83b3c311fa2a31d82dffd9889aacd70c4ed0862f80c

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.3.0-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.3.0-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 67612a7dfcea4db44f8f6ab7427dd44fe43403a6ec0490c19dccf2055528d179
MD5 bed321a94d63662e4d31136af2a274ca
BLAKE2b-256 2194413ff16b91c033e4bb529e1d80713cff872fb0e15af76650036a417a2741

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.3.0-cp312-cp312-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.3.0-cp312-cp312-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 b47ddae62665509e20b38c935bcc971866d69f8d74a2fad1eebd5ee0c9d12572
MD5 3c0a147efb2c6ce2e74db8c9379af347
BLAKE2b-256 d59d6a3f0bef29f1e75d64fa0c3475c3f3938b3813e0d72b0b79df9609f019d4

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.3.0-cp311-cp311-win_amd64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.3.0-cp311-cp311-win_amd64.whl
Algorithm Hash digest
SHA256 c85553795a22b6b2c6d309263c370bbdbf9a3990b95f613b9314e1e53d470356
MD5 a2cc918d4a01e811364bc99a648404a4
BLAKE2b-256 802f8cba7672139344b955331eb015614d93a16987ac52ae32ca0edfdd6d0ccc

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.3.0-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.3.0-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 0cdb95e645b64896855088cb4d4a216bdba16a2da20da91ae74d381e9aed36a6
MD5 bfc4141cafd9d6413284d65ec8813921
BLAKE2b-256 91fb18c8b6c887264cd7125095c666e6f8f9e76a8a9f33ad658294148e16b1c6

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.3.0-cp311-cp311-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.3.0-cp311-cp311-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 fd63379e64c02d2b9db4865248f81bba451295591ef9e666f02e3ad1a18c5571
MD5 affa19431f1cc872f59bcdaf8955c54b
BLAKE2b-256 b6e5cec6163a687a616eb665d9ef86dd45dc1ccc7dc03208eb845699a9647bfa

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.3.0-cp310-cp310-win_amd64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.3.0-cp310-cp310-win_amd64.whl
Algorithm Hash digest
SHA256 4df785a61e102fd9a93078475c6cb50fb74827aa19a971d826524a6e3c364c64
MD5 b6e90bc786891f80db2b4194a932cf8e
BLAKE2b-256 5c03cdd0b1bce66e0c09d51ba9bd6fcaba10009b5dcbb0f06e100ee154e3e924

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.3.0-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.3.0-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 364787f04404ba8baa59836342481065bd33553df3dca2ca94bf6d5e3f5f9b02
MD5 8d0cd324ebe035c30b7d882812e233a3
BLAKE2b-256 aaf4ab6f0af513d7e1df5f68025f34efe3efc57a667f7fe2d6dfea915cc63d9f

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page