Skip to main content

Python bindings for rust-data-processing: schema-first CSV/JSON/Parquet/Excel ingestion into an in-memory DataSet.

Project description

rust-data-processing

Phase 3 scope: Rust core with Python (PyO3) and Java (Panama) bindings, agent-ready JSON FFI, and shared batch/streaming connectors

Python bindings for the rust-data-processing crate: schema-first ingestion from CSV, JSON, Parquet, and Excel into an in-memory DataSet, with profiling, validation, Polars-backed pipelines, SQL, Phase 2 JSONL export, privacy transforms and summaries, median, Arrow interop, incremental ingest helpers, and Phase 3 parity with JVM JSON FFI (agent-ready structured in/out for LangGraph and tool-calling workflows).

Infographic: Phase 3 — one Rust engine; Python (PyO3 / PyPI) and Java (Panama / Maven + Gradle) bindings; Phase 1–2 capabilities; agent-ready JSON in/out; shared connectors (Postgres, S3, Kafka, Snowflake).

This page is the PyPI project description (Python-only). Clone the repository for developer setup, Rust sources, and the full monorepo README.

Install

pip install rust-data-processing

Requires Python 3.10+.

Quick start

import rust_data_processing as rdp

schema = [
    {"name": "id", "data_type": "int64"},
    {"name": "name", "data_type": "utf8"},
]
ds = rdp.ingest_from_path("path/to/data.csv", schema, {"format": "csv"})
print("rows", ds.row_count())

report = rdp.profile_dataset(ds, {"head_rows": 50, "quantiles": [0.5]})
print("profile rows sampled", report["row_count"])

validation = rdp.validate_dataset(
    ds,
    {"checks": [{"kind": "not_null", "column": "id", "severity": "error"}]},
)
print("checks", validation["summary"]["total_checks"])

Phase 2 (export, privacy, JSONL, median, Delta handoff)

Copy-paste snippets: Phase 2 Python examples (Markdown in repo). These APIs are also summarized in API.md (section Export, privacy summaries, truncation (Phase 2)).

Documentation

Link
This package on PyPI pypi.org/project/rust-data-processing
Python examples (HTML, pdoc) GitHub Pages — examples
Python API (HTML, pdoc) GitHub Pages — Python
Python API (markdown) API.md in the repository
Combined site (landing + Rust rustdoc) GitHub Pages — home
Rust crate API docs.rs/rust-data-processing
JVM bindings (Java) docs/java/README.md
Repository github.com/scorpio-datalake/rust-data-processing

License

MIT OR Apache-2.0 - see LICENSE-MIT and LICENSE-APACHE in the repository.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

rust_data_processing-0.3.6.tar.gz (5.9 MB view details)

Uploaded Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

rust_data_processing-0.3.6-cp315-cp315t-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (32.8 MB view details)

Uploaded CPython 3.15tmanylinux: glibc 2.17+ x86-64

rust_data_processing-0.3.6-cp315-cp315-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (32.8 MB view details)

Uploaded CPython 3.15manylinux: glibc 2.17+ x86-64

rust_data_processing-0.3.6-cp314-cp314t-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (32.8 MB view details)

Uploaded CPython 3.14tmanylinux: glibc 2.17+ x86-64

rust_data_processing-0.3.6-cp314-cp314-win_amd64.whl (29.5 MB view details)

Uploaded CPython 3.14Windows x86-64

rust_data_processing-0.3.6-cp314-cp314-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (32.8 MB view details)

Uploaded CPython 3.14manylinux: glibc 2.17+ x86-64

rust_data_processing-0.3.6-cp314-cp314-macosx_11_0_arm64.whl (29.3 MB view details)

Uploaded CPython 3.14macOS 11.0+ ARM64

rust_data_processing-0.3.6-cp313-cp313-win_amd64.whl (29.5 MB view details)

Uploaded CPython 3.13Windows x86-64

rust_data_processing-0.3.6-cp313-cp313-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (32.8 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.17+ x86-64

rust_data_processing-0.3.6-cp313-cp313-macosx_11_0_arm64.whl (29.3 MB view details)

Uploaded CPython 3.13macOS 11.0+ ARM64

rust_data_processing-0.3.6-cp312-cp312-win_amd64.whl (29.5 MB view details)

Uploaded CPython 3.12Windows x86-64

rust_data_processing-0.3.6-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (32.8 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.17+ x86-64

rust_data_processing-0.3.6-cp312-cp312-macosx_11_0_arm64.whl (29.3 MB view details)

Uploaded CPython 3.12macOS 11.0+ ARM64

rust_data_processing-0.3.6-cp311-cp311-win_amd64.whl (29.5 MB view details)

Uploaded CPython 3.11Windows x86-64

rust_data_processing-0.3.6-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (32.8 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.17+ x86-64

rust_data_processing-0.3.6-cp311-cp311-macosx_11_0_arm64.whl (29.3 MB view details)

Uploaded CPython 3.11macOS 11.0+ ARM64

rust_data_processing-0.3.6-cp310-cp310-win_amd64.whl (29.5 MB view details)

Uploaded CPython 3.10Windows x86-64

rust_data_processing-0.3.6-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (32.9 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.17+ x86-64

File details

Details for the file rust_data_processing-0.3.6.tar.gz.

File metadata

  • Download URL: rust_data_processing-0.3.6.tar.gz
  • Upload date:
  • Size: 5.9 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for rust_data_processing-0.3.6.tar.gz
Algorithm Hash digest
SHA256 88fd85d1f45644639feb5e6cb6449c94526b6e45da87adfb2bba2fce60b872a9
MD5 f975e32158a4cb37e7c3a345a76e5387
BLAKE2b-256 26246acb7e32ed7d4d4a869fe5f779fff0047b8c8aebc05447c8f8a0fd6aebf9

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.3.6-pp311-pypy311_pp73-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.3.6-pp311-pypy311_pp73-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 0ddd6b3a47c9148e22f26431356a8ad34a264eb2bf7d2ee467df1d80516105c4
MD5 81199d2c233d394cbe1bbe6653cc900f
BLAKE2b-256 0fca53801b29c0959457857bfaa1586226b3062876d08290f1e1212cf0b6fc53

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.3.6-cp315-cp315t-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.3.6-cp315-cp315t-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 5dd9e8eaea79acc4363fa8732177b8112bb6a16c9fa6a5163587f46fd3ff584f
MD5 81b339f8eabc1fc47c54070dd946ed08
BLAKE2b-256 83d48245938a1ca5075b43c6966bcdcaecf38d1008327476fe25b886013ac3ba

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.3.6-cp315-cp315-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.3.6-cp315-cp315-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 bbe7d7e9fb2c3cdd1590ee148c71e9e6b36962a4ffb04280eb50521b0b72d683
MD5 55b9628f739e47b3a5de8eab2a7b76fc
BLAKE2b-256 bc2532ce255fc6ebacb941d1924b27c32f707af800ff34241069705ce8915cbb

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.3.6-cp314-cp314t-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.3.6-cp314-cp314t-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 2ccea257aed7fa1e4d4ff86e8f7ccd8a8721e58690dec5cfbfb7254608b7968d
MD5 493f80fad8cbe5b08cecedcd40343605
BLAKE2b-256 0b56a806dc50a2f2ae448171967c49adbd73434d33455080099f7bec915d07ee

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.3.6-cp314-cp314-win_amd64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.3.6-cp314-cp314-win_amd64.whl
Algorithm Hash digest
SHA256 2c85454d050632ad305d5ba2fa1c8e2db8cb659f6b8c3e88fb7c87af3f81abaa
MD5 22f7f32a9619a77dc269ea59e486074d
BLAKE2b-256 532d9586b64b53e309633ecd2f4a61d80612edef13eb5ac8bb9ca5a815499cca

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.3.6-cp314-cp314-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.3.6-cp314-cp314-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 a1441a14c2592f12edec21b9fda07a5f819dba9dc880b3ae8de9119eb61b5292
MD5 9f98320b8853a2fff4b067c8db4bc0c7
BLAKE2b-256 745bb63551d6158a57f2990bf44776d918f8aee59d3ac97285f9858f4afd763e

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.3.6-cp314-cp314-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.3.6-cp314-cp314-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 9879aa92c47f4f38396af0d1af30a98168932914a867919b3b89dc1cfba574fc
MD5 44678161eea1dfd2352f4e21a7b94bef
BLAKE2b-256 cc369bce759368c261d6c43d49750203ec98126f3e45c23908d7eb5e45457da9

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.3.6-cp313-cp313-win_amd64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.3.6-cp313-cp313-win_amd64.whl
Algorithm Hash digest
SHA256 74d40f83c4bd11c534f5df7239c4920f964a2831b7016cad481b232180479d30
MD5 c98812a73355d1ba845c656c1c13efbd
BLAKE2b-256 12316bf8aec7a91c7deddfb1cf0812b61c36acce24f1bbe53d2f49f2f88f8936

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.3.6-cp313-cp313-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.3.6-cp313-cp313-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 b862464a1edc9f2151483779e1de9e27a5ebdbc3a54788dfa2b5c82543b384e2
MD5 b01a1d3822d53031263d447f245176e0
BLAKE2b-256 fb551f79dd41cda018a2c509eb3be80ae30c9a30e5d65baec2a41ab8047e5547

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.3.6-cp313-cp313-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.3.6-cp313-cp313-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 05f27911c20ddf141b1ccef0dfaad7cdad777d639dc214b455a334e4cf606759
MD5 d2dc60fe0d32dab0bf10bdcd37a86ca1
BLAKE2b-256 e60a7160c1a1bc010a12dcd4eb4a71fae92ed03ebbd003e65d6d6afc436c7ead

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.3.6-cp312-cp312-win_amd64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.3.6-cp312-cp312-win_amd64.whl
Algorithm Hash digest
SHA256 fac74105cce06a788c703d7e10516bc616c62a991c5421001e317ecf4d9e65ba
MD5 55ec33f1c69ab9f18209311fb1fe4609
BLAKE2b-256 c466eb27461703e094ae6341dfec35609af4fa2c8f36413c2fecc11982a5da45

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.3.6-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.3.6-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 f10f6e05f3029ec8fd5167a19ab78a50966305ac19aaad1d171bebf2c8adea42
MD5 9721a59b6247fe71da2f099031c7ec69
BLAKE2b-256 3f188a3e6e67e3d4005ae2b2d357c98b2f56a2a4fca4fd24668a81ec4ffa7b87

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.3.6-cp312-cp312-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.3.6-cp312-cp312-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 767daee3fe65b112f643e752c7c33e72596d043d482b7fa51c216c360b133a6a
MD5 0b4120dac995dfc9c13582b1c0a6815d
BLAKE2b-256 20ddd9201e0300addb6d857823bc1716d3240bba0e3ee0c8b35460da723df5ec

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.3.6-cp311-cp311-win_amd64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.3.6-cp311-cp311-win_amd64.whl
Algorithm Hash digest
SHA256 4fa272bb4d0a858190903f3838e066da0e62ab44b6e4ecbb4d36849338f9f520
MD5 4670f8066aeb19164d6d56b4cfbfa507
BLAKE2b-256 c62d4bdcd10de6216187557f47f3674fadb5b1e3fc2bf42381a4c8e926aac770

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.3.6-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.3.6-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 7b81cf2b8e43172c7b2523a281bc3bf09242286b352136d7fbbd874676cdb97c
MD5 8f09a7d78a5d0a5bc8cce1f969f1e4bb
BLAKE2b-256 591a6ae47f6779bd6802a390d84a07286b44e4813ddfc682a3f93f06fe5e301f

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.3.6-cp311-cp311-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.3.6-cp311-cp311-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 fdc4fc0bfb28ecb2b9a198c3ee1f142398342d0c731ce74ac05a129c9e3b71ea
MD5 ab9bf08beaea383a15ce991948b4c404
BLAKE2b-256 97b48e57818f2683a2cc3fcbf87827e12afcf4d77e43599f55ece840866fadad

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.3.6-cp310-cp310-win_amd64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.3.6-cp310-cp310-win_amd64.whl
Algorithm Hash digest
SHA256 deb9fe0ce4896e3e8006fba331e856650c20adb5d575f52ff9b76395311d8704
MD5 7be83d92cd9d491d480ddfd90829ca58
BLAKE2b-256 9723aceb8138dfcdf8eb1febdee56840caaafe962ddfd43f9a5ff670ddf18853

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.3.6-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.3.6-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 3609f9392bc9e35526abc1b98dad3aed4eedbeaaf44631f59b8d3760c89654de
MD5 afad1b60ee3cd98a51d4ef772854f7e8
BLAKE2b-256 fd830a112159dd5f6c8fb5f738e7cc18bc2a3d82e06a47957a920578d3df9604

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page