Skip to main content

Python bindings for rust-data-processing: schema-first CSV/JSON/Parquet/Excel ingestion into an in-memory DataSet.

Project description

rust-data-processing

Phase 1 scope: sources → rust-data-processing → Python / optional AI & ML surfaces

Python bindings for the rust-data-processing crate: schema-first ingestion from CSV, JSON, Parquet, and Excel into an in-memory DataSet, with profiling, validation, Polars-backed pipelines, and SQL.

Infographic: Phase 1 — single-node, library-first flow (ingest → DataSet, pipelines, SQL, profile, validate, outliers, transforms, parallel execution, PyO3 bindings, optional chatbot / notebook story).

This page is the PyPI project description (Python-only). Clone the repository for developer setup, Rust sources, and the full monorepo README.

Install

pip install rust-data-processing

Requires Python 3.10+.

Quick start

import rust_data_processing as rdp

schema = [
    {"name": "id", "data_type": "int64"},
    {"name": "name", "data_type": "utf8"},
]
ds = rdp.ingest_from_path("path/to/data.csv", schema, {"format": "csv"})
print("rows", ds.row_count())

report = rdp.profile_dataset(ds, {"head_rows": 50, "quantiles": [0.5]})
print("profile rows sampled", report["row_count"])

validation = rdp.validate_dataset(
    ds,
    {"checks": [{"kind": "not_null", "column": "id", "severity": "error"}]},
)
print("checks", validation["summary"]["total_checks"])

Phase 2 (export, privacy, JSONL, median, Delta handoff)

Copy-paste snippets: Phase 2 Python examples (Markdown in repo). These APIs are also summarized in API.md (section Export, privacy summaries, truncation (Phase 2)).

Documentation

Link
Python examples (HTML, pdoc) GitHub Pages — examples
Python API (HTML, pdoc) GitHub Pages — Python
Python API (markdown) API.md in the repository
Combined site (landing + Rust rustdoc) GitHub Pages — home
Rust crate API docs.rs/rust-data-processing
Repository github.com/vihangdesai2018-png/rust-data-processing

License

MIT OR Apache-2.0 - see LICENSE-MIT and LICENSE-APACHE in the repository.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

rust_data_processing-0.2.0.tar.gz (4.0 MB view details)

Uploaded Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

rust_data_processing-0.2.0-cp314-cp314-win_amd64.whl (29.0 MB view details)

Uploaded CPython 3.14Windows x86-64

rust_data_processing-0.2.0-cp314-cp314-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (31.7 MB view details)

Uploaded CPython 3.14manylinux: glibc 2.17+ x86-64

rust_data_processing-0.2.0-cp314-cp314-macosx_11_0_arm64.whl (28.9 MB view details)

Uploaded CPython 3.14macOS 11.0+ ARM64

rust_data_processing-0.2.0-cp313-cp313-win_amd64.whl (29.0 MB view details)

Uploaded CPython 3.13Windows x86-64

rust_data_processing-0.2.0-cp313-cp313-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (31.7 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.17+ x86-64

rust_data_processing-0.2.0-cp313-cp313-macosx_11_0_arm64.whl (28.9 MB view details)

Uploaded CPython 3.13macOS 11.0+ ARM64

rust_data_processing-0.2.0-cp312-cp312-win_amd64.whl (29.0 MB view details)

Uploaded CPython 3.12Windows x86-64

rust_data_processing-0.2.0-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (31.7 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.17+ x86-64

rust_data_processing-0.2.0-cp312-cp312-macosx_11_0_arm64.whl (28.9 MB view details)

Uploaded CPython 3.12macOS 11.0+ ARM64

rust_data_processing-0.2.0-cp311-cp311-win_amd64.whl (29.0 MB view details)

Uploaded CPython 3.11Windows x86-64

rust_data_processing-0.2.0-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (31.7 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.17+ x86-64

rust_data_processing-0.2.0-cp311-cp311-macosx_11_0_arm64.whl (28.9 MB view details)

Uploaded CPython 3.11macOS 11.0+ ARM64

rust_data_processing-0.2.0-cp310-cp310-win_amd64.whl (29.0 MB view details)

Uploaded CPython 3.10Windows x86-64

rust_data_processing-0.2.0-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (31.7 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.17+ x86-64

File details

Details for the file rust_data_processing-0.2.0.tar.gz.

File metadata

  • Download URL: rust_data_processing-0.2.0.tar.gz
  • Upload date:
  • Size: 4.0 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for rust_data_processing-0.2.0.tar.gz
Algorithm Hash digest
SHA256 2f27f6292ed4a610dbef78b1347f26d2e9c5a274ede138d166af311a220b6649
MD5 aa6b0dda13b4c3f34b45d0b5b7d9a8b1
BLAKE2b-256 da90fd45ea5049544663b77f3d83288547517a80639afa41188cd4610596bba7

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.2.0-pp311-pypy311_pp73-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.2.0-pp311-pypy311_pp73-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 ce5177c13e259a47471e8f16d49c28a4edc99bbe6dad85db3d2b30c2e277e424
MD5 8e486dbe82e8254a073a6b7a6b694ca4
BLAKE2b-256 233c35fa86090fdb474d1bbd83835e18fc383632162eb0a6e54168b758e8a7f4

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.2.0-cp314-cp314-win_amd64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.2.0-cp314-cp314-win_amd64.whl
Algorithm Hash digest
SHA256 51926c1daf1e35da2e54fcb1488181bc6401913e16315186ad427560aa16917e
MD5 b4692c6aa550a6fde28286dba0858cb1
BLAKE2b-256 c7d8fab3388173925fb9ffa52199f1a3bf7ca00037930323103dc669f2ca15fb

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.2.0-cp314-cp314-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.2.0-cp314-cp314-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 297b0d4d2f89debd89d8d4e4acdee5eb2e32c4a312aa9e14711123d54a48b5d6
MD5 b39752af9b1a8bac1d6143bd9e52cc2c
BLAKE2b-256 a8c97244bdd6d95903866b315c419a912869f7dd871a263e573d40d158e37b26

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.2.0-cp314-cp314-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.2.0-cp314-cp314-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 8fcf10e839a1124c10feccff6686964c1d117c97e552965cac078989fb6d8508
MD5 ef1f0000187ff4219cf6c809b5b1de2c
BLAKE2b-256 7cc8183f9b3339ab2a900862510d3dd0f5e8bb2de8fdd4d48737b8d402e5cba6

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.2.0-cp313-cp313-win_amd64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.2.0-cp313-cp313-win_amd64.whl
Algorithm Hash digest
SHA256 605d25377c1ed31a5c080c3594edc951670fd47ed4452367f3fe6f11c75aa320
MD5 19a99150d83d7fc61e3ed24612c4cb08
BLAKE2b-256 5da7fec9293b6cc4a65422bea53ca1060da00fd506ccd78508d4d611de8a22d8

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.2.0-cp313-cp313-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.2.0-cp313-cp313-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 034e4ade9a9171b0e1a51736e0bee0bfcc793b9372261bec7d92429f8aecb9b4
MD5 902af290b0d6bb1fa79f9f2c8abd73f2
BLAKE2b-256 76b9ebcde3c949067ee14240b93739c976e52aad08c42e044e46954339ad5605

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.2.0-cp313-cp313-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.2.0-cp313-cp313-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 42dcf657990669c87873d780083d790f1a61267fea7a03a2a588d1829e3c5ed3
MD5 fac52f933ebe0e1e11aa5229f69d00cf
BLAKE2b-256 ed7186c73d54425ae17778da42f6fd75612c34864bffc4cb8cbe4869224760fb

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.2.0-cp312-cp312-win_amd64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.2.0-cp312-cp312-win_amd64.whl
Algorithm Hash digest
SHA256 47650a2db761b1a678b528a63fadb87483f26e5fae2ef53cbab51b17634222cf
MD5 ce5799103f7e96ba816e0289379603b3
BLAKE2b-256 d53f8d1cb5eedff59aa1dff8ae5bf6c75bca230049ff6b38dcc530fecd402308

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.2.0-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.2.0-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 6bdebba403b8aaa03004d5eebfc1bc05f4cb122ad6dcff3f3d88cebeb76e1697
MD5 8913bfdd1760e6836925bb53d6d597df
BLAKE2b-256 14208d9422bf72f04ea2b88921c37f526f2aef0be3a35b38710bcbdf48d58b9d

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.2.0-cp312-cp312-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.2.0-cp312-cp312-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 03c0e9991b8048fd1fb9180e739c1756d76cf1dedb434641cdc11434d95284a1
MD5 c9527961a66652b6d60142796a17788a
BLAKE2b-256 bc740d5208b745d39bea92148322783b13f92ea7ec555bd31cd0b90deb4811d7

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.2.0-cp311-cp311-win_amd64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.2.0-cp311-cp311-win_amd64.whl
Algorithm Hash digest
SHA256 b6ef713ca2f0b15e39d23a877e6dc4198a16541520b044d1f92e1f59803f4132
MD5 2b2d7d74cd8d0ce15c6d98a09cb1553e
BLAKE2b-256 32893b8aa67345737f51071d6276cca9089ac9e88e86eda55820b82e9b2b64f3

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.2.0-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.2.0-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 6a9e48e052869a7561834048a7cb94762c25525e0a6e4bc3ff547d9436135958
MD5 96c30b3ec5c8b522ace5b507fb69f887
BLAKE2b-256 287f6c0ce5213dd34f4af40c914533c716d7c875539705eac8cd65ef6138b7e9

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.2.0-cp311-cp311-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.2.0-cp311-cp311-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 fc3505229b081b4bbc969d3d12a0e7d69d27d6cee6ca60f9a09a691791aa2fe6
MD5 0d889cd91d505f72b85ab25d4e3b90d7
BLAKE2b-256 1e5c37b639d66195e89f25b2ba3e0e75357f51ac90797437e0ef4cda8965693f

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.2.0-cp310-cp310-win_amd64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.2.0-cp310-cp310-win_amd64.whl
Algorithm Hash digest
SHA256 3d6fb79ebb9dcd58410632d55acdfe6f9be59d722fccdbba82dfb836159a4b4b
MD5 5d6507fa60af6223ca2931ebfccd8f72
BLAKE2b-256 547b244edd1fa19f8e3b9b49cdc946e227c074a54c85514fcf61aaca9f9cd23a

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.2.0-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.2.0-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 bcf2810c7a6302397d615bb3edc1ebd1fbc7186c514593f6daf029d88ec044d7
MD5 cf192f46f495554ca1845d6c10ed3b99
BLAKE2b-256 c6170689c7e099cfd3b1b1229bbce4c166d3e82746672b23e48be0bdb86cd382

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page