Skip to main content

Python bindings for rust-data-processing: schema-first CSV/JSON/Parquet/Excel ingestion into an in-memory DataSet.

Project description

rust-data-processing

Phase 1 scope: sources → rust-data-processing → Python / optional AI & ML surfaces

Python bindings for the rust-data-processing crate: schema-first ingestion from CSV, JSON, Parquet, and Excel into an in-memory DataSet, with profiling, validation, Polars-backed pipelines, and SQL.

Infographic: Phase 1 — single-node, library-first flow (ingest → DataSet, pipelines, SQL, profile, validate, outliers, transforms, parallel execution, PyO3 bindings, optional chatbot / notebook story).

This page is the PyPI project description (Python-only). Clone the repository for developer setup, Rust sources, and the full monorepo README.

Install

pip install rust-data-processing

Requires Python 3.10+.

Quick start

import rust_data_processing as rdp

schema = [
    {"name": "id", "data_type": "int64"},
    {"name": "name", "data_type": "utf8"},
]
ds = rdp.ingest_from_path("path/to/data.csv", schema, {"format": "csv"})
print("rows", ds.row_count())

report = rdp.profile_dataset(ds, {"head_rows": 50, "quantiles": [0.5]})
print("profile rows sampled", report["row_count"])

validation = rdp.validate_dataset(
    ds,
    {"checks": [{"kind": "not_null", "column": "id", "severity": "error"}]},
)
print("checks", validation["summary"]["total_checks"])

Documentation

Link
Python examples (HTML, pdoc) GitHub Pages — examples
Python API (HTML, pdoc) GitHub Pages — Python
Python API (markdown) API.md in the repository
Combined site (landing + Rust rustdoc) GitHub Pages — home
Rust crate API docs.rs/rust-data-processing
Repository github.com/vihangdesai2018-png/rust-data-processing

License

MIT OR Apache-2.0 - see LICENSE-MIT and LICENSE-APACHE in the repository.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

rust_data_processing-0.1.8.tar.gz (1.7 MB view details)

Uploaded Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

rust_data_processing-0.1.8-cp314-cp314-win_amd64.whl (28.9 MB view details)

Uploaded CPython 3.14Windows x86-64

rust_data_processing-0.1.8-cp314-cp314-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (31.7 MB view details)

Uploaded CPython 3.14manylinux: glibc 2.17+ x86-64

rust_data_processing-0.1.8-cp314-cp314-macosx_11_0_arm64.whl (28.8 MB view details)

Uploaded CPython 3.14macOS 11.0+ ARM64

rust_data_processing-0.1.8-cp313-cp313-win_amd64.whl (28.9 MB view details)

Uploaded CPython 3.13Windows x86-64

rust_data_processing-0.1.8-cp313-cp313-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (31.6 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.17+ x86-64

rust_data_processing-0.1.8-cp313-cp313-macosx_11_0_arm64.whl (28.8 MB view details)

Uploaded CPython 3.13macOS 11.0+ ARM64

rust_data_processing-0.1.8-cp312-cp312-win_amd64.whl (28.9 MB view details)

Uploaded CPython 3.12Windows x86-64

rust_data_processing-0.1.8-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (31.6 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.17+ x86-64

rust_data_processing-0.1.8-cp312-cp312-macosx_11_0_arm64.whl (28.8 MB view details)

Uploaded CPython 3.12macOS 11.0+ ARM64

rust_data_processing-0.1.8-cp311-cp311-win_amd64.whl (28.9 MB view details)

Uploaded CPython 3.11Windows x86-64

rust_data_processing-0.1.8-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (31.7 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.17+ x86-64

rust_data_processing-0.1.8-cp311-cp311-macosx_11_0_arm64.whl (28.8 MB view details)

Uploaded CPython 3.11macOS 11.0+ ARM64

rust_data_processing-0.1.8-cp310-cp310-win_amd64.whl (28.9 MB view details)

Uploaded CPython 3.10Windows x86-64

rust_data_processing-0.1.8-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (31.7 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.17+ x86-64

File details

Details for the file rust_data_processing-0.1.8.tar.gz.

File metadata

  • Download URL: rust_data_processing-0.1.8.tar.gz
  • Upload date:
  • Size: 1.7 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for rust_data_processing-0.1.8.tar.gz
Algorithm Hash digest
SHA256 d4d1a7788ea299040b77e0f6bca317797f2e1dcc2a59fc48ce591b9f979ca1ed
MD5 38a9096683ea2f891a4b6438ad4b220a
BLAKE2b-256 31a06de5e21570700e4feb057e53b9a95e679f5f45b82c2a9fc74e6780843b51

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.1.8-pp311-pypy311_pp73-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.1.8-pp311-pypy311_pp73-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 660ccc7d925dc1238be0160f32a385dec1e46a36c53f7c7c3745bd8fa2734b2d
MD5 bcbd1cedd462fb4a9e96f8d1a45504c5
BLAKE2b-256 269d0c7365c0378138e17e329ee4eb23fe1709f77763939cd1b183569cd0875b

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.1.8-cp314-cp314-win_amd64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.1.8-cp314-cp314-win_amd64.whl
Algorithm Hash digest
SHA256 8ee41ab53ce474966ad2681d91e3b57bf60d5a2f4637dc9797a69b241d650a3f
MD5 734ae4b7d884a2a97a99d757a3bd56c8
BLAKE2b-256 86ecefe0b4e6184c67668a6c46e0455e84fed0f3131585b27914a646b358958e

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.1.8-cp314-cp314-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.1.8-cp314-cp314-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 78c16f299f73eb08cc6b74af4007da0c182dec5b849f094e75163cecc418bbb2
MD5 e8efbc9bcc3a36c7990fde7391ec469d
BLAKE2b-256 29ad0cec182ebba7a77400a8a6551fc0211bd08e6b4ebd366d27d2156aee703d

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.1.8-cp314-cp314-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.1.8-cp314-cp314-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 67a0737b97ea2aa80df18f61ae135c315f45266a7b726b35ffff91169b09a156
MD5 afd4bce74294aa4ca825f61aeb810cb2
BLAKE2b-256 a3e2dc778cb1829325888ddf2ca62ccbc271b5cab5bcade7eb02ea44267115e2

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.1.8-cp313-cp313-win_amd64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.1.8-cp313-cp313-win_amd64.whl
Algorithm Hash digest
SHA256 2994d70f399a8950ccb74f21ee47607199dd5c3e69d63dd8d2b221c9a0eebdc2
MD5 77fe257617b7d7b7a22e9d7b25d4cdde
BLAKE2b-256 61801665480dec8b6e739c483e6aa3429439d0dd45170433e26939399149c1de

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.1.8-cp313-cp313-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.1.8-cp313-cp313-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 c9e43a33163f249a7d05188931a67f98977fee93945ba3d0a040e63ac753e051
MD5 fdf81e6911fcebc2df945e5175997881
BLAKE2b-256 661760a80bb910e4c7facf1898576eccc5178c75daaab24179da19e038f19d14

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.1.8-cp313-cp313-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.1.8-cp313-cp313-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 6b82b8f3ffb9a317ad946c5194d4c0e6ba30262ca19aa28dea9e21987a56480f
MD5 6376efe6517e77c34bb305a6e15cf608
BLAKE2b-256 74104ca1fcb8b30eb25daa3106f8abf921a6579d8767c8db06f253ced55cd3b2

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.1.8-cp312-cp312-win_amd64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.1.8-cp312-cp312-win_amd64.whl
Algorithm Hash digest
SHA256 95a5bf8626cba4dbf5a82c38dd894387e88270febcbc5addaad36fb973ed763b
MD5 28c92b9cca3b9ab027120e40cb9593a5
BLAKE2b-256 3d8bc106785e9c87be26ba824ec69aec7514c3e6754cec0d90a7f0490cd53b6e

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.1.8-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.1.8-cp312-cp312-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 9d49e76a56f84d8ea5b64ab26ee964604438dcab6f9449db3bc63e4148b9cf7d
MD5 6c1599aba4d56b51bc7b372212fdc01a
BLAKE2b-256 739964833e3011532b4534a046746e931ab6774c8081eddf41839776ae27c9e1

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.1.8-cp312-cp312-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.1.8-cp312-cp312-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 0ec25fa2292c904250fdecd019c03688b078df698f727477884f048d3ec4783b
MD5 e3437dd3cca47079d2944e26aa39b86c
BLAKE2b-256 e8b9b7de438d25b6a312739b5be0facc9b697aac33656f6aebb1e0d1ace9388b

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.1.8-cp311-cp311-win_amd64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.1.8-cp311-cp311-win_amd64.whl
Algorithm Hash digest
SHA256 3349a8f650490c0a02dcf4105eb843a4d07064be17eaf7965d8fadfe02a44f71
MD5 7fa5fa5099b7528bc83f6a1e3e425ef0
BLAKE2b-256 ed88e7f8ee8cf967dcbc345db0a64a93741459cf03ff6bfa09db4f28fb31b0e3

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.1.8-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.1.8-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 fbe467208d883a81548353b1e16a809b5c183ce4d6948b72d6c60a919b05d0e5
MD5 fec67a607a37312fd082f587fc49afa8
BLAKE2b-256 034776b10232cce3d67dbc9896409af817e9e833c10bd3031c771e018832e353

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.1.8-cp311-cp311-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.1.8-cp311-cp311-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 01e1c3784a0f880e55ba02056b243c40e1d4b34999fb806cc38d87882addc233
MD5 4319fa9656e2d225682cf7232ea5da75
BLAKE2b-256 71a89a9039ec86958095424e5a0ee9b803c6a2abed32580665ecfc0774751b22

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.1.8-cp310-cp310-win_amd64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.1.8-cp310-cp310-win_amd64.whl
Algorithm Hash digest
SHA256 c3b3f4192bf8591014aae4fb0b26a8557cb520f7fa6e92d86118689d770c0035
MD5 aaafc3bbace089909daed76faa55e86c
BLAKE2b-256 a147e3e131b77a208cf74810fb12cb148aefc261f8c0769a04d3ff91761afd8b

See more details on using hashes here.

File details

Details for the file rust_data_processing-0.1.8-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for rust_data_processing-0.1.8-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 32cfbbe05ae8ef003472efa520475da45d2cd40f8616f9d93f6fa15e9315267c
MD5 fdce124a8255619f282131cc006daa7b
BLAKE2b-256 a5d2a8bd564476d2fb01cecbf932ab4b9d1ba8cbb034968b57b699bf89fe2a14

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page