Skip to main content

A scalable framework for data input and output operations in Spark applications

Project description

PyData I/O

License Spark Python PRs Welcome

Data I/O is an open source project that provides a flexible and scalable framework for data input and output operations in Spark applications. It offers a set of powerful tools and abstractions to simplify and streamline data processing pipelines.

Features

  • Easy-to-use API for defining data processors and transformations
  • Seamless integration with popular data storage systems and formats
  • Support for batch and streaming data processing
  • Extensible architecture for custom data processors and pipelines
  • Scalable and fault-tolerant processing using Apache Spark
  • Open to make use of python ML models ecosystem (sklearn, xgboost, pytorch...)

Getting Started

To get started with PyData I/O, please refer to the documentation for installation instructions, usage examples, and API references.

Issues and Support

If you encounter any issues or require support, please create a new issue on the GitHub repository.

Contribution

Contributions to Data I/O are welcome! To contribute, please follow the guidelines outlined in our contribution guide.

License

This project is licensed under the Apache License 2.0 license. See the LICENSE file for more information.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pydataio-1.0.0.tar.gz (95.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pydataio-1.0.0-py3-none-any.whl (17.2 kB view details)

Uploaded Python 3

File details

Details for the file pydataio-1.0.0.tar.gz.

File metadata

  • Download URL: pydataio-1.0.0.tar.gz
  • Upload date:
  • Size: 95.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for pydataio-1.0.0.tar.gz
Algorithm Hash digest
SHA256 886767abcb024f77d6119c0df81702328eb60806ca8c82be61d58eb128739d7e
MD5 5a4fee67c83afd09c90d506ad9bf2cb9
BLAKE2b-256 d99e507af27b4305358ed30aec4567c5ea51528e2728ee197c9c6f31d3912c49

See more details on using hashes here.

Provenance

The following attestation bundles were made for pydataio-1.0.0.tar.gz:

Publisher: release-pypi.yml on AmadeusITGroup/PyDataIO

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file pydataio-1.0.0-py3-none-any.whl.

File metadata

  • Download URL: pydataio-1.0.0-py3-none-any.whl
  • Upload date:
  • Size: 17.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for pydataio-1.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 b9046d73ac6fc77e365a0bcc00fbcca6cb47183a983a8a12f6a5358e767ccc22
MD5 7c6430bbd038f9edb207a05004df31c5
BLAKE2b-256 082dfe4bf2b2a2b13ba4370688f06c7cacdd6fe8ca7214094ce846234310a41d

See more details on using hashes here.

Provenance

The following attestation bundles were made for pydataio-1.0.0-py3-none-any.whl:

Publisher: release-pypi.yml on AmadeusITGroup/PyDataIO

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page