Skip to main content

A scalable framework for data input and output operations in Spark applications

Project description

PyData I/O

License Spark Python PRs Welcome

Data I/O is an open source project that provides a flexible and scalable framework for data input and output operations in Spark applications. It offers a set of powerful tools and abstractions to simplify and streamline data processing pipelines.

Features

  • Easy-to-use API for defining data processors and transformations
  • Seamless integration with popular data storage systems and formats
  • Support for batch and streaming data processing
  • Extensible architecture for custom data processors and pipelines
  • Scalable and fault-tolerant processing using Apache Spark
  • Open to make use of python ML models ecosystem (sklearn, xgboost, pytorch...)

Getting Started

To get started with PyData I/O, please refer to the documentation for installation instructions, usage examples, and API references.

Issues and Support

If you encounter any issues or require support, please create a new issue on the GitHub repository.

Contribution

Contributions to Data I/O are welcome! To contribute, please follow the guidelines outlined in our contribution guide.

License

This project is licensed under the Apache License 2.0 license. See the LICENSE file for more information.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pydataio-1.1.1.tar.gz (97.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pydataio-1.1.1-py3-none-any.whl (17.2 kB view details)

Uploaded Python 3

File details

Details for the file pydataio-1.1.1.tar.gz.

File metadata

  • Download URL: pydataio-1.1.1.tar.gz
  • Upload date:
  • Size: 97.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for pydataio-1.1.1.tar.gz
Algorithm Hash digest
SHA256 9a2de999cfa66b38354acf47a95a650420bb0c15994f8912c5530dc7fef65d91
MD5 95191bd941dbbf7a22bcdfb60645eb56
BLAKE2b-256 9955d3cb200d0f8d2646cb50db2d6a4195ce4d26342d058d45b09528ba8a8be4

See more details on using hashes here.

Provenance

The following attestation bundles were made for pydataio-1.1.1.tar.gz:

Publisher: release-pypi.yml on AmadeusITGroup/PyDataIO

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file pydataio-1.1.1-py3-none-any.whl.

File metadata

  • Download URL: pydataio-1.1.1-py3-none-any.whl
  • Upload date:
  • Size: 17.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for pydataio-1.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 3c7073cf1cbc12c86e698d007a2e1ec3e636892c288383619ea5ff0b2dac74ac
MD5 15b20db47c878587770c7bc1a80624ff
BLAKE2b-256 0a14e39ee181d01980f58ea57fe180ee04688cbbad11511de2dfa9d525f194d6

See more details on using hashes here.

Provenance

The following attestation bundles were made for pydataio-1.1.1-py3-none-any.whl:

Publisher: release-pypi.yml on AmadeusITGroup/PyDataIO

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page