Skip to main content

A scalable framework for data input and output operations in Spark applications

Project description

PyData I/O

License Spark Python PRs Welcome

Data I/O is an open source project that provides a flexible and scalable framework for data input and output operations in Spark applications. It offers a set of powerful tools and abstractions to simplify and streamline data processing pipelines.

Features

  • Easy-to-use API for defining data processors and transformations
  • Seamless integration with popular data storage systems and formats
  • Support for batch and streaming data processing
  • Extensible architecture for custom data processors and pipelines
  • Scalable and fault-tolerant processing using Apache Spark
  • Open to make use of python ML models ecosystem (sklearn, xgboost, pytorch...)

Getting Started

To get started with PyData I/O, please refer to the documentation for installation instructions, usage examples, and API references.

Issues and Support

If you encounter any issues or require support, please create a new issue on the GitHub repository.

Contribution

Contributions to Data I/O are welcome! To contribute, please follow the guidelines outlined in our contribution guide.

License

This project is licensed under the Apache License 2.0 license. See the LICENSE file for more information.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pydataio-1.1.0.tar.gz (97.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pydataio-1.1.0-py3-none-any.whl (17.2 kB view details)

Uploaded Python 3

File details

Details for the file pydataio-1.1.0.tar.gz.

File metadata

  • Download URL: pydataio-1.1.0.tar.gz
  • Upload date:
  • Size: 97.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for pydataio-1.1.0.tar.gz
Algorithm Hash digest
SHA256 a97e9b5e505c0d582da967854a347f7ee0d164fa076910757ff3aeef99785584
MD5 8053dee167e8a8afe85673e6ab53c050
BLAKE2b-256 1dda49fdd774c62a4f47d6a01591c76428a165a5d7490493cbe955ed9ca2a08d

See more details on using hashes here.

Provenance

The following attestation bundles were made for pydataio-1.1.0.tar.gz:

Publisher: release-pypi.yml on AmadeusITGroup/PyDataIO

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file pydataio-1.1.0-py3-none-any.whl.

File metadata

  • Download URL: pydataio-1.1.0-py3-none-any.whl
  • Upload date:
  • Size: 17.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for pydataio-1.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 100a5adc268ef14b650a517557d196de9d1ed10684ef47eafd51db46334b7637
MD5 101a1edb4dbb00bcae1dd3eaf8d1bcd4
BLAKE2b-256 aa1910b54e5c572c8d5b84af9fad6a469d03927fe6a16bcfdfd8232392f65421

See more details on using hashes here.

Provenance

The following attestation bundles were made for pydataio-1.1.0-py3-none-any.whl:

Publisher: release-pypi.yml on AmadeusITGroup/PyDataIO

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page