Skip to main content

A scalable framework for data input and output operations in Spark applications

Project description

PyData I/O

License Spark Python PRs Welcome

Data I/O is an open source project that provides a flexible and scalable framework for data input and output operations in Spark applications. It offers a set of powerful tools and abstractions to simplify and streamline data processing pipelines.

Features

  • Easy-to-use API for defining data processors and transformations
  • Seamless integration with popular data storage systems and formats
  • Support for batch and streaming data processing
  • Extensible architecture for custom data processors and pipelines
  • Scalable and fault-tolerant processing using Apache Spark
  • Open to make use of python ML models ecosystem (sklearn, xgboost, pytorch...)

Getting Started

To get started with PyData I/O, please refer to the documentation for installation instructions, usage examples, and API references.

Issues and Support

If you encounter any issues or require support, please create a new issue on the GitHub repository.

Contribution

Contributions to Data I/O are welcome! To contribute, please follow the guidelines outlined in our contribution guide.

License

This project is licensed under the Apache License 2.0 license. See the LICENSE file for more information.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pydataio-1.0.1.tar.gz (97.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pydataio-1.0.1-py3-none-any.whl (17.2 kB view details)

Uploaded Python 3

File details

Details for the file pydataio-1.0.1.tar.gz.

File metadata

  • Download URL: pydataio-1.0.1.tar.gz
  • Upload date:
  • Size: 97.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for pydataio-1.0.1.tar.gz
Algorithm Hash digest
SHA256 5de50ed534478a9e8e684c0963fefba32c4ec52a071c73d8ffecdf960f45e3d6
MD5 6cb0e3820855e3ec936d9f28fb12f537
BLAKE2b-256 a96d573bb1e78c12891d833ba8a5f4f666e274da3503a21f7960060a29dde85c

See more details on using hashes here.

Provenance

The following attestation bundles were made for pydataio-1.0.1.tar.gz:

Publisher: release-pypi.yml on AmadeusITGroup/PyDataIO

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file pydataio-1.0.1-py3-none-any.whl.

File metadata

  • Download URL: pydataio-1.0.1-py3-none-any.whl
  • Upload date:
  • Size: 17.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for pydataio-1.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 b06749977f223c50068b2c36f22f735444a38ee030c0f6e4ae9e9404cb691468
MD5 2097194704a94ccc03d803887e44009e
BLAKE2b-256 e28716d555e524c9856a74ceb9b5f4b4e85775916522c3ad3987a4c698d01f44

See more details on using hashes here.

Provenance

The following attestation bundles were made for pydataio-1.0.1-py3-none-any.whl:

Publisher: release-pypi.yml on AmadeusITGroup/PyDataIO

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page