Skip to main content

A scalable framework for data input and output operations in Spark applications

Project description

PyData I/O

License Spark Python PRs Welcome

Data I/O is an open source project that provides a flexible and scalable framework for data input and output operations in Spark applications. It offers a set of powerful tools and abstractions to simplify and streamline data processing pipelines.

Features

  • Easy-to-use API for defining data processors and transformations
  • Seamless integration with popular data storage systems and formats
  • Support for batch and streaming data processing
  • Extensible architecture for custom data processors and pipelines
  • Scalable and fault-tolerant processing using Apache Spark
  • Open to make use of python ML models ecosystem (sklearn, xgboost, pytorch...)

Getting Started

To get started with PyData I/O, please refer to the documentation for installation instructions, usage examples, and API references.

Issues and Support

If you encounter any issues or require support, please create a new issue on the GitHub repository.

Contribution

Contributions to Data I/O are welcome! To contribute, please follow the guidelines outlined in our contribution guide.

License

This project is licensed under the Apache License 2.0 license. See the LICENSE file for more information.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pydataio-1.0.2.tar.gz (97.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pydataio-1.0.2-py3-none-any.whl (17.2 kB view details)

Uploaded Python 3

File details

Details for the file pydataio-1.0.2.tar.gz.

File metadata

  • Download URL: pydataio-1.0.2.tar.gz
  • Upload date:
  • Size: 97.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for pydataio-1.0.2.tar.gz
Algorithm Hash digest
SHA256 a3eed2fdd73cc4557dcf56299e593528dcf17048038ac14a2969a58b84787961
MD5 cc3d15d7518f1b7899c1e36c750483de
BLAKE2b-256 9db1a9e6f3d63a0b0b94b270053be4f912c95b9ae13deb1e3bafd587c566cb0e

See more details on using hashes here.

Provenance

The following attestation bundles were made for pydataio-1.0.2.tar.gz:

Publisher: release-pypi.yml on AmadeusITGroup/PyDataIO

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file pydataio-1.0.2-py3-none-any.whl.

File metadata

  • Download URL: pydataio-1.0.2-py3-none-any.whl
  • Upload date:
  • Size: 17.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for pydataio-1.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 c9af4beee52924135f10acbf1f742a2e7ac0a9bc967ad2c58f0666ab663277e5
MD5 3a7ff0cf340ec49bd9b717b80b93d1a4
BLAKE2b-256 d029070e70a738d2cf3ae92229c247ebfe7b7a20fe31b2b0942f7ffcb09a5077

See more details on using hashes here.

Provenance

The following attestation bundles were made for pydataio-1.0.2-py3-none-any.whl:

Publisher: release-pypi.yml on AmadeusITGroup/PyDataIO

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page