A scalable framework for data input and output operations in Spark applications
Project description
PyData I/O
Data I/O is an open source project that provides a flexible and scalable framework for data input and output operations in Spark applications. It offers a set of powerful tools and abstractions to simplify and streamline data processing pipelines.
Features
- Easy-to-use API for defining data processors and transformations
- Seamless integration with popular data storage systems and formats
- Support for batch and streaming data processing
- Extensible architecture for custom data processors and pipelines
- Scalable and fault-tolerant processing using Apache Spark
- Open to make use of python ML models ecosystem (sklearn, xgboost, pytorch...)
Getting Started
To get started with PyData I/O, please refer to the documentation for installation instructions, usage examples, and API references.
Issues and Support
If you encounter any issues or require support, please create a new issue on the GitHub repository.
Contribution
Contributions to Data I/O are welcome! To contribute, please follow the guidelines outlined in our contribution guide.
License
This project is licensed under the Apache License 2.0 license. See the LICENSE file for more information.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file pydataio-1.0.2.tar.gz.
File metadata
- Download URL: pydataio-1.0.2.tar.gz
- Upload date:
- Size: 97.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
a3eed2fdd73cc4557dcf56299e593528dcf17048038ac14a2969a58b84787961
|
|
| MD5 |
cc3d15d7518f1b7899c1e36c750483de
|
|
| BLAKE2b-256 |
9db1a9e6f3d63a0b0b94b270053be4f912c95b9ae13deb1e3bafd587c566cb0e
|
Provenance
The following attestation bundles were made for pydataio-1.0.2.tar.gz:
Publisher:
release-pypi.yml on AmadeusITGroup/PyDataIO
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
pydataio-1.0.2.tar.gz -
Subject digest:
a3eed2fdd73cc4557dcf56299e593528dcf17048038ac14a2969a58b84787961 - Sigstore transparency entry: 732390531
- Sigstore integration time:
-
Permalink:
AmadeusITGroup/PyDataIO@f86c11bc74ef391a620f1fbefe918017d6f67f78 -
Branch / Tag:
refs/tags/v1.0.2 - Owner: https://github.com/AmadeusITGroup
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release-pypi.yml@f86c11bc74ef391a620f1fbefe918017d6f67f78 -
Trigger Event:
push
-
Statement type:
File details
Details for the file pydataio-1.0.2-py3-none-any.whl.
File metadata
- Download URL: pydataio-1.0.2-py3-none-any.whl
- Upload date:
- Size: 17.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
c9af4beee52924135f10acbf1f742a2e7ac0a9bc967ad2c58f0666ab663277e5
|
|
| MD5 |
3a7ff0cf340ec49bd9b717b80b93d1a4
|
|
| BLAKE2b-256 |
d029070e70a738d2cf3ae92229c247ebfe7b7a20fe31b2b0942f7ffcb09a5077
|
Provenance
The following attestation bundles were made for pydataio-1.0.2-py3-none-any.whl:
Publisher:
release-pypi.yml on AmadeusITGroup/PyDataIO
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
pydataio-1.0.2-py3-none-any.whl -
Subject digest:
c9af4beee52924135f10acbf1f742a2e7ac0a9bc967ad2c58f0666ab663277e5 - Sigstore transparency entry: 732390533
- Sigstore integration time:
-
Permalink:
AmadeusITGroup/PyDataIO@f86c11bc74ef391a620f1fbefe918017d6f67f78 -
Branch / Tag:
refs/tags/v1.0.2 - Owner: https://github.com/AmadeusITGroup
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release-pypi.yml@f86c11bc74ef391a620f1fbefe918017d6f67f78 -
Trigger Event:
push
-
Statement type: