Prefect integrations for orchestrating and monitoring apache spark jobs on kubernetes using spark-on-k8s-operator.
Project description
prefect-spark-on-k8s-operator
Visit the full docs here to see additional examples and the API reference.
Prefect integrations for orchestrating and monitoring apache spark jobs on kubernetes using spark-on-k8s-operator.
Welcome!
prefect-spark-on-k8s-operator
is a collection of Prefect flows enabling orchestration, observation and management of SparkApplication
custom kubernetes resources defined according to spark-on-k8s-operator CRD v1Beta2 API Spec.
Jump to examples.
Resources
For more tips on how to use tasks and flows in a Collection, check out Using Collections!
Installation
You need to configure the kubernetes credentials as per prefect-kubernetes
documentation.
Install prefect-spark-on-k8s-operator
with pip
:
pip install prefect-spark-on-k8s-operator
Requires an installation of Python 3.7+.
We recommend using a Python virtual environment manager such as pipenv, conda or virtualenv.
These flows are designed to work with Prefect 2.0. For more information about how to use Prefect, please refer to the Prefect documentation.
Example Usage
Specify and run a SparkApplication from a yaml file
import asyncio
from prefect_kubernetes.credentials import KubernetesCredentials
from prefect_spark_on_k8s_operator import (
SparkApplication,
run_spark_application, # this is a flow
)
app = SparkApplication.from_yaml_file(
credentials=KubernetesCredentials.load("k8s-creds"),
manifest_path="path/to/spark_application.yaml",
)
if __name__ == "__main__":
# run the flow
asyncio.run(run_spark_application(app))
Feedback
If you encounter any bugs while using prefect-spark-on-k8s-operator
, feel free to open an issue in the prefect-spark-on-k8s-operator repository.
If you have any questions or issues while using prefect-spark-on-k8s-operator
, you can find help in either the Prefect Discourse forum or the Prefect Slack community.
Feel free to star or watch prefect-spark-on-k8s-operator
for updates too!
Contributing
If you'd like to help contribute to fix an issue or add a feature to prefect-spark-on-k8s-operator
, please propose changes through a pull request from a fork of the repository.
Here are the steps:
- Fork the repository
- Clone the forked repository
- Install the repository and its dependencies:
pip install -e ".[dev]"
- Make desired changes
- Add tests
- Insert an entry to CHANGELOG.md
- Install
pre-commit
to perform quality checks prior to commit:
pre-commit install
git commit
,git push
, and create a pull request
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for prefect-spark-on-k8s-operator-0.1.2.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | 35e27b38ba9a469bd28eaf6d8a00b8ee728c8889000dfff542e30bce46e47dd4 |
|
MD5 | 044b19659b7159bacb53f0618c85051e |
|
BLAKE2b-256 | 167aee4ecaac15489161b83794877d313c8c3fd2afc481aab36611538c11785c |
Hashes for prefect_spark_on_k8s_operator-0.1.2-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 38f7055a6b17c3f3738b4134d2c21af6c7396e2bc1c2aef1951889548c4de3ce |
|
MD5 | 3ec841a6df9153675ac4a36bfe60caa5 |
|
BLAKE2b-256 | e02be0718ace0cd465a58240614ea0dcf35a9391a2cccfc9cb030fb72fc53450 |