Tekton Compiler for Kubeflow Pipelines

These details have been verified by PyPI

Maintainers

ckadner fenglixa jinchihe tomcli yihongwang

Kubeflow Pipelines SDK for Tekton

The Kubeflow Pipelines SDK allows data scientists to define end-to-end machine learning and data pipelines. The output of the Kubeflow Pipelines SDK compiler is YAML for Argo.

The kfp-tekton SDK extends the Compiler and the Client of the Kubeflow Pipelines SDK to generate Tekton YAML and then upload and run the pipeline with the Kubeflow Pipelines engine backed by Tekton.

Table of Contents

- SDK Packages Overview
- Project Prerequisites
- Installation
- Compiling a Kubeflow Pipelines DSL Script
- Big data passing workspace configuration
- Running the Compiled Pipeline on a Tekton Cluster
- List of Available Features
- List of Helper Functions for Python Kubernetes Client
- Tested Pipelines
- Troubleshooting

SDK Packages Overview

The kfp-tekton SDK is an extension to the Kubeflow Pipelines SDK adding the TektonCompiler and the TektonClient:

kfp_tekton.compiler includes classes and methods for compiling pipeline Python DSL into a Tekton PipelineRun YAML spec. The methods in this package include, but are not limited to, the following:
- kfp_tekton.compiler.TektonCompiler.compile compiles your Python DSL code into a single static configuration (in YAML format) that the Kubeflow Pipelines service can process. The Kubeflow Pipelines service converts the static configuration into a set of Kubernetes resources for execution.
kfp_tekton.TektonClient contains the Python client libraries for the Kubeflow Pipelines API. Methods in this package include, but are not limited to, the following:
- kfp_tekton.TektonClient.upload_pipeline uploads a local file to create a new pipeline in Kubeflow Pipelines.
- kfp_tekton.TektonClient.create_experiment creates a pipeline experiment and returns an experiment object.
- kfp_tekton.TektonClient.run_pipeline runs a pipeline and returns a run object.
- kfp_tekton.TektonClient.create_run_from_pipeline_func compiles a pipeline function and submits it for execution on Kubeflow Pipelines.
- kfp_tekton.TektonClient.create_run_from_pipeline_package runs a local pipeline package on Kubeflow Pipelines.

Project Prerequisites

Python: 3.8 or later. With Python 3.12, make sure the SETUPTOOLS_USE_DISTUTILS environment variable is not set, because it is deprecated.
Tekton: v0.53.2 or later
Tekton CLI: 0.30.1
Kubeflow Pipelines: KFP with Tekton backend

Follow the instructions for installing project prerequisites and take note of some important caveats.

Installation

You can install the latest release of the kfp-tekton compiler from PyPI. We recommend creating a Python virtual environment first:

python3 -m venv .venv
source .venv/bin/activate

pip install kfp-tekton

Alternatively, you can install the latest version of the kfp-tekton compiler from source by cloning the repository https://github.com/kubeflow/kfp-tekton:

Clone the kfp-tekton repo:

git clone https://github.com/kubeflow/kfp-tekton.git
cd kfp-tekton

Set up a Python environment with Conda or a Python virtual environment:
```
python3 -m venv .venv
source .venv/bin/activate
```
Build the compiler:
```
pip install -e sdk/python
```
Run the compiler tests (optional):
```
pip install pytest
make test
```

Compiling a Kubeflow Pipelines DSL Script

The kfp-tekton Python package comes with the dsl-compile-tekton command line executable, which should be available in your terminal shell environment after installing the kfp-tekton Python package.

If you cloned the kfp-tekton project, you can find example pipelines in the samples folder or under the sdk/python/tests/compiler/testdata folder.

dsl-compile-tekton \
    --py sdk/python/tests/compiler/testdata/parallel_join.py \
    --output pipeline.yaml
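
The same compilation step can be scripted from Python. The following is a minimal sketch that only assembles the command line shown above and runs it when the executable is found on the PATH. The helper names build_compile_command and compile_pipeline are hypothetical and not part of kfp-tekton; only the --py and --output flags from this README are used.

```python
import shutil
import subprocess


def build_compile_command(py_path, output_path="pipeline.yaml"):
    """Return the dsl-compile-tekton argv for one DSL script (hypothetical helper)."""
    # Only the --py and --output flags shown in this README are used.
    return ["dsl-compile-tekton", "--py", py_path, "--output", output_path]


def compile_pipeline(py_path, output_path="pipeline.yaml"):
    """Run the compiler if it is installed; return True when it exits with code 0."""
    cmd = build_compile_command(py_path, output_path)
    if shutil.which(cmd[0]) is None:
        # kfp-tekton is not installed in this environment, so nothing is run.
        return False
    return subprocess.run(cmd, check=False).returncode == 0
```

Passing the arguments as a list avoids shell quoting issues when a script path contains spaces.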

Note: If the KFP DSL script contains an if __name__ == "__main__": block that calls the kfp_tekton.compiler.TektonCompiler.compile() function:

if __name__ == "__main__":
    from kfp_tekton.compiler import TektonCompiler
    TektonCompiler().compile(pipeline_func, "pipeline.yaml")

... then the pipeline can be compiled by running the DSL script with the python3 executable from a command line shell, producing a Tekton YAML file pipeline.yaml in the same directory:

python3 pipeline.py

Big data passing workspace configuration

When big data files are defined in KFP, Tekton creates a workspace to share these files among the tasks that run in the same pipeline. By default, the workspace is a ReadWriteMany PVC with 2Gi of storage that uses the kfp-csi-s3 storage class to push artifacts to S3. You can change this configuration with the environment variables below:

export DEFAULT_ACCESSMODES=ReadWriteMany
export DEFAULT_STORAGE_SIZE=2Gi
export DEFAULT_STORAGE_CLASS=kfp-csi-s3
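
To make the relationship between the defaults and the overrides concrete, here is a small sketch that resolves the effective settings from an environment mapping. The variable names and default values come from the text above; the function workspace_settings is hypothetical, and how the compiler reads these variables internally is not shown in this README.

```python
import os

# Defaults as documented above; the lookup itself is an illustration only.
WORKSPACE_DEFAULTS = {
    "DEFAULT_ACCESSMODES": "ReadWriteMany",
    "DEFAULT_STORAGE_SIZE": "2Gi",
    "DEFAULT_STORAGE_CLASS": "kfp-csi-s3",
}


def workspace_settings(environ=None):
    """Return the effective workspace settings for an environment mapping."""
    environ = os.environ if environ is None else environ
    return {
        name: environ.get(name, default)
        for name, default in WORKSPACE_DEFAULTS.items()
    }
```

Calling workspace_settings() with no argument reads the current process environment, so it reflects any export statements made before Python was started.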

To pass big data using cloud provider volumes, we recommend using the volume_based_data_passing_method for both the Tekton and Argo runtimes.

If you want to change the input and output copy artifact images, please modify the following environment variables:

export TEKTON_BASH_STEP_IMAGE=busybox  # input and output copy artifact images
export TEKTON_COPY_RESULTS_STEP_IMAGE=library/bash # output copy results images
export CONDITION_IMAGE_NAME=python:3.9.17-alpine3.18 # condition task default image name

Running the Compiled Pipeline on a Tekton Cluster

After compiling the sdk/python/tests/compiler/testdata/parallel_join.py DSL script in the step above, we need to deploy the generated Tekton YAML to the Kubeflow Pipelines engine.

You can run the pipeline directly using the pre-compiled file and the KFP-Tekton SDK. For more details, please look at the KFP-Tekton user guide SDK documentation.

import kfp_tekton
client = kfp_tekton.TektonClient()  # create a client instance before calling its methods
experiment = client.create_experiment(name=EXPERIMENT_NAME, namespace=KUBEFLOW_PROFILE_NAME)
run = client.run_pipeline(experiment.id, 'parallel-join-pipeline', 'pipeline.yaml')
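
The two calls can be wrapped in a small function so that the experiment name, namespace, job name, and package path are explicit parameters. This is a sketch, not part of the SDK: run_compiled_pipeline is a hypothetical helper, and it assumes the client object is a kfp_tekton.TektonClient instance exposing the create_experiment and run_pipeline methods listed in the SDK Packages Overview.

```python
def run_compiled_pipeline(client, experiment_name, namespace, job_name, package_path):
    """Create an experiment, then start a run from a compiled pipeline file.

    `client` is expected to be a kfp_tekton.TektonClient instance; any object
    with matching create_experiment and run_pipeline methods also works.
    """
    experiment = client.create_experiment(name=experiment_name, namespace=namespace)
    # The experiment id links the new run to the experiment created above.
    return client.run_pipeline(experiment.id, job_name, package_path)
```

Taking the client as a parameter keeps the helper independent of how the connection to Kubeflow Pipelines is configured.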

You can also deploy directly on a Tekton cluster with kubectl. The Tekton server will automatically start a pipeline run. You can then follow the logs using the tkn CLI.

kubectl apply -f pipeline.yaml

tkn pipelinerun logs --last --follow

Once the Tekton Pipeline is running, the logs should start streaming:

Waiting for logs to be available...

[gcs-download : main] With which he yoketh your rebellious necks Razeth your cities and subverts your towns And in a moment makes them desolate

[gcs-download-2 : main] I find thou art no less than fame hath bruited And more than may be gatherd by thy shape Let my presumption not provoke thy wrath

[echo : main] Text 1: With which he yoketh your rebellious necks Razeth your cities and subverts your towns And in a moment makes them desolate
[echo : main]
[echo : main] Text 2: I find thou art no less than fame hath bruited And more than may be gatherd by thy shape Let my presumption not provoke thy wrath
[echo : main]
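
When the streamed output is captured to a file or a list of lines, it can help to group the messages by task. The sketch below assumes only the "[task : step] message" prefix visible in the output above; group_logs_by_task is a hypothetical helper and not part of tkn or kfp-tekton.

```python
import re
from collections import defaultdict

# Matches the "[task : step] message" prefix seen in the output above.
LOG_LINE = re.compile(
    r"^\[(?P<task>[^\]:]+?)\s*:\s*(?P<step>[^\]]+?)\]\s?(?P<message>.*)$"
)


def group_logs_by_task(lines):
    """Group log messages by task name, skipping unprefixed and empty lines."""
    grouped = defaultdict(list)
    for line in lines:
        match = LOG_LINE.match(line)
        if match and match.group("message"):
            grouped[match.group("task")].append(match.group("message"))
    return dict(grouped)
```

Lines without the bracketed prefix, such as the initial "Waiting for logs to be available..." message, are ignored.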

List of Available Features

To understand how each feature is implemented and its current status, please visit the FEATURES doc.

List of Helper Functions for Python Kubernetes Client

KFP Tekton provides a list of common Kubernetes client helper functions to simplify the process of creating certain Kubernetes resources. Please visit the K8S_CLIENT_HELPER doc for more details.

Tested Pipelines

We are testing the compiler on more than 80 pipelines found in the Kubeflow Pipelines repository, specifically the pipelines in the KFP compiler testdata folder, the KFP core samples, and the samples contributed by third parties.

A report card of Kubeflow Pipelines samples that are currently supported by the kfp-tekton compiler can be found here. If you work on a PR that enables another of the missing features, please ensure that your code changes increase the number of successfully compiled KFP pipeline samples.

Troubleshooting

- When you encounter ServiceAccount-related permission issues, refer to the "Service Account and RBAC" doc.
- If you run into the error bad interpreter: No such file or directory when trying to use Python's venv, remove the .venv directory and run virtualenv .venv to create a new virtual environment.
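
A bad interpreter error generally means that a script's first line (its shebang) names an interpreter path that does not exist, which commonly happens to the scripts inside a virtual environment after the project directory is moved or renamed. The following sketch checks one script for that condition; has_stale_interpreter is a hypothetical diagnostic helper, not part of kfp-tekton or venv.

```python
import os


def has_stale_interpreter(script_path):
    """Return True if the shebang names an absolute interpreter path that is missing."""
    with open(script_path, "rb") as handle:
        first_line = handle.readline().decode("utf-8", errors="replace").strip()
    if not first_line.startswith("#!"):
        return False
    parts = first_line[2:].split()
    if not parts:
        return False
    interpreter = parts[0]
    # Relative or existing interpreter paths are not reported as stale.
    return os.path.isabs(interpreter) and not os.path.exists(interpreter)
```

For example, has_stale_interpreter(".venv/bin/pip") returning True suggests that the virtual environment should be recreated as described above.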

Project details

Release history

Version	Release date	Notes
1.9.3	Feb 27, 2024	This version
1.9.2	Dec 26, 2023
1.9.1	Dec 21, 2023
1.9.0	Nov 27, 2023
1.8.1	Oct 17, 2023
1.8.0	Sep 11, 2023
1.7.4	Sep 1, 2023
1.7.3	Aug 28, 2023
1.7.2	Aug 16, 2023
1.7.1	Jun 27, 2023
1.7.0	May 15, 2023
1.6.9	Sep 1, 2023
1.6.8	Aug 28, 2023
1.6.7	Aug 25, 2023
1.6.6	May 8, 2023
1.6.5	Apr 11, 2023
1.6.4	Mar 22, 2023
1.6.3	Mar 15, 2023
1.6.2	Feb 20, 2023
1.6.1	Feb 20, 2023
1.6.0	Feb 16, 2023
1.5.10	Feb 27, 2024
1.5.9	Sep 12, 2023
1.5.8	Sep 1, 2023
1.5.7	Aug 28, 2023
1.5.6	Aug 16, 2023
1.5.5	Aug 7, 2023
1.5.4	May 8, 2023
1.5.3	Mar 20, 2023
1.5.2	Mar 13, 2023
1.5.1	Jan 27, 2023
1.5.0	Jan 10, 2023
1.4.2	Jan 10, 2023
1.4.1	Dec 6, 2022
1.4.0	Oct 31, 2022
1.3.1	Aug 22, 2022
1.3.0	Jul 29, 2022
1.2.3	Jul 12, 2022
1.2.2	Jun 17, 2022
1.2.1	May 6, 2022
1.2.0	Mar 10, 2022
1.1.1	Feb 10, 2022
1.1.0	Jan 12, 2022
1.0.1	Oct 9, 2021
1.0.0	Sep 1, 2021
0.9.0	Jul 30, 2021
0.8.1	Jun 15, 2021
0.8.0	May 12, 2021
0.8.0rc0	Apr 9, 2021	pre-release
0.7.0	Mar 2, 2021
0.6.0	Jan 29, 2021
0.6.0rc0	Jan 27, 2021	pre-release
0.5.1	Jan 29, 2021
0.5.1rc1	Jan 27, 2021	pre-release
0.5.1rc0	Jan 27, 2021	pre-release
0.5.0	Dec 12, 2020
0.4.0	Nov 11, 2020
0.3.0	Sep 11, 2020
0.3.0rc0	Sep 1, 2020	pre-release
0.2.0	Jul 8, 2020
0.1.0	Jun 20, 2020
0.0.1	Apr 20, 2020

Download files

Source Distribution

kfp-tekton-1.9.3.tar.gz (81.1 kB)

Uploaded Feb 27, 2024 Source

File details

Details for the file kfp-tekton-1.9.3.tar.gz.

File metadata

Download URL: kfp-tekton-1.9.3.tar.gz
Upload date: Feb 27, 2024
Size: 81.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.9.7

File hashes

Hashes for kfp-tekton-1.9.3.tar.gz
Algorithm	Hash digest
SHA256	`1a0e30520e348d40340da94d35ffb30ae626358302af036c163abda9939d96b9`
MD5	`1f4ed3cfdd95063a645f120fea4cd801`
BLAKE2b-256	`1c2e952f7a95c44f1a3a514277ffa488c0c951cbf32f3b851c4cf78c145b4b07`
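
A downloaded archive can be checked against the published SHA256 digest with Python's standard hashlib module. This is a minimal sketch: the digest is copied from the table above, while the function names sha256_of and matches_published_hash are hypothetical.

```python
import hashlib

# SHA256 digest published for kfp-tekton-1.9.3.tar.gz in the table above.
EXPECTED_SHA256 = "1a0e30520e348d40340da94d35ffb30ae626358302af036c163abda9939d96b9"


def sha256_of(path, chunk_size=1 << 20):
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected=EXPECTED_SHA256):
    """Return True when the file's digest equals the expected digest."""
    return sha256_of(path) == expected.lower()
```

Calling matches_published_hash("kfp-tekton-1.9.3.tar.gz") on the downloaded file should return True only if the file is byte-for-byte identical to the uploaded source distribution.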
