Skip to main content

Running Unicore Jobs from airflow DAGs.

Project description

Generic badge

This project integrates UNICORE and Apache Airflow. UNICORE is a software suite that, among other functions, provides seamless access to high-performance compute and data resources. Airflow is a platform to programmatically author, schedule and monitor workflows.

In the current state, this projects provides a set of airflow operators, which can be used as part of airflow workflows to submit jobs to Unicore. The UnicoreExecutor only offers experimental support for airflow 3 so far. Further support is currently being worked on.

Using the Unicore Operators

There are multiple Unicore operators provided by this package. The most versatile one is the UnicoreGenericOperator, which supports a lot of job parameters. All other operators are intended to offer a slightly less complex constructor, and therefore simpler usage, but all generic parameters are still available to be used.

All operators support all possible parameters of the Unicore job description. Here is an excerpt containing some commonly used parameters:

parameter name

type

default

description

application_name

str

None

Application Name

application_version

str

None

Application Version

executable

str

None

Command line executable

arguments

List(str)

None

Command line arguments

environment

Map(str,str)

None

environment arguments

parameters

Map

None

Application Parameters

project

str

None

Accounting Project

imports

List(imports)

None

Stage-in/data import - see Unicore docs

exports

List(exports)

None

Stage-out/data export - see Unicore docs

For imports and exports go here for details.

The UnicoreGenericOperator supports the following additional parameters:

parameter name

type

default

description

name

str

None

name for the airflow task and the Unicore job

xcom_output_files

List(str)

[“stdout”,”stderr”]

list of files of which the content should be put into xcoms

base_url

str

configured in airflow connections or None

The base URL of the UNICOREX server to be used for the Unicore client

credential

pyunicore credential

configured in airflow connections or None

A Unicore Credential to be used for the Unicore client

credential_username

str

configured in airflow connections or None

Username for the Unicore client credentials

credential_password

str

configured in airflow connections or None

Password the the Unicore client credentials

credential_token

str

configured in airflow connections or None

An OIDC token to be used by the Unicore client

The UnicoreScriptOperator offers a way to more easily submit a script as a job, where the script content can be provided as a string.

parameter name

type

default

description

script_content

str

None

The content of the script file

The UnicoreBSSOperator offers a way to directly submit batch-scripts from their content-strings.

parameter name

type

default

description

bss_file_content

str

None

The content of the batch script file

The UnicoreExecutableOperator offers a reduced constructor that only requires an executable.

parameter name

type

default

description

executable

str

None

The executable to run for this job

xcom_output_files

List(str)

[“stdout”,”stderr”]

list of files of which the content should be put into xcoms

The UnicoreDateOperator is more of a testing operator, since it will only run the date executable.

Behaviour on Errors and Success

The Unicore Operators do not do a lot of error and exception handling, and mostly just forward any problems to be handled by airflow. All of the Unicore logic is handled by the pyunicore library.

While some validation of the resulting Unicore job description is done automatically, it may still be possible to build an invalid job description with the operators. This may lead to a submission failure with Unicore. In this case, an exception is thrown to be handled by airflow.

For a successful job submission, the job exit code is returned as the task return value, so that airflow can handle non-zero exit codes. All operators will also append the content of the job-log-file from Unicore to the airflow task log. Also, some job results and values will be uploaded via airflow-x-coms as well:

xcom name

description

Unicore Job ID

the Unicore ID for the job

Unicore Job

the TSI script that was submitted by Unicore

BSS_SUBMIT

the bss_script submitted by Unicore

status_message

the status message for the Unicore job

log

the Unicore job log

workdir_content

content of the job workdir upon completion

[xcom_output_files]

content of each file in their own xcom, by default stdout and stderr

Example DAGs

There are some example DAGs in this repository under project-dir/dags.

  • unicore-test-1.py just shows basic date and executable usage.

  • unicore-test-2.py has some basic examples for the generic operator.

  • unicore-test-3.py also includes script-operator examples.

  • unicore-test-4.py has some examples with more arguments.

  • unicore-test-bss.py shows how bss submission can be done (very simple example).

  • unicore-test-credentials.py demonstrates that not only the credentials from the airflow connections backend can be used, but they can also be provided in the constructor of the operator.

  • unicore-test-import-export.py gives short examples for the imports and exports usage.

Setup testing env

Ensure a current version of docker is installed.

Run python3 -m build to build the python package.

Run the testing-env/build-image.sh script to create the customized airflow image, which will contain the newly build python package.

Run testing-env/run-testing-env.sh init to initialize the airflow containers, database etc. This only needs to be done once.

Run testing-env/run-testing-env.sh up to start the local airflow and Unicore deployment. Airflow will be available on port 8080, Unicore on port 8081.

The run-testing-env.sh script supports the commands up, down, start, stop, ps and init for matching docker compose functions.

Install package via pip

pip install airflow-unicore-integration

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

airflow_unicore_integration-0.1.10.tar.gz (19.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

airflow_unicore_integration-0.1.10-py3-none-any.whl (19.7 kB view details)

Uploaded Python 3

File details

Details for the file airflow_unicore_integration-0.1.10.tar.gz.

File metadata

File hashes

Hashes for airflow_unicore_integration-0.1.10.tar.gz
Algorithm Hash digest
SHA256 df361fc94e3a3579a0ead9833b889a13cd1641e3e713f1a61281e68b3c4f0421
MD5 26056b5d98d3db0fcaf4ddf8617e41db
BLAKE2b-256 cb729caea4768e29fcd3d5c6cbc0c039b230401044daea07541f298d6c8e1cfa

See more details on using hashes here.

Provenance

The following attestation bundles were made for airflow_unicore_integration-0.1.10.tar.gz:

Publisher: publish-to-pypi.yml on UNICORE-EU/airflow-unicore-integration

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file airflow_unicore_integration-0.1.10-py3-none-any.whl.

File metadata

File hashes

Hashes for airflow_unicore_integration-0.1.10-py3-none-any.whl
Algorithm Hash digest
SHA256 3ff603b4837c0c5e44c8573bbcf7ced1d74109e9d922aac5af2282c5a808daa1
MD5 a9a44c627e8aa1d9277866334679747e
BLAKE2b-256 41842f27a7d38e0aa64b230b48a67d29b2690a43f176a2acb7de7aa1412aeeef

See more details on using hashes here.

Provenance

The following attestation bundles were made for airflow_unicore_integration-0.1.10-py3-none-any.whl:

Publisher: publish-to-pypi.yml on UNICORE-EU/airflow-unicore-integration

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page