Submission and monitoring of jobs and notebooks using the Yeedu API in Apache Airflow.
Project description
Airflow Yeedu Operator
Installation
To use the Yeedu Operator in your Airflow environment, install it using the following command:
pip3 install airflow-yeedu-operator
Overview
The YeeduOperator
acts as a bridge within Airflow, allowing you to effortlessly interact with both Yeedu jobs and notebooks. It streamlines the process of:
- Submitting Jobs and Notebooks: You can directly send jobs and notebooks to Yeedu using this operator, integrating them seamlessly into your Airflow workflows.
- Monitoring Progress: The operator keeps you informed about the status of your submitted Yeedu jobs and notebooks, providing you with real-time updates.
- Handling Completion: Upon completion, the operator gracefully handles the outcome (success or failure) for both jobs and notebooks.
- Managing Logs: All relevant logs associated with Yeedu jobs and notebooks are conveniently accessible within Airflow, keeping your workflow environment organized.
Prerequisites
Before using the YeeduOperator, ensure you have the following:
- Access to the Yeedu API: You'll need valid credentials to interact with the Yeedu API.
- Creating Airflw Connection (For Yeedu Credentials):
Steps to Create an Airflow Connection
-
Navigate to Connections:
- In the Airflow UI, click on the
Admin
tab at the top of the screen. - From the dropdown menu, select
Connections
.
- In the Airflow UI, click on the
-
Create a New Connection:
- On the Connections page, click the
+
(plus) button to add a new connection.
- On the Connections page, click the
-
Fill in the Connection Details:
- Conn Id: A unique identifier for your connection. Example:
my_yeeduu_connection
. - Conn Type: Select the appropriate type for your connection. For a Yeedu Notebook or job,
HTTP
is appropriate. - Host: Yeedu hostname.
- Login: The username for the connection.
- Password: The password for the connection.
- Conn Id: A unique identifier for your connection. Example:
-
Extra:
- Click on the
Extra
field to expand it. This field allows you to provide additional parameters in JSON format.
{ "YEEDU_AIRFLOW_VERIFY_SSL": "true", "YEEDU_SSL_CERT_FILE": "/path/to/cert/file" }
- Click on the
DAG: Yeedu Job Execution
-
Setting Up the DAG
Import the necessary modules and instantiate the DAG with required arguments and schedule interval.
from datetime import datetime, timedelta from airflow import DAG from yeedu.operators.yeedu import YeeduOperator # Define DAG arguments default_args = { 'owner': 'airflow', 'depends_on_past': False, 'start_date': datetime(2023, 1, 1), 'retries': 1, 'retry_delay': timedelta(minutes=5), } # Instantiate DAG dag = DAG( 'yeedu_job_execution', default_args=default_args, description='DAG to execute jobs using Yeedu API', schedule_interval='@once', catchup=False, )
-
Creating Yeedu Operator Tasks
Create tasks using
YeeduOperator
to perform various Yeedu API operations.submit_job_task = YeeduOperator( task_id='demo_dag', job_url='https://hostname/tenant/tenant_id/workspace/workspace_id/spark/notebook/notebook_id', # Replace with job or notebook url connection_id='yeedu_connection', # Replace with your connection id dag=dag, )
-
Execution
To execute this DAG:
- Ensure all required configurations (job_url, connection_id) are correctly provided in the task definition.
- Place the DAG file in the appropriate Airflow DAGs folder.
- Trigger the DAG manually or based on the defined schedule interval.
- Monitor the Airflow UI for task execution and logs.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file airflow-yeedu-operator-1.0.12.tar.gz
.
File metadata
- Download URL: airflow-yeedu-operator-1.0.12.tar.gz
- Upload date:
- Size: 16.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.8.10
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 68beb31309401f935a0f8677d3171657f8444664011b330708dc048512e89982 |
|
MD5 | 25824121745af1ceedc2618673d6ce99 |
|
BLAKE2b-256 | 6b380cde37b28ddfdbc2736a5ab712c7343935ab9d72f870cc2dc46be4108229 |
Provenance
File details
Details for the file airflow_yeedu_operator-1.0.12-py3-none-any.whl
.
File metadata
- Download URL: airflow_yeedu_operator-1.0.12-py3-none-any.whl
- Upload date:
- Size: 19.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.8.10
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 981b52538bc56e999592fe1eac68afa3f1c5b28636c19739b85592690787d3ad |
|
MD5 | 3c67a439ee939812e21dfdb6c33f07cc |
|
BLAKE2b-256 | 47e2ef39b3d133743a24c3217adb507e2b3d434756f2294c03e9b3662052defc |