Skip to main content

Airflow extension for communicating with Wherobots Cloud

Project description

Airflow Providers for Wherobots

Airflow providers to bring Wherobots Cloud's spatial compute to your data workflows and ETLs.

Installation

If you use Poetry in your project, add the dependency with poetry add:

$ poetry add git+https://github.com/wherobots/airflow-providers-wherobots

Otherwise, just pip install it:

$ pip install git+https://github.com/wherobots/airflow-providers-wherobots

Usage

Create a connection

You first need to create a Connection in Airflow. This can be done from the UI, or from the command-line. The default Wherobots connection name is wherobots_default; if you use another name you must specify that name with the wherobots_conn_id parameter when initializing Wherobots operators.

The only required fields for the connection are:

  • the Wherobots API endpoint in the host field;
  • your Wherobots API key in the password field.
$ airflow connections add "wherobots_default" \
    --conn-type "wherobots" \
    --conn-host "api.cloud.wherobots.com" \
    --conn-password "$(< api.key)"

Execute a SQL query

The WherobotsSqlOperator allows you to run SQL queries on the Wherobots cloud, from which you can build your ETLs and data transformation workflows by querying, manipulating, and producing datasets with WherobotsDB.

Refer to the Wherobots Documentation and this guidance to learn how to read data, transform data, and write results in Spatial SQL with WherobotsDB.

Example

Below is an example Airflow DAG that executes a SQL query on Wherobots Cloud:

import datetime

from airflow import DAG
from airflow_providers_wherobots.operators.sql import WherobotsSqlOperator


with DAG(
    dag_id="example_wherobots_sql_dag",
    start_date=datetime.datetime.date(datetime.datetime.now()),
    schedule="@hourly",
    catchup=False
):
    # Create a `wherobots.test.airflow_example` table with 100 records
    # from the OMF `places_place` dataset.
    operator = WherobotsSqlOperator(
        task_id="execute_query",
        sql=f"""
        INSERT INTO wherobots.test.airflow_example
        SELECT id, geometry, confidence, geohash
        FROM wherobots_open_data.overture.places_place
        LIMIT 100
        """,
        return_last=False,
    )

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

airflow_providers_wherobots-0.1.2.tar.gz (6.9 kB view details)

Uploaded Source

Built Distribution

File details

Details for the file airflow_providers_wherobots-0.1.2.tar.gz.

File metadata

File hashes

Hashes for airflow_providers_wherobots-0.1.2.tar.gz
Algorithm Hash digest
SHA256 f8ff4953d383f71db7292e6c78493bf3002a6d1633f6f537f3cb574784b3fc58
MD5 a4795c7ebb2c48653b226b49719db9e2
BLAKE2b-256 54328ce0e67e4c1e78047774b6e8af97bd99f76920e32b66547c3409a76c618d

See more details on using hashes here.

File details

Details for the file airflow_providers_wherobots-0.1.2-py3-none-any.whl.

File metadata

File hashes

Hashes for airflow_providers_wherobots-0.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 3a13e6eb004c8316967d95f6346859c40e601993833db26937a6deac2ec7e5f5
MD5 79ac98237f151bde37bf80a0b754767d
BLAKE2b-256 ef583ae6d5aeada8a36fd2e79be74c567ab9925b65f3c73a5621dff97c9a02a2

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page