Airflow extension for communicating with Wherobots Cloud
Project description
Airflow Providers for Wherobots
Airflow providers to bring Wherobots Cloud's spatial compute to your data workflows and ETLs.
Installation
If you use Poetry in your project, add the
dependency with poetry add
:
$ poetry add airflow-providers-wherobots
Otherwise, just pip install
it:
$ pip install airflow-providers-wherobots
Usage
Create a connection
You first need to create a Connection in Airflow. This can be done from
the UI, or from the command-line. The default Wherobots connection name
is wherobots_default
; if you use another name you must specify that
name with the wherobots_conn_id
parameter when initializing Wherobots
operators.
The only required fields for the connection are:
- the Wherobots API endpoint in the
host
field; - your Wherobots API key in the
password
field.
$ airflow connections add "wherobots_default" \
--conn-type "generic" \
--conn-host "api.cloud.wherobots.com" \
--conn-password "$(< api.key)"
Execute a SQL query
The WherobotsSqlOperator
allows you to run SQL queries on the
Wherobots cloud, from which you can build your ETLs and data
transformation workflows by querying, manipulating, and producing
datasets with WherobotsDB.
Refer to the Wherobots Documentation and this guidance to learn how to read data, transform data, and write results in Spatial SQL with WherobotsDB.
Refer to the Wherobots Apache Airflow Provider Documentation to get more detailed guidance about how to use the Wherobots Apache Airflow Provider.
Example
Below is an example Airflow DAG that executes a SQL query on Wherobots Cloud:
import datetime
from airflow import DAG
from airflow_providers_wherobots.operators.sql import WherobotsSqlOperator
with DAG(
dag_id="example_wherobots_sql_dag",
start_date=datetime.datetime.now(),
schedule="@hourly",
catchup=False
):
# Create a `wherobots.test.airflow_example` table with 100 records
# from the OMF `places_place` dataset.
operator = WherobotsSqlOperator(
task_id="execute_query",
sql=f"""
INSERT INTO wherobots.test.airflow_example
SELECT id, geometry, confidence, geohash
FROM wherobots_open_data.overture.places_place
LIMIT 100
""",
return_last=False,
)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file airflow_providers_wherobots-0.1.5.tar.gz
.
File metadata
- Download URL: airflow_providers_wherobots-0.1.5.tar.gz
- Upload date:
- Size: 7.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/5.1.0 CPython/3.12.4
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | d5b135ef2489b3479c3d1f870b5ebbe423e5307cab4284d834b91bc53c4d4225 |
|
MD5 | ece0f6e2314c4d9faa430b93d941387f |
|
BLAKE2b-256 | a557e58be65a2dd8ca63d4edf47d12649bc12cd6ea6154913babfc88ff07fca0 |
File details
Details for the file airflow_providers_wherobots-0.1.5-py3-none-any.whl
.
File metadata
- Download URL: airflow_providers_wherobots-0.1.5-py3-none-any.whl
- Upload date:
- Size: 9.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/5.1.0 CPython/3.12.4
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 0686da7d29b85222c891a6d2dba357465992c0a911cb7da898641cfe18c1128a |
|
MD5 | 3e92b8bfe2ae14ec40edcfd1c9d3aeb2 |
|
BLAKE2b-256 | a5bc40ed9db02e54f37bf1f54f56dd6f8285807f87bd79cbc03906aeab3b0513 |