No project description provided
Project description
dagster-stitch
This library provides a Dagster integration for Stitch, a managed batch data ingestion service similar to Fivetran and Airbyte.
Disclaimer
Please note this library is under active development and should be used cautiously! Currently it supports triggering replication jobs and materializing the resulting tables as Dagster assets that can be used in downstream jobs. Reconciliation and other features are not yet included but would be great additions.
Installation
To install the library, run:
$ pip install dagster-stitch
For development, it can be installed locally and tested with:
$ pip install -e .[lint,test]
$ pytest
Configuration
Setup
To use the library, you must configure a stitch
resource in your Dagster instance. The resource requires an api_key
to authenticate with Stitch. You can find or generate one in the Stitch UI under Account Settings > API Access Keys
.
You will also need to note your Stitch account ID and the ID of the data source you want to replicate to be used in the asset or operation configuration. These can be found by navigating to the data source in the Stitch UI and looking at the URL. For example, if the URL is https://app.stitchdata.com/client/12345/pipeline/v2/sources/67890/summary
, then the account ID is 12345
and the data source ID is 67890
.
Usage
Here's an example of how to instantiate an asset that will materialize the table_name
table from the data_source_id
data source:
from dagster import Definitions
from dagster_stitch import stitch_resource, build_stitch_assets
stitch_instance = stitch_resource.configured(
{
"api_key": "your_stitch_api_key",
"account_id": 12345,
}
)
stitch_assets = build_stitch_assets(
data_source_id=54321,
destination_tables=["data_source_name.table_name"]
)
definitions = Definitions(
assets=stitch_assets,
resources={"stitch": stitch_instance},
)
Note that you should include your data source name prefixed by a point in your destination_tables
and if the table do not correspond to a materialized asset, the materialization will raise an error.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file dagster-stitch-0.1.3.tar.gz
.
File metadata
- Download URL: dagster-stitch-0.1.3.tar.gz
- Upload date:
- Size: 18.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.10.9
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 5a17bb57ba51f4259beae96d55ad23c918c037364b6b8db9924d8627c2a72e36 |
|
MD5 | 6c8801a3b6c8c768fdf9628a6ebf2056 |
|
BLAKE2b-256 | 06941ee5120a506730bff85f040435d968708f62b4289cdf484f19e3af102c08 |
File details
Details for the file dagster_stitch-0.1.3-py3-none-any.whl
.
File metadata
- Download URL: dagster_stitch-0.1.3-py3-none-any.whl
- Upload date:
- Size: 16.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.10.9
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 9482ddb96a0826a96fc5ff16aebacde4c7c9404725dfa1c74f7f15f6d7d17a81 |
|
MD5 | ef8420c6a055a73d8411e2501879561b |
|
BLAKE2b-256 | 262b07cfcfa563d8cea527fb3f80420e46e7086778f239dcff7a860f7c77d4dc |