Skip to main content

No project description provided

Project description

dagster-stitch

PyPI PyPI - Python Version PyPI - License Code Style - Black

This library provides a Dagster integration for Stitch, a managed batch data ingestion service similar to Fivetran and Airbyte.

Disclaimer

Please note this library is under active development and should be used cautiously! Currently it supports triggering replication jobs and materializing the resulting tables as Dagster assets that can be used in downstream jobs. Reconciliation and other features are not yet included but would be great additions.

Installation

To install the library, run:

$ pip install dagster-stitch

For development, it can be installed locally and tested with:

$ pip install -e .[lint,test]
$ pytest

Configuration

Setup

To use the library, you must configure a stitch resource in your Dagster instance. The resource requires an api_key to authenticate with Stitch. You can find or generate one in the Stitch UI under Account Settings > API Access Keys.

You will also need to note your Stitch account ID and the ID of the data source you want to replicate to be used in the asset or operation configuration. These can be found by navigating to the data source in the Stitch UI and looking at the URL. For example, if the URL is https://app.stitchdata.com/client/12345/pipeline/v2/sources/67890/summary, then the account ID is 12345 and the data source ID is 67890.

Usage

Here's an example of how to instantiate an asset that will materialize the table_name table from the data_source_id data source:

from dagster import Definitions
from dagster_stitch import stitch_resource, build_stitch_assets

stitch_instance = stitch_resource.configured(
    {
        "api_key": "your_stitch_api_key",
        "account_id": 12345,
    }
)
stitch_assets = build_stitch_assets(
    data_source_id=54321,
    destination_tables=["data_source_name.table_name"]
)

definitions = Definitions(
    assets=stitch_assets,
    resources={"stitch": stitch_instance},
)

Note that you should include your data source name prefixed by a point in your destination_tables and if the table do not correspond to a materialized asset, the materialization will raise an error.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dagster-stitch-0.1.3.tar.gz (18.1 kB view details)

Uploaded Source

Built Distribution

dagster_stitch-0.1.3-py3-none-any.whl (16.7 kB view details)

Uploaded Python 3

File details

Details for the file dagster-stitch-0.1.3.tar.gz.

File metadata

  • Download URL: dagster-stitch-0.1.3.tar.gz
  • Upload date:
  • Size: 18.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.9

File hashes

Hashes for dagster-stitch-0.1.3.tar.gz
Algorithm Hash digest
SHA256 5a17bb57ba51f4259beae96d55ad23c918c037364b6b8db9924d8627c2a72e36
MD5 6c8801a3b6c8c768fdf9628a6ebf2056
BLAKE2b-256 06941ee5120a506730bff85f040435d968708f62b4289cdf484f19e3af102c08

See more details on using hashes here.

File details

Details for the file dagster_stitch-0.1.3-py3-none-any.whl.

File metadata

File hashes

Hashes for dagster_stitch-0.1.3-py3-none-any.whl
Algorithm Hash digest
SHA256 9482ddb96a0826a96fc5ff16aebacde4c7c9404725dfa1c74f7f15f6d7d17a81
MD5 ef8420c6a055a73d8411e2501879561b
BLAKE2b-256 262b07cfcfa563d8cea527fb3f80420e46e7086778f239dcff7a860f7c77d4dc

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page