SDK for Watcher framework

Watcher SDK

Quick Start

Installation

The Watcher SDK is published on PyPI as etl-watcher-sdk. Install it with your preferred package manager, for example: pip install etl-watcher-sdk
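
After installing, it can help to confirm that the package is visible to the interpreter that will run your pipelines. The following is a minimal sketch using only the standard library; the helper name `installed_version` is illustrative and not part of the SDK.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(distribution: str = "etl-watcher-sdk") -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return version(distribution)
    except PackageNotFoundError:
        # The distribution is not installed in this environment.
        return None


if __name__ == "__main__":
    found = installed_version()
    print(found if found else "etl-watcher-sdk is not installed in this environment")
```

A `None` result usually means the package was installed into a different virtual environment than the one currently active.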

Store Pipeline and Address Lineage Configuration

Store your pipeline and address lineage configuration in a Python file.

from watcher import Pipeline, PipelineConfig, AddressLineage, Address

MY_ETL_PIPELINE_CONFIG = PipelineConfig(
    pipeline=Pipeline(
        name="my-etl-pipeline",
        pipeline_type_name="extraction",
    ),
    default_watermark="2024-01-01",
    address_lineage=AddressLineage(
        source_addresses=[
            Address(
                name="source_db.source_schema.source_table",
                address_type_name="postgres",
                address_type_group_name="database",
            )
        ],
        target_addresses=[
            Address(
                name="target_db.target_schema.target_table",
                address_type_name="snowflake",
                address_type_group_name="warehouse",
            )
        ],
    ),
)
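
The address names in this example follow a dotted database.schema.table pattern. If you keep to that convention, a small helper can catch malformed names before they reach a config. This sketch uses only the standard library; `split_address_name`, `AddressParts`, and the three-part rule are assumptions made for illustration, not something the SDK is known to require.

```python
from typing import NamedTuple


class AddressParts(NamedTuple):
    database: str
    schema: str
    table: str


def split_address_name(name: str) -> AddressParts:
    """Split 'database.schema.table' into its parts, rejecting malformed names."""
    parts = name.split(".")
    # Require exactly three non-empty segments.
    if len(parts) != 3 or not all(parts):
        raise ValueError(f"expected 'database.schema.table', got {name!r}")
    return AddressParts(*parts)
```

Calling `split_address_name("source_db.source_schema.source_table")` yields the three segments as named fields, while a name such as "source_db.source_table" raises `ValueError`.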

Sync Pipeline and Address Lineage Configuration

Sync the configuration to the Watcher framework so that your code remains the source of truth for the pipeline and its address lineage.

from watcher import Watcher

# MY_ETL_PIPELINE_CONFIG is the PipelineConfig defined in the previous step
watcher = Watcher("https://api.watcher.example.com")
synced_config = watcher.sync_pipeline_config(MY_ETL_PIPELINE_CONFIG)
print("Pipeline synced!")
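
The URL above is hard-coded for brevity. One way to vary it per environment is to read it from an environment variable. This is a sketch only: the variable name `WATCHER_URL` and the helper `watcher_url` are hypothetical and are not defined by the SDK.

```python
import os
from typing import Mapping, Optional

DEFAULT_WATCHER_URL = "https://api.watcher.example.com"


def watcher_url(env: Optional[Mapping[str, str]] = None) -> str:
    """Return the Watcher URL from WATCHER_URL (a hypothetical name), else the default."""
    # Accept an explicit mapping so the lookup is easy to test.
    env = os.environ if env is None else env
    return env.get("WATCHER_URL", DEFAULT_WATCHER_URL)
```

The result can then be passed to the constructor shown above, as in `Watcher(watcher_url())`.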

Track Pipeline Execution

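Wrap your ETL function with the track_pipeline_execution decorator and return an ETLResult that describes the run.

from watcher import Watcher, ETLResult

# MY_ETL_PIPELINE_CONFIG is the PipelineConfig defined earlier
watcher = Watcher("https://api.watcher.example.com")

# The synced config carries the pipeline id and active flag used below
synced_config = watcher.sync_pipeline_config(MY_ETL_PIPELINE_CONFIG)

@watcher.track_pipeline_execution(
    pipeline_id=synced_config.pipeline.id,
    active=synced_config.pipeline.active,
)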
def etl_pipeline():
    print("Starting ETL pipeline")
    
    # Your ETL work here
    # Set completed_successfully=True/False based on your logic
    
    return ETLResult(
        completed_successfully=True,
        inserts=100,
        total_rows=100,
        execution_metadata={"partition": "2025-01-01"},
    )

etl_pipeline()
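
The comment about setting completed_successfully from your own logic can be made concrete. The sketch below derives the four fields used in the example above from a load step. `load_rows` and `write_row` are hypothetical stand-ins for real ETL work, and the policy of continuing after a failed row is an assumption, not SDK behavior.

```python
from typing import Any, Callable, Dict, Iterable


def load_rows(
    rows: Iterable[Any],
    write_row: Callable[[Any], Any],
    partition: str,
) -> Dict[str, Any]:
    """Write each row and summarize the outcome as ETLResult-style fields."""
    total_rows = 0
    inserts = 0
    completed_successfully = True
    for row in rows:
        total_rows += 1
        try:
            write_row(row)
            inserts += 1
        except Exception:
            # A failed write marks the run unsuccessful; remaining rows are still attempted.
            completed_successfully = False
    return {
        "completed_successfully": completed_successfully,
        "inserts": inserts,
        "total_rows": total_rows,
        "execution_metadata": {"partition": partition},
    }
```

Inside the decorated function, the returned dict could be unpacked as `ETLResult(**fields)`, since its keys match the keyword arguments used in the example above.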

Contributing

I welcome contributions to the Watcher SDK! Please see the Contributing Guidelines for details on how to get started, the development process, and how to submit pull requests.

Quick Start for Contributors

  1. Fork the repository
  2. Create a feature branch
  3. Make your changes
  4. Spin up the Watcher framework for manual integration testing
  5. Run tests in the repo with make test
  6. Submit a pull request

For detailed information about the coding standards, testing requirements, and contribution process, please refer to the Contributing Guidelines.
