The Bigeye Airflow library supports Airflow versions >=2.2.0, <2.8.0 and offers custom operators for interacting with your Bigeye workspace.
Project description
Bigeye Airflow Operators for Airflow Versions 2.x
Operators
Create Metric Operator (bigeye_airflow.operators.create_metric_operator)
The CreateMetricOperator creates metrics from a list of metric configurations provided to the operator. This operator will fill in reasonable defaults like setting thresholds. It authenticates through an Airflow connection ID and offers the option to run the metrics after those metrics have been created. Please review the link below to understand the structure of the configurations.
Create or Update Metric Swagger
Parameters
- connection_id: str - The Airflow connection ID used to store the required Bigeye credential.
- warehouse_id: int - The Bigeye source/warehouse id to which the metric configurations will be deployed.
- configuration: List[dict] - A list of metric configurations conforming to the following schema.
  - schema_name: str
  - table_name: str
  - column_name: str
  - metric_template_id: uuid.UUID
  - metric_name: str
  - description: str
  - notifications: List[str]
  - thresholds: List[dict]
  - filters: List[str]
  - group_by: List[str]
  - user_defined_metric_name: str
  - metric_type: SimpleMetricCategory
  - default_check_frequency_hours: int
  - update_schedule: str
  - delay_at_update: str
  - timezone: str
  - should_backfill: bool
  - lookback_type: str
  - lookback_days: int
  - window_size: str
  - _window_size_seconds
- run_after_upsert: bool - If true, the metrics are run after they are created. Defaults to False.
- workspace_id: Optional[int] - The ID of the workspace where metrics should be created. If only one workspace is configured, it is used by default; otherwise this parameter is required.
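As a rough illustration of the parameters above, the sketch below assembles one metric configuration and the keyword arguments that would be passed to the operator. The connection ID, warehouse ID, schema, table, and column names are placeholders, and the `metric_name` value is an assumption that should be checked against the Swagger reference. The operator call itself is left as a comment because it needs an Airflow environment with the library installed.

```python
from typing import Any, Dict, List

# One metric configuration using a subset of the schema keys listed above.
# Every value here is an illustrative placeholder.
metric_configuration: List[Dict[str, Any]] = [
    {
        "schema_name": "analytics",        # hypothetical schema
        "table_name": "orders",            # hypothetical table
        "column_name": "order_total",      # hypothetical column
        "metric_name": "PERCENT_NULL",     # assumed metric name; verify before use
        "default_check_frequency_hours": 6,
    }
]

create_metric_kwargs: Dict[str, Any] = {
    "task_id": "create_order_metrics",     # standard Airflow task argument
    "connection_id": "bigeye_connection",  # hypothetical Airflow connection ID
    "warehouse_id": 123,                   # hypothetical Bigeye warehouse ID
    "configuration": metric_configuration,
    "run_after_upsert": True,              # run the metrics once they exist
}

# Inside a DAG file, assuming the class lives in the module named in the heading:
# from bigeye_airflow.operators.create_metric_operator import CreateMetricOperator
# create_metrics = CreateMetricOperator(**create_metric_kwargs, dag=dag)
```

Keeping the configuration as plain data makes it easy to load from a file or generate per table before handing it to the operator.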
Run Metrics Operator (bigeye_airflow.operators.run_metrics_operator)
The RunMetricsOperator will run metrics in Bigeye based on the following:
- All metrics for a given table, by providing warehouse ID, schema name and table name.
- All metrics for a given collection, by providing the collection ID.
- Any and all metrics, given a list of metric IDs.
Currently, if a list of metric IDs is provided, those metrics are run instead of any metrics selected through warehouse_id, schema_name, table_name, and collection_id.
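The selection rule above can be sketched as a small pure function. This is not the library's code: `select_run_target` is a hypothetical helper, and only the precedence of metric IDs is documented here. The order in which a collection and a table are considered is an assumption made for the sake of the example.

```python
from typing import List, Optional


def select_run_target(
    metric_ids: Optional[List[int]] = None,
    collection_id: Optional[int] = None,
    warehouse_id: Optional[int] = None,
    schema_name: Optional[str] = None,
    table_name: Optional[str] = None,
) -> str:
    """Return which selector would apply: 'metric_ids', 'collection', or 'table'."""
    if metric_ids:
        # Documented behaviour: explicit metric IDs win over every other selector.
        return "metric_ids"
    if collection_id is not None:
        # Assumption: a collection is considered before a table.
        return "collection"
    if warehouse_id is not None and schema_name and table_name:
        # A table is identified by warehouse ID, schema name, and table name together.
        return "table"
    raise ValueError(
        "Provide metric_ids, collection_id, or warehouse_id with schema_name and table_name"
    )
```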
Parameters
- connection_id: str - The Airflow connection ID used to store the required Bigeye credential.
- warehouse_id: int - The Bigeye source/warehouse id for which metrics will be run.
- schema_name: str - The schema name for which metrics will be run.
- table_name: str - The table name for which metrics will be run.
- collection_id: int - The ID of the collection where the operator will run the metrics.
- metric_ids: List[int] - The metric ids to run.
- workspace_id: Optional[int] - The ID of the workspace where metrics should be run. If only one workspace is configured, it is used by default; otherwise this parameter is required.
- circuit_breaker_mode: bool - Whether the DAG should raise an exception if any metric ends in an alerting state. Defaults to False.
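To make the idea behind circuit_breaker_mode concrete, here is a minimal sketch of a circuit-breaker check. It does not reproduce the operator's internals: the `MetricsAlertingError` exception, the `check_circuit_breaker` helper, and the `"ALERTING"` status string are all hypothetical names chosen for illustration.

```python
from typing import Dict, List


class MetricsAlertingError(Exception):
    """Hypothetical exception; the library's own exception type is not documented here."""


def check_circuit_breaker(
    statuses: Dict[int, str], circuit_breaker_mode: bool = False
) -> List[int]:
    """Return the IDs of alerting metrics, raising if the circuit breaker is enabled."""
    # "ALERTING" is an assumed status label used only for this example.
    alerting = sorted(
        metric_id for metric_id, status in statuses.items() if status == "ALERTING"
    )
    if circuit_breaker_mode and alerting:
        # Raising inside a task fails it, which stops downstream tasks from running.
        raise MetricsAlertingError(f"Metrics in alerting state: {alerting}")
    return alerting
```

With the flag left at its default of False, alerting metrics would only be reported and the DAG would continue.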
Create Delta Operator (bigeye_airflow.operators.create_delta_operator)
The CreateDeltaOperator creates deltas from a list of delta configurations provided to the operator. This operator will fill in reasonable defaults like column mappings. It authenticates through an Airflow connection ID and offers the option to run the deltas after those deltas have been created. Please review the link below to understand the structure of the configurations.
Parameters
- connection_id: str - The Airflow connection ID used to store the required Bigeye credential.
- warehouse_id: int - The Bigeye source/warehouse id to which the delta configurations will be deployed.
- configuration: List[dict] - A list of delta configurations conforming to the following schema.
  - delta_name: str
  - fq_source_table_name: str
  - target_table_comparisons: dict - example: {"target_table_comparisons": [{"fq_target_table_name": "Snowflake.TOOY_DEMO_DB.PROD_REPL.ORDERS"}]}
  - tolerance: Optional[float] - default = 0.0
  - cron_schedule: Optional[dict] - default = None - example: {"cron_schedule": {"name": "Midnight UTC", "cron": "0 0 * * *"}}
  - notification_channels: Optional[List[dict]] - default = None - example: {"notification_channels": [{"slack": "#data-alerts"}]}
- run_after_upsert: bool - If true, the deltas are run after they are created. Defaults to False.
- workspace_id: Optional[int] - The ID of the workspace where deltas should be created. If only one workspace is configured, it is used by default; otherwise this parameter is required.
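Putting the schema fragments above together, a single delta configuration might look like the sketch below. The delta name and the source table name are placeholders; the target table, cron schedule, and Slack channel reuse the examples from the schema. The `missing_required_keys` helper is hypothetical, and treating the three keys without an Optional type as required is an inference from the schema, not a documented rule.

```python
from typing import Any, Dict, List

# Keys that carry no Optional type in the schema above (assumed to be required).
REQUIRED_DELTA_KEYS = ("delta_name", "fq_source_table_name", "target_table_comparisons")

delta_configuration: List[Dict[str, Any]] = [
    {
        "delta_name": "orders_prod_vs_replica",                         # hypothetical name
        "fq_source_table_name": "Snowflake.TOOY_DEMO_DB.PROD.ORDERS",   # hypothetical source
        "target_table_comparisons": [
            {"fq_target_table_name": "Snowflake.TOOY_DEMO_DB.PROD_REPL.ORDERS"}
        ],
        "tolerance": 0.0,
        "cron_schedule": {"name": "Midnight UTC", "cron": "0 0 * * *"},
        "notification_channels": [{"slack": "#data-alerts"}],
    }
]


def missing_required_keys(config: Dict[str, Any]) -> List[str]:
    """List the assumed-required keys that are absent from one delta configuration."""
    return [key for key in REQUIRED_DELTA_KEYS if key not in config]
```

Checking configurations before the DAG runs surfaces a missing key at parse time instead of in the middle of a task.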
Run Deltas Operator (bigeye_airflow.operators.run_deltas_operator)
The RunDeltasOperator will run deltas in Bigeye based on the following:
- All deltas for a given table, by providing warehouse ID, schema name and table name.
- Any and all deltas, given a list of delta IDs.
Currently, if a list of delta IDs is provided, those deltas are run instead of any deltas selected through warehouse_id, schema_name, and table_name.
Parameters
- connection_id: str - The Airflow connection ID used to store the required Bigeye credential.
- warehouse_id: int - The Bigeye source/warehouse id for which deltas will be run.
- schema_name: str - The schema name for which deltas will be run.
- table_name: str - The table name for which deltas will be run.
- delta_ids: List[int] - The delta ids to run.
- workspace_id: Optional[int] - The ID of the workspace where deltas should be run. If only one workspace is configured, it is used by default; otherwise this parameter is required.
- circuit_breaker_mode: bool - Whether the DAG should raise an exception if any delta ends in an alerting state. Defaults to False.
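One way to keep the two selection modes apart is to build the operator's keyword arguments through a small helper, sketched below. `run_deltas_kwargs` is a hypothetical function, not part of the library, and the connection ID in the usage comment is a placeholder. The import path in the comment is assumed from the module named in the heading.

```python
from typing import Any, Dict, List, Optional


def run_deltas_kwargs(
    task_id: str,
    connection_id: str,
    delta_ids: Optional[List[int]] = None,
    warehouse_id: Optional[int] = None,
    schema_name: Optional[str] = None,
    table_name: Optional[str] = None,
    circuit_breaker_mode: bool = False,
) -> Dict[str, Any]:
    """Build keyword arguments for the operator from either delta IDs or a table."""
    has_table = warehouse_id is not None and bool(schema_name) and bool(table_name)
    if not delta_ids and not has_table:
        raise ValueError("Provide delta_ids, or warehouse_id with schema_name and table_name")

    kwargs: Dict[str, Any] = {
        "task_id": task_id,
        "connection_id": connection_id,
        "circuit_breaker_mode": circuit_breaker_mode,
    }
    if delta_ids:
        # Delta IDs take precedence, so the table selectors are left out entirely.
        kwargs["delta_ids"] = list(delta_ids)
    else:
        kwargs.update(
            warehouse_id=warehouse_id, schema_name=schema_name, table_name=table_name
        )
    return kwargs


# Inside a DAG file, assuming the import path matches the heading:
# from bigeye_airflow.operators.run_deltas_operator import RunDeltasOperator
# run_deltas = RunDeltasOperator(
#     **run_deltas_kwargs("run_deltas", "bigeye_connection", delta_ids=[42]), dag=dag
# )
```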
Hashes for bigeye_airflow-0.1.34-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | f0b0b9af6dba9854f9e953a90900ba2570cb3dbf0fe87c6ff2fe19d383115b6e
MD5 | 25db327a1a48e8081c185e884e964e5f
BLAKE2b-256 | 3bb8c3c3051a3de571cd725527d79e674ab29710134b1f58f912331677858334