Marquez integration with Airflow

Project description

marquez-airflow

marquez-airflow is a library that integrates Airflow DAGs with Marquez for automatic metadata collection.

Requirements

  • Python 3.5+
  • apache-airflow 1.10.0+
  • marquez-client

Installation

pip install marquez-airflow

Usage

Once the library is installed, existing DAGs need only a small change: import MarquezDag in place of Airflow's DAG class, as in the example below:

# Aliasing MarquezDag as DAG keeps the rest of the DAG file unchanged.
from marquez.airflow import MarquezDag as DAG
from airflow.operators.dummy_operator import DummyOperator
from datetime import datetime


DAG_NAME = 'my_DAG_name'

default_args = {
    # Marquez metadata settings (mqz_ prefix)
    'mqz_namespace': 'namespace_1',
    'mqz_location': 'github://data-dags/dag_location/',
    'mqz_input_datasets': ["s3://some_data", "s3://more_data"],
    'mqz_output_datasets': ["s3://output_data"],

    # Regular Airflow arguments; supply your own values for the elided ones
    'owner': ...,
    'depends_on_past': False,
    'start_date': ...,
}

# Runs every 10 minutes
dag = DAG(DAG_NAME, schedule_interval='*/10 * * * *',
          default_args=default_args, description="yet another DAG")

# Two placeholder tasks; run_this_too runs after run_this
run_this = DummyOperator(task_id='run_this', dag=dag)
run_this_too = DummyOperator(task_id='run_this_too', dag=dag)
run_this_too.set_upstream(run_this)
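
Before switching a DAG over, it can help to confirm that default_args carries the Marquez-specific keys. The sketch below is not part of marquez-airflow: missing_marquez_args and MARQUEZ_KEYS are hypothetical names, and treating all four mqz_ keys from the example above as required is an assumption, since this description does not say which of them are mandatory.

```python
# Hypothetical pre-flight check; not part of the marquez-airflow API.
# Assumption: the four mqz_ keys from the usage example are the ones to require.
MARQUEZ_KEYS = (
    'mqz_namespace',
    'mqz_location',
    'mqz_input_datasets',
    'mqz_output_datasets',
)


def missing_marquez_args(default_args):
    """Return the mqz_ keys absent from default_args, in MARQUEZ_KEYS order."""
    return [key for key in MARQUEZ_KEYS if key not in default_args]


# Example arguments that deliberately omit 'mqz_output_datasets'.
example_args = {
    'mqz_namespace': 'namespace_1',
    'mqz_location': 'github://data-dags/dag_location/',
    'mqz_input_datasets': ["s3://some_data", "s3://more_data"],
    'depends_on_past': False,
}

print(missing_marquez_args(example_args))  # ['mqz_output_datasets']
```

A check like this only inspects key names in a plain dict, so it can run in a unit test without importing Airflow or contacting a Marquez server.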

Download files

Source Distribution

marquez-airflow-0.0.2.tar.gz (2.6 kB, Source)

Built Distribution

marquez_airflow-0.0.2-py3-none-any.whl (7.2 kB, Python 3)

File details

Details for the file marquez-airflow-0.0.2.tar.gz.

File metadata

  • Download URL: marquez-airflow-0.0.2.tar.gz
  • Upload date:
  • Size: 2.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.8.0 requests-toolbelt/0.9.1 tqdm/4.30.0 CPython/3.5.6

File hashes

Hashes for marquez-airflow-0.0.2.tar.gz

  • SHA256: 8741981ddf13db4cfa7b3af2a4a1b5bf8d09ab2826926bdc156af85a805168ab
  • MD5: ee369dc51567d74391e81fd1bd368418
  • BLAKE2b-256: 92fecb2daa14840b51e1bb38500244b0f7bcbec915aee25e71815307a77a2c37

File details

Details for the file marquez_airflow-0.0.2-py3-none-any.whl.

File metadata

  • Download URL: marquez_airflow-0.0.2-py3-none-any.whl
  • Upload date:
  • Size: 7.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.8.0 requests-toolbelt/0.9.1 tqdm/4.30.0 CPython/3.5.6

File hashes

Hashes for marquez_airflow-0.0.2-py3-none-any.whl

  • SHA256: 6dc4baf9d18c2e64968d30268573b8217bfeef479f26f2079e4b4cff0b35e1f4
  • MD5: c8087b0c4d8fde19388cd7ff5b7f71b8
  • BLAKE2b-256: 19636226a3c6ea025de5abe417ce6c0fd775e8e394848dcd2e03016fe117fea3
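
The SHA256 digests listed above can be checked against a downloaded file using only the standard library. This is a minimal sketch, assuming the archive has already been downloaded to a local path of your choosing; sha256_of and matches_published_digest are illustrative names, not part of pip or marquez-airflow.

```python
import hashlib


def sha256_of(path, chunk_size=65536):
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, 'rb') as handle:
        # iter() with a sentinel stops once read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b''):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_digest(path, expected_hex):
    """Return True if the file's SHA256 equals the published hex digest."""
    return sha256_of(path) == expected_hex.strip().lower()


# SHA256 digest published for marquez-airflow-0.0.2.tar.gz.
SDIST_SHA256 = '8741981ddf13db4cfa7b3af2a4a1b5bf8d09ab2826926bdc156af85a805168ab'

# Usage, assuming the sdist sits in the current directory:
# matches_published_digest('marquez-airflow-0.0.2.tar.gz', SDIST_SHA256)
```

Reading in chunks keeps memory use flat regardless of file size, which matters little for a 2.6 kB archive but lets the same helper be reused for larger downloads.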
