Ingestion Framework for OpenMetadata

This guide will help you set up the Ingestion framework and its connectors.

Python version 3.8+

OpenMetadata Ingestion is a simple framework for building connectors and ingesting metadata from various systems through the OpenMetadata APIs. It can be used within an orchestration framework (e.g. Apache Airflow) to ingest metadata.

Prerequisites

  • Python >= 3.8.x
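The prerequisite can be checked before installing anything. The following is a minimal sketch, not part of the framework; the function name `python_is_supported` is hypothetical.

```python
import sys

# Minimum interpreter version stated in the prerequisites.
MIN_VERSION = (3, 8)


def python_is_supported(version_info=sys.version_info, minimum=MIN_VERSION):
    """Return True when the (major, minor) version is at least the minimum."""
    return tuple(version_info[:2]) >= minimum


if __name__ == "__main__":
    if not python_is_supported():
        found = sys.version.split()[0]
        sys.exit(f"Python {MIN_VERSION[0]}.{MIN_VERSION[1]}+ is required, found {found}")
    print("Python version OK")
```

Comparing tuples rather than version strings avoids mis-ordering releases such as 3.10 and 3.8.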

Install From PyPI

python3 -m pip install --upgrade pip wheel setuptools openmetadata-ingestion
python3 -m spacy download en_core_web_sm

Install Ingestion Connector Dependencies

See the Ingestion Connector documentation for the dependencies each connector requires.

Generate Redshift Data

metadata ingest -c ./pipelines/redshift.json

Generate Redshift Usage Data

metadata ingest -c ./pipelines/redshift_usage.json

Generate Sample Tables

metadata ingest -c ./pipelines/sample_tables.json

Generate Sample Users

metadata ingest -c ./pipelines/sample_users.json

Ingest MySQL data to Metadata APIs

metadata ingest -c ./pipelines/mysql.json
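Each command above has the same shape, `metadata ingest -c <config>`, so several pipelines can be chained from a script or a scheduler task. The sketch below is one way to do that with the standard library. The helper names `build_ingest_command` and `run_pipelines` are hypothetical, and the sketch assumes the `metadata` CLI is on `PATH` and exits with a non-zero status when a pipeline fails.

```python
import subprocess
from typing import List, Sequence


def build_ingest_command(config_path: str) -> List[str]:
    """Build the argument list for `metadata ingest -c <config_path>`."""
    return ["metadata", "ingest", "-c", config_path]


def run_pipelines(config_paths: Sequence[str], runner=subprocess.run) -> None:
    """Run each pipeline in order, stopping at the first failure."""
    for path in config_paths:
        # check=True makes subprocess.run raise CalledProcessError
        # when the command returns a non-zero exit code.
        runner(build_ingest_command(path), check=True)


if __name__ == "__main__":
    run_pipelines(
        [
            "./pipelines/sample_tables.json",
            "./pipelines/sample_users.json",
        ]
    )
```

Passing the command as a list avoids shell quoting problems with config paths that contain spaces.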

Ingest BigQuery data to Metadata APIs

export GOOGLE_APPLICATION_CREDENTIALS="$PWD/pipelines/creds/bigquery-cred.json"
metadata ingest -c ./pipelines/bigquery.json
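A missing or mistyped credentials path is a common cause of failure for the BigQuery pipeline. The sketch below checks the `GOOGLE_APPLICATION_CREDENTIALS` variable before the ingestion is started. The function name `credentials_problem` is hypothetical, and the check only confirms that the file exists, not that its contents are valid.

```python
import os
from pathlib import Path
from typing import Mapping, Optional

ENV_VAR = "GOOGLE_APPLICATION_CREDENTIALS"


def credentials_problem(env: Mapping[str, str] = os.environ) -> Optional[str]:
    """Return a description of the problem, or None if the file is present."""
    value = env.get(ENV_VAR)
    if not value:
        return f"{ENV_VAR} is not set"
    if not Path(value).is_file():
        return f"{ENV_VAR} points to a missing file: {value}"
    return None


if __name__ == "__main__":
    problem = credentials_problem()
    print(problem or "Credentials file found")
```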

Index Metadata into Elasticsearch

Run the Elasticsearch Docker container

docker run -p 9200:9200 -p 9300:9300 -e "discovery.type=single-node" docker.elastic.co/elasticsearch/elasticsearch:7.10.2
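The container needs some time to start before it accepts requests, so an ingestion launched immediately afterwards may fail to connect. The sketch below polls the HTTP port published by the command above until it answers. The helper names `http_ok` and `wait_until_ready` are hypothetical, and the sketch assumes the node answers unauthenticated requests on `http://localhost:9200` with HTTP 200 once it is up.

```python
import time
import urllib.request
from typing import Callable

# Port 9200 is the one published by the docker command above.
ES_URL = "http://localhost:9200"


def http_ok(url: str = ES_URL, timeout: float = 2.0) -> bool:
    """Return True if a GET request to url answers with HTTP 200."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            return response.status == 200
    except OSError:
        # urllib.error.URLError and connection errors are OSError subclasses.
        return False


def wait_until_ready(
    probe: Callable[[], bool] = http_ok,
    attempts: int = 30,
    delay: float = 2.0,
    sleep: Callable[[float], None] = time.sleep,
) -> bool:
    """Call probe up to `attempts` times, sleeping `delay` seconds between tries."""
    for attempt in range(attempts):
        if probe():
            return True
        if attempt < attempts - 1:
            sleep(delay)
    return False


if __name__ == "__main__":
    print("ready" if wait_until_ready() else "timed out")
```

The probe and sleep functions are parameters so the retry logic can be exercised without a running container.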

Run ingestion connector

metadata ingest -c ./pipelines/metadata_to_es.json

Generated sources

We use datamodel-codegen to generate pydantic classes, placed inside the generated module, from the JSON Schemas that define the API and Entities.

This tool derives each class name from the title of its JSON Schema (unlike Java POJO generation, which uses the file name). This convention matters: standardized titles let us write generic code that works across multiple type variables.
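To illustrate why predictable class names help, here is a minimal sketch of generic code parameterized by a type variable. It is not taken from the framework: `Table` and `Dashboard` are plain dataclasses standing in for generated pydantic classes (to keep the example free of dependencies), and `entity_label` and `build_entity` are hypothetical helpers.

```python
from dataclasses import dataclass
from typing import Any, Dict, Type, TypeVar

T = TypeVar("T")


@dataclass
class Table:
    """Stand-in for a class generated from a schema titled "Table"."""
    name: str


@dataclass
class Dashboard:
    """Stand-in for a class generated from a schema titled "Dashboard"."""
    name: str


def entity_label(entity_cls: Type[T]) -> str:
    """Derive a lowercase label from the class name, i.e. from the schema title."""
    return entity_cls.__name__.lower()


def build_entity(entity_cls: Type[T], payload: Dict[str, Any]) -> T:
    """Instantiate any entity class from a payload; the return type follows T."""
    return entity_cls(**payload)


if __name__ == "__main__":
    for cls in (Table, Dashboard):
        print(entity_label(cls), build_entity(cls, {"name": "example"}))
```

Because every class name follows its schema title, a single function can serve all entity types instead of one function per entity.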
