A collection of Python-based 'connectors' that extract metadata from various sources to ingest into the Metaphor app.
Project description
Metaphor Connectors
This repository contains a collection of Python-based "connectors" that extract metadata from various sources to ingest into the Metaphor platform.
Installation
This package requires Python 3.9+ installed. You can verify the version on your system by running the following command,
python -V # or python3 on some systems
Once verified, you can install the package using pip,
pip install "metaphor-connectors[all]" # or pip3 on some systems
This will install all the connectors and required dependencies. You can also choose to install only a subset of the dependencies by installing the specific extra, e.g.
pip install "metaphor-connectors[snowflake]"
Similarly, you can also install the package using requirements.txt
or pyproject.toml
.
Docker
We automatically push a docker image to Docker Hub as part of the CI/CD. See this page for more details.
GitHub Action
You can also run the connectors in your CI/CD pipeline using the Metaphor Connectors GitHub Action.
Connectors
Each connector is placed under its own directory under metaphor and extends the metaphor.common.BaseExtractor
class.
Connector Name | Metadata |
---|---|
athena | Schema, description, queries |
azure_data_factory | Lineage, Pipeline |
bigquery | Schema, description, statistics, queries |
bigquery.lineage | Lineage |
bigquery.profile | Data profile |
confluence | Document embeddings |
custom.data_quality | Data quality |
custom.governance | Ownership, tags, description |
custom.lineage | Lineage |
custom.metadata | Custom metadata |
custom.query_attributions | Query attritutions |
datahub | Description, tag, ownership |
dbt | dbt model, test, lineage |
dbt.cloud | dbt model, test, lineage |
fivetran | Lineage, Pipeline |
glue | Schema, description |
informatica | Lineage, Pipeline |
looker | Looker view, explore, dashboard, lineage |
kafka | Schema, description |
metabase | Dashboard, lineage |
mongodb | Schema, statistics |
monte_carlo | Data monitor |
mssql | Schema |
mysql | Schema, description |
oracle | Schema, description, queries |
notion | Document embeddings |
postgresql | Schema, description, statistics |
postgresql.profile | Data profile |
postgresql.usage | Usage |
power_bi | Dashboard, lineage |
quick_sight | Dashboard, lineage |
redshift | Schema, description, statistics, queries |
redshift.profile | Data profile |
s3 | Schema, description |
sharepoint | Document embeddings |
snowflake | Schema, description, statistics, queries |
snowflake.profile | Data profile |
static_web | Document embeddings |
synapse | Schema, queries |
tableau | Dashboard, lineage |
thought_spot | Dashboard, lineage |
trino | Schema, description, queries |
unity_catalog | Schema, description |
unity_catalog.profile | Data profile, statistics |
Development
See Development Environment for more instructions on how to set up your local development environment.
Custom Connectors
See Adding a Custom Connector for instructions and a full example of creating your custom connectors.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for metaphor_connectors-0.14.131.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | 6f211a58e6bb76447d5a5fe4fd25bb34eb4b4f7aec7e9859040746af58711be0 |
|
MD5 | 116525cef9c39a3f6f30c2d1acf00c5f |
|
BLAKE2b-256 | b6058ae93a7a8d05854847dc3383a496d92504c5e1c6a39073ad09c96de42604 |
Hashes for metaphor_connectors-0.14.131-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 0321f8a27c5730bca279712733787f27d2f5ff2f0963cff6c65cc822eb1c3d3a |
|
MD5 | 7b19d36db8bf4be11f16e04e56453b95 |
|
BLAKE2b-256 | 9ca5f6ef9aeb6fd0013c288f6d30dc205dfb6d2ab5520f037c2a9a18c8139db2 |