Skip to main content

Presenting data lineage based on your data warehouse query history

Project description

Logo

License Downloads GitHub last commit

Elementary was built out of the need to effortlessly and immediately gain visibility into the data stack, starting with tracing the actual upstream & downstream dependencies in the data warehouse, without any implementation efforts, security risks or compromises on accuracy.

Data lineage made simple, reliable, and automated.

:star: If you like what we are building, support us with a :star:

Sandbox

Try out our new live demo here.

Quick start

pip install elementary-lineage

# The tool is named edl (Elementary Data Lineage),
# run it to validate the installation:
edl --help

We need a connection file in a simple YAML called profiles.yml. Here is a template to create a Snowflake / BigQuery one.
For further instructions go to our quickstart page.

If you use dbt, you can start right away by running this command with the path to your profiles.yml and the relevant profile name:

edl -d ~/.dbt -p <profile_name>

Documentation & Demo

Our full documentation is available here. We also created a short demo video.

Features

  • Lineage visualization: Visual map of data flow and dependencies in the data warehouse.
  • Dataset status: Present data about freshness and volume on the lineage graph.
  • Accuracy: Reflects the actual state in the DWH based on logs.
  • Plug-and-play: No need for code changes.
  • Graph filters: Filter the graph by dataset, dates, direction, and depth.

You can impact our next features in this roadmap by voting :+1: to issues and opening new ones.

Our plan is to eventually build a data observability platform that is open, transparent and powered by the community. A solution that data teams could easily integrate into their workflows, detect data incidents and prevent them from even happening in the first place.

We are working on Dataset monitoring, Lineage history, Column level lineage, Full lineage, and more.

Community & Support

For additional information and help, you can use one of these channels:

  • Slack (Live chat with the team, support, discussions, etc.)
  • GitHub issues (Bug reports, feature requests)
  • Roadmap (Vote for features and add your inputs)
  • Twitter (Updates on new releases and stuff)

Integrations

  • Snowflake
  • BigQuery
  • Redshift

Ask us for integrations on Slack or as a GitHub issue.

License

Elementary lineage is licensed under Apache License 2.0. See the LICENSE file for licensing information.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

elementary-lineage-0.0.22.tar.gz (51.7 kB view hashes)

Uploaded Source

Built Distribution

elementary_lineage-0.0.22-py3-none-any.whl (56.6 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page