Skip to main content

Data monitoring and lineage

Project description

Logo

License Downloads GitHub last commit

Our goal is to provide data teams with immediate visibility, detection of data issues, and impact analysis.
We focus on providing a simple setup, integrations with the existing stack, and centralized metadata in your own data warehouse.

Supported use cases

Live data lineage - Enriched with operational context like freshness, volume, duration.

Source tables monitoring - Detect breaking changes and discover new data you can leverage in your source tables.

Alerts - Slack alerts on breaking changes and new data can be configured within minutes.

Demo & sandbox

Try out our live lineage sandbox here.

:star: If you like what we are building, support us with a :star:

Quick start

Install & connect

pip install elementary-data

# The tool is named edr (Elementary Data Reliability),
# run it to validate the installation:
edr --help

Add your data warehouse connection details in a profiles.yml file, see our quickstart page to learn more or use this template here. Yes, if you are a dbt user we use dbt's profiles.yml by default (simply add a new profile called 'elementary').

Data lineage

# Generate data lineage graph
edr lineage 

# Filter the graph for a specific table, direction and depth
edr lineage -t +my_table+3

Data monitoring

After you configure sources to monitor, execute it using:

edr monitor

To learn how to continuously monitor your data and use our Slack integration refer to our documentation.

Slack

Documentation

Want to learn more on how to quickly get started with it? Go to our quickstart page.

Have questions about the configuration? Go to our configuration FAQ here.

Curious to learn about the different modules? Use this modules overview.

Our full documentation is available here.

Features

Data lineage

  • Lineage visualization: Visual map of data flow and dependencies in the data warehouse, including legacy that is not managed by dbt.
  • Dataset status: Present data about freshness, volume, permissions and more on the lineage graph itself.
  • Accuracy: Reflects the actual state in the DWH based on logs and your query history.
  • Plug-and-play: No need for code changes.
  • Graph filters: Filter the graph by dataset, dates, direction, and depth.

Tables monitoring

  • Slack notifications.
  • Detect deletions: columns and tables that were removed.
  • Detect data type changes.
  • Detect new data: columns and tables that were added.

You can impact our next features in this roadmap by voting :+1: to issues and opening new ones.

We aim to build an open, transparent, and community-powered data observability platform. A solution that data teams could easily integrate into their workflows, detect data incidents and prevent them from even happening in the first place.

Community & Support

For additional information and help, you can use one of these channels:

  • Slack (Live chat with the team, support, discussions, etc.)
  • GitHub issues (Bug reports, feature requests)
  • Roadmap (Vote for features and add your inputs)
  • Twitter (Updates on new releases and stuff)

Integrations

  • Snowflake - Lineage & monitoring
  • BigQuery - Lineage only
  • Redshift

Ask us for integrations on Slack or as a GitHub issue.

License

Elementary is licensed under Apache License 2.0. See the LICENSE file for licensing information.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

elementary-data-0.1.4.tar.gz (64.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

elementary_data-0.1.4-py3-none-any.whl (71.2 kB view details)

Uploaded Python 3

File details

Details for the file elementary-data-0.1.4.tar.gz.

File metadata

  • Download URL: elementary-data-0.1.4.tar.gz
  • Upload date:
  • Size: 64.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.2 importlib_metadata/4.0.1 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.60.0 CPython/3.8.0

File hashes

Hashes for elementary-data-0.1.4.tar.gz
Algorithm Hash digest
SHA256 be5b908678c61cb9bc67d17bfe9bb691c66dbff52d9439b4def965288f53a144
MD5 d27c49151372db0fef0b7a8d478fbbaf
BLAKE2b-256 17142cd2cfec7a1fc29f073100e7a1c7d64ca71f4d32d5f0e183b9b6b3c57ddf

See more details on using hashes here.

File details

Details for the file elementary_data-0.1.4-py3-none-any.whl.

File metadata

  • Download URL: elementary_data-0.1.4-py3-none-any.whl
  • Upload date:
  • Size: 71.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.2 importlib_metadata/4.0.1 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.60.0 CPython/3.8.0

File hashes

Hashes for elementary_data-0.1.4-py3-none-any.whl
Algorithm Hash digest
SHA256 375eb33bf6b0dfe753e98dc48775a58b4d441feba80236193cb2bb054532be4f
MD5 2427ee00f112a50bd7785578ba256eda
BLAKE2b-256 dcf28a73c55475f97bd56a4cd1ba5648bba343ba2737c3e7b2263689895dd3c9

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page