Skip to main content

This project helps organizations define Legend data models that can be converted into efficient Delta pipelines

Project description

Legend on Delta Lake

An extension to the Legend framework for spark / delta lake based environment, combining best of open data standards with open source technologies

FINOS - Incubating Build CI Maven Central


In addition to the JDBC connectivity enabled to Databricks from the legend-engine itself, this project helps organizations define data models that can be converted into efficient data pipelines, ensuring data being queried is of high quality and availability. Raw data can be ingested as stream or batch and processed in line with the business semantics defined from the Legend interface. Domain specific language defined in Legend Studio can be interpreted as a series of Spark SQL operations, helping analysts create Delta Lake tables that not only guarantees schema definition but also complies with expectations, derivations and constraints defined by business analysts.

Usage

Make sure to have the jar file of org.finos.legend-community:legend-delta:X.Y.Z and all its dependencies available in your spark classpath and a legend data model (version controlled on gitlab) previously compiled to disk or packaged as a jar file and available in your classpath. For python support, please add the corresponding library from pypi repo.

pip install legend-delta==X.Y.Z

We show you how to extract schema, retrieve and enforce expectations and create delta tables in both scala and python sample notebooks.

Author

Databricks, Inc.

License

Copyright 2021 Databricks, Inc.

Distributed under the Apache License, Version 2.0.

SPDX-License-Identifier: Apache-2.0

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

legend_delta-0.1.10-py3-none-any.whl (5.6 kB view details)

Uploaded Python 3

File details

Details for the file legend_delta-0.1.10-py3-none-any.whl.

File metadata

  • Download URL: legend_delta-0.1.10-py3-none-any.whl
  • Upload date:
  • Size: 5.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.9.13

File hashes

Hashes for legend_delta-0.1.10-py3-none-any.whl
Algorithm Hash digest
SHA256 9f86bc83807e542f0cfe73ca4f8db714195fced7223e35789482b1dc404a63ae
MD5 99550b9eea5ac48f1d01490866651cc0
BLAKE2b-256 c971971e2eb4c0b9fade8fcdbd3e47974596f82c96d4e92baab973e441684c6e

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page