An ETL and DataOps framework for building a lakehouse

These details have not been verified by PyPI

Project links

Project description

Laktory

An open-source DataOps and dataframe-centric ETL framework for building lakehouses. Use it standalone or extend your existing DABs setup with it.

Laktory is an all-in-one solution for defining both data transformations and Databricks resources. Imagine if Declarative Automation Bundles (DAB) supported any Databricks resources and offered a declarative approach to data transformations, that's essentially Laktory.

Deploy it standalone as your full Databricks DataOps platform, or add it alongside your existing DAB setup to manage pipeline definitions and the resources DAB doesn't cover.

This open-source framework streamlines the creation, deployment, and execution of data pipelines while adhering to essential DevOps practices such as version control, code reviews, and CI/CD integration. Powered by Narwhals, Laktory enables seamless transitions between Apache Spark, Polars, and other frameworks to perform data transformations reliably and at scale. Its modular and flexible design allows you to effortlessly combine SQL statements with DataFrame operations, reducing complexity and enhancing productivity.

Since Laktory pipelines are built on top of Narwhals, they can run in any environment that supports Python, from your local machine to a Kubernetes cluster. Pipelines can be orchestrated using tools like Apache Airflow or deployed directly as Databricks Jobs or Declarative Pipelines, offering both flexible and fully managed execution options.

But Laktory goes beyond data pipelines. It empowers you to define and deploy your entire Databricks data platform, from Unity Catalog and access grants to compute and quality monitoring. This empowers your data team to take full ownership of the solution, eliminating the need to juggle multiple technologies.

No more splitting ownership between Terraform for infrastructure and DAB for workflows. With Laktory, the team that builds the pipelines can own the stack end to end.

Help

See documentation for more details.

Installation

Install using

pip install laktory

For more installation options, see the Install section in the documentation.

A Basic Example

from laktory import models


node_brz = models.PipelineNode(
    name="brz_stock_prices",
    sources=[{
        "format": "PARQUET",
        "path": "./data/brz_stock_prices/"
    }],
    transformer={
        "nodes": []
    }
)

node_slv = models.PipelineNode(
    name="slv_stock_prices",
    sources=[{
        "node_name": "brz_stock_prices"
    }],
    sinks=[{
        "path": "./data/slv_stock_prices",
        "mode": "OVERWRITE",
        "format": "PARQUET",
    }],
    transformer={
        "nodes": [
            
            # SQL Transformation
            {
                "expr": """
                    SELECT
                      data.created_at AS created_at,
                      data.symbol AS symbol,
                      data.open AS open,
                      data.close AS close,
                      data.high AS high,
                      data.low AS low,
                      data.volume AS volume
                    FROM
                      {df}
                """   
            },
            
            # Spark Transformation
            {
                "func_name": "drop_duplicates",
                "func_kwargs": {
                    "subset": ["created_at", "symbol"]
                }
            },
        ]
    }
)

pipeline = models.Pipeline(
    name="stock_prices",
    nodes=[node_brz, node_slv],
)

pipeline.execute(spark=spark)

To get started with a more useful example, jump into the Quickstart.

Get Involved

Laktory is growing rapidly, and we'd love for you to be part of our journey! Here's how you can get involved:

Join the Community: Connect with fellow Laktory users and contributors on our Slack. Share ideas, ask questions, and collaborate!
Suggest Features or Report Issues: Have an idea for a new feature or encountering an issue? Let us know on GitHub Issues. Your feedback helps shape the future of Laktory!
Contribute to Laktory: Check out our contributing guide to learn how you can tackle issues and add value to the project.

A Lakehouse DataOps Template

A comprehensive template on how to deploy a lakehouse as code using Laktory is maintained here: https://github.com/okube-ai/lakehouse-as-code

In this template, 4 stacks are used to:

{cloud_provider}_infra: Deploy the required resources on your cloud provider
unity-catalog: Setup users, groups, catalogs, schemas and manage grants
workspace: Setup secrets, clusters and warehouses and common files/notebooks
workflows: The data workflows to build your lakehouse

Okube Company

Okube is dedicated to building open source frameworks, known as the kubes, empowering businesses to build, deploy and operate highly scalable data platforms and AI models.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.12.2

Jun 30, 2026

0.12.1

Jun 5, 2026

0.12.0

Jun 5, 2026

0.11.10

May 20, 2026

0.11.9

May 17, 2026

0.11.8

May 16, 2026

0.11.7

May 16, 2026

0.11.6

May 16, 2026

0.11.5

May 15, 2026

0.11.4

May 7, 2026

0.11.3

May 4, 2026

0.11.2

May 1, 2026

0.11.1

May 1, 2026

0.11.0

Apr 29, 2026

0.10.0

Apr 16, 2026

0.9.5

Apr 9, 2026

0.9.4

Mar 31, 2026

0.9.3

Mar 27, 2026

0.9.2

Mar 27, 2026

0.9.1

Mar 18, 2026

0.9.0

Mar 8, 2026

0.8.17

Dec 17, 2025

0.8.16

Dec 12, 2025

0.8.15

Dec 9, 2025

0.8.14

Dec 3, 2025

0.8.13

Nov 17, 2025

0.8.12

Nov 10, 2025

0.8.11

Nov 3, 2025

0.8.10

Oct 17, 2025

0.8.9

Aug 27, 2025

0.8.8

Aug 26, 2025

0.8.7

Aug 15, 2025

0.8.6

Aug 9, 2025

0.8.5

Aug 4, 2025

0.8.4

Jul 16, 2025

0.8.3

Jul 14, 2025

0.8.2

Jul 11, 2025

0.8.0

Jun 29, 2025

0.7.3

Apr 16, 2025

0.7.2

Apr 3, 2025

0.7.1

Mar 20, 2025

0.7.0

Mar 18, 2025

0.6.10

Mar 14, 2025

0.6.9

Mar 14, 2025

0.6.8

Mar 11, 2025

0.6.7

Mar 10, 2025

0.6.6

Mar 10, 2025

0.6.5

Feb 19, 2025

0.6.4

Jan 24, 2025

0.6.3

Jan 22, 2025

0.6.2

Jan 22, 2025

0.6.1

Jan 20, 2025

0.6.0

Jan 18, 2025

0.5.13

Jan 7, 2025

0.5.12

Dec 30, 2024

0.5.11

Dec 30, 2024

0.5.10

Dec 29, 2024

0.5.9

Dec 20, 2024

0.5.8

Dec 18, 2024

0.5.7

Dec 9, 2024

0.5.6

Dec 3, 2024

0.5.5

Dec 2, 2024

0.5.4

Nov 26, 2024

0.5.3

Nov 22, 2024

0.5.2

Nov 12, 2024

0.5.1

Nov 8, 2024

0.5.0

Nov 5, 2024

0.4.14

Oct 8, 2024

0.4.13

Oct 1, 2024

0.4.12

Sep 18, 2024

0.4.11

Aug 16, 2024

0.4.10

Jul 20, 2024

0.4.9

Jul 20, 2024

0.4.8

Jul 3, 2024

0.4.7

Jun 27, 2024

0.4.6

Jun 27, 2024

0.4.5

Jun 25, 2024

0.4.4

Jun 25, 2024

0.4.3

Jun 12, 2024

0.4.2

Jun 11, 2024

0.4.1

Jun 11, 2024

0.4.0

Jun 11, 2024

0.3.3

May 30, 2024

0.3.2

May 28, 2024

0.3.1

May 28, 2024

0.3.0

May 28, 2024

0.2.1

May 7, 2024

0.2.0

May 2, 2024

0.1.10

Apr 23, 2024

0.1.9

Apr 17, 2024

0.1.8

Mar 25, 2024

0.1.7

Mar 15, 2024

0.1.6

Feb 23, 2024

0.1.5

Feb 14, 2024

0.1.4

Feb 12, 2024

0.1.3

Feb 10, 2024

0.1.2

Feb 5, 2024

0.1.1

Jan 28, 2024

0.1.0

Jan 12, 2024

0.0.29

Dec 20, 2023

0.0.28

Dec 17, 2023

0.0.27

Dec 16, 2023

0.0.26

Dec 16, 2023

0.0.25

Dec 12, 2023

0.0.24

Dec 5, 2023

0.0.23

Dec 1, 2023

0.0.22

Nov 29, 2023

0.0.21

Nov 27, 2023

0.0.20

Nov 27, 2023

0.0.19

Nov 23, 2023

0.0.18

Nov 14, 2023

0.0.17

Nov 13, 2023

0.0.16

Nov 8, 2023

0.0.15

Nov 7, 2023

0.0.14

Nov 6, 2023

0.0.13

Nov 6, 2023

0.0.12

Nov 5, 2023

0.0.11

Nov 5, 2023

0.0.10

Oct 31, 2023

0.0.9

Oct 27, 2023

0.0.8

Oct 24, 2023

0.0.7

Oct 20, 2023

0.0.6

Oct 10, 2023

0.0.5

Sep 28, 2023

0.0.4

Sep 27, 2023

0.0.3

Sep 25, 2023

0.0.2

Sep 24, 2023

0.0.1

Jul 13, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

laktory-0.12.2.tar.gz (751.2 kB view details)

Uploaded Jun 30, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

laktory-0.12.2-py3-none-any.whl (916.8 kB view details)

Uploaded Jun 30, 2026 Python 3

File details

Details for the file laktory-0.12.2.tar.gz.

File metadata

Download URL: laktory-0.12.2.tar.gz
Upload date: Jun 30, 2026
Size: 751.2 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.5.14

File hashes

Hashes for laktory-0.12.2.tar.gz
Algorithm	Hash digest
SHA256	`817f73abde1ffb8e2c95d02cc793f424dba38bf04ec7f133ba7653e53b2c2a3b`
MD5	`e1cd7ccc6f7da974649fc6b742f5902c`
BLAKE2b-256	`2e055014d0a58c365cc7dcbf42d4ee35fc822613c12aa237dd9ba23833d1a276`

See more details on using hashes here.

File details

Details for the file laktory-0.12.2-py3-none-any.whl.

File metadata

Download URL: laktory-0.12.2-py3-none-any.whl
Upload date: Jun 30, 2026
Size: 916.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.5.14

File hashes

Hashes for laktory-0.12.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`f6838ac1bb08662384d4845057e9f2b461d93af22eaf4cc19cf63671cef0c7c2`
MD5	`2e937370ac0799c5973ae73c70a44bda`
BLAKE2b-256	`3adbf49ee6936270abbbfaae51ca126f86e4f61cb7274d20e33319f002d40c8a`

See more details on using hashes here.

laktory 0.12.2

Navigation

Verified details

Owner

Unverified details

Project links

Meta

Classifiers

Project description

Laktory

Help

Installation

A Basic Example

Get Involved

A Lakehouse DataOps Template

Okube Company

Project details

Verified details

Owner

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes