Kedro helps you build production-ready data and analytics pipelines
Project description
What is Kedro?
"The centre of your data pipeline."
Kedro is an open-source Python framework that applies software engineering best-practice to data and machine-learning pipelines. You can use it, for example, to optimise the process of taking a machine learning model into a production environment. You can use Kedro to organise a single user project running on a local environment, or collaborate within a team on an enterprise-level project.
We provide a standard approach so that you can:
- Worry less about how to write production-ready code,
- Spend more time building data pipelines that are robust, scalable, deployable, reproducible and versioned,
- Standardise the way that your team collaborates across your project.
How do I install Kedro?
`kedro` is a Python package built for Python 3.6, 3.7 and 3.8.
To install Kedro from the Python Package Index (PyPI), simply run:
pip install kedro
You can also install `kedro` using `conda`, a package and environment manager bundled with Anaconda. With `conda` already installed, simply run:
conda install -c conda-forge kedro
Our Get Started guide contains full installation instructions, and includes how to set up Python virtual environments.
We also recommend the frequently asked questions and the API reference documentation for additional information.
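Once installed, a quick way to sanity-check the installation is to import the package from Python and print its version. This is a minimal, illustrative check rather than an official verification step:

```python
# Minimal post-install sanity check: import kedro and print the installed version.
import kedro

print(kedro.__version__)  # e.g. "0.16.6"
```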
What are the main features of Kedro?
A pipeline visualisation generated using Kedro-Viz
Feature | What is this? |
---|---|
Project Template | A standard, modifiable and easy-to-use project template based on Cookiecutter Data Science. |
Data Catalog | A series of lightweight data connectors used for saving and loading data across many different file formats and file systems including local and network file systems, cloud object stores, and HDFS. The Data Catalog also includes data and model versioning for file-based systems. Used with a Python or YAML API. |
Pipeline Abstraction | Automatic resolution of dependencies between pure Python functions and data pipeline visualisation using Kedro-Viz. |
The Journal | An ability to reproduce pipeline runs with saved pipeline run results. |
Coding Standards | Test-driven development using `pytest`, produce well-documented code using Sphinx, create linted code with support for `flake8`, `isort` and `black`, and make use of the standard Python logging library. |
Flexible Deployment | Deployment strategies that include the use of Docker with Kedro-Docker, conversion of Kedro pipelines into Airflow DAGs with Kedro-Airflow, leveraging a REST API endpoint with Kedro-Server (coming soon) and serving Kedro pipelines as a Python package. Kedro can be deployed locally, on on-premise or cloud (AWS, Azure and Google Cloud Platform) servers, or on clusters (EMR, EC2, Azure HDInsight and Databricks). |
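To make the Data Catalog and Pipeline rows above more concrete, here is a minimal, self-contained sketch that wires two ordinary Python functions into a pipeline and runs it against an in-memory catalog. The function and dataset names (`clean`, `count_rows`, `raw_numbers` and so on) are invented for this example; only `DataCatalog`, `MemoryDataSet`, `node`, `Pipeline` and `SequentialRunner` come from Kedro itself.

```python
from kedro.io import DataCatalog, MemoryDataSet
from kedro.pipeline import Pipeline, node
from kedro.runner import SequentialRunner


def clean(numbers):
    # A pure Python function: drop falsy values (0, None, ...).
    return [n for n in numbers if n]


def count_rows(numbers):
    # Another pure function, consuming the output of `clean`.
    return len(numbers)


# The Data Catalog maps dataset names to data connectors; here everything is in memory.
catalog = DataCatalog({"raw_numbers": MemoryDataSet([1, 0, 2, None, 3])})

# Nodes declare their inputs and outputs by name; Kedro resolves the execution order.
pipeline = Pipeline(
    [
        node(clean, inputs="raw_numbers", outputs="clean_numbers"),
        node(count_rows, inputs="clean_numbers", outputs="row_count"),
    ]
)

# Run the pipeline; terminal outputs not registered in the catalog are returned as a dict.
print(SequentialRunner().run(pipeline, catalog))  # {'row_count': 3}
```

In a real project the datasets would normally be declared in the project's `conf/base/catalog.yml` and the pipeline assembled within the project template, rather than built inline like this.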
How do I use Kedro?
The Kedro documentation includes examples to help get you started:
- A typical "Hello World" example, for an entry-level description of the main Kedro concepts
- The more detailed "spaceflights" tutorial to give you hands-on experience as you learn about Kedro
Additional documentation includes:
- An overview of Kedro architecture
- How to use the CLI offered by `kedro_cli.py` (`kedro new`, `kedro run`, ...)
Note: The CLI is a convenient tool for running `kedro` commands, but you can also invoke the Kedro CLI as a Python module with `python -m kedro`.
Every Kedro function or class has extensive help, which you can call from a Python session as follows if the item is in local scope:
from kedro.io import MemoryDataSet
help(MemoryDataSet)
Why does Kedro exist?
Kedro is built upon our collective best practices (and mistakes) from trying to deliver real-world ML applications that use vast amounts of raw, unvetted data. We developed Kedro to achieve the following:
- Collaboration on an analytics codebase when different team members have varied exposure to software engineering best-practice
- A focus on maintainable data and ML pipelines as the standard, instead of a singular activity of deploying models in production
- A way to inspire the creation of reusable analytics code so that we never start from scratch when working on a new project
- Efficient use of time because we're able to quickly move from experimentation into production
The humans behind Kedro
Kedro was originally designed by Aris Valtazanos and Nikolaos Tsaousis to solve challenges they faced in their project work. Their work was later turned into an internal product by Peteris Erins, Ivan Danov, Nikolaos Kaltsas, Meisam Emamjome and Nikolaos Tsaousis.
Currently the core Kedro team consists of:
- Yetunde Dada
- Ivan Danov
- Richard Westenra
- Dmitrii Deriabin
- Lorena Balan
- Kiyohito Kunii
- Zain Patel
- Lim Hoang
- Andrii Ivaniuk
- Jo Stichbury
- Laís Carvalho
- Merel Theisen
- Gabriel Comym
- Liam Brummitt
Former core team members with significant contributions include: Gordon Wrigley, Nasef Khan and Anton Kirilenko.
And last but not least, all the open-source contributors whose work went into all Kedro releases.
Can I contribute?
Yes! Want to help build Kedro? Check out our guide to contributing to Kedro.
Where can I learn more?
There is a growing community around Kedro. Have a look at the Kedro FAQs to find projects using Kedro and links to articles, podcasts and talks.
Who is using Kedro?
- AI Singapore
- Caterpillar
- ElementAI
- Jungle Scout
- MercadoLibre Argentina
- Mosaic Data Science
- NaranjaX
- Open Data Science LatAm
- Retrieva
- Roche
- UrbanLogiq
- XP
- Dendra Systems
What licence do you use?
Kedro is licensed under the Apache 2.0 License.
We're hiring!
Do you want to be part of the team that builds Kedro and other great products at QuantumBlack? If so, you're in luck! QuantumBlack is currently hiring Software Engineers who love using data to drive their decisions. Take a look at our open positions and see if you're a fit.
Project details
File details
Details for the file `kedro-0.16.6.tar.gz`.
File metadata
- Download URL: kedro-0.16.6.tar.gz
- Upload date:
- Size: 134.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.2.0 pkginfo/1.6.0 requests/2.24.0 setuptools/50.3.0.post20201006 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.6.12
File hashes
Algorithm | Hash digest
---|---
SHA256 | 97a7709e73a61eee476798bf3165e09d90ad930bd2b143e044327649967b83f8
MD5 | 869fbe684ad74488c1bf3d44789dd168
BLAKE2b-256 | 8f270ff4dc6afc453cb2ca83bbd1ba74cdbf0e9e0ae03c1fe281e9b8e6815b08
File details
Details for the file `kedro-0.16.6-py3-none-any.whl`.
File metadata
- Download URL: kedro-0.16.6-py3-none-any.whl
- Upload date:
- Size: 15.6 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.2.0 pkginfo/1.6.0 requests/2.24.0 setuptools/50.3.0.post20201006 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.6.12
File hashes
Algorithm | Hash digest
---|---
SHA256 | f486fb498a8c9e18a28d255b97287885bce4b509e88bee3b6283c65e55fa232b
MD5 | b68156171f2539d9a153e204f89da2da
BLAKE2b-256 | e5e9fd03dc41f41a72790f23140369060e60b61b24098a0ad80641a3c334f60b