Skip to main content

Kedro-Dagster makes it easy to deploy Kedro projects to Dagster

Project description

Kedro-Dagster

Python Version License PyPI Version Conda Version CodeCov

Powered by Kedro Slack Organisation

What is Kedro-Dagster?

The Kedro-Dagster plugin enables seamless integration between Kedro, a framework for creating reproducible and maintainable data science code, and Dagster, a data orchestrator for machine learning and data pipelines. This plugin makes use of Dagster's orchestration capabilities to automate and monitor Kedro pipelines effectively.

Kedro-Dagster Asset Graph

Currently, Kedro-Dagster supports Kedro versions 0.19.x and 1.x, and Dagster versions 1.10.x, 1.11.x, and 1.12.x.

What are the features of Kedro-Dagster?

  • Configuration‑driven workflows: Centralize orchestration settings in a dagster.yml file for each Kedro environment. Define jobs from filtered Kedro pipelines, assign executors, schedules.
  • Customization: The core integration lives in the auto‑generated Dagster definitions.py. For advanced use cases, you can extend or override these definitions.
  • Kedro hooks preservation: Kedro hooks are preserved and called at the appropriate time during pipeline execution, so custom logic (e.g., data validation, logging) continues to work seamlessly.
  • MLflow compatibility: Use Kedro-MLflow with Dagster’s MLflow integration to track experiments, log models, and register artifacts.
  • Logger integration: Unifies Kedro and Dagster logging so logs from Kedro nodes appear in the Dagster UI and are easy to trace and debug. Additionally, provides configuration to customize Dagster run loggers.
  • (Experimental) Dagster partition support: Make use of Dagster's partitions to fan-out Kedro nodes acting on partitioned data.

How to install Kedro-Dagster?

Install the Kedro-Dagster plugin using pip:

pip install kedro-dagster

or using uv:

uv pip install kedro-dagster

or using conda:

conda install -c conda-forge kedro-dagster

or using mamba:

mamba install -c conda-forge kedro-dagster

or alternatively, add kedro-dagster to your requirements.txt or pyproject.toml file.

How to get started with Kedro-Dagster?

  1. Initialize the plugin in your Kedro project

Use the following command to generate a definitions.py file, where all translated Kedro objects are available as Dagster objects, and a dagster.yml configuration file:

kedro dagster init --env <ENV_NAME>
  1. Configure jobs, executors, and schedules

Define your job executors and schedules in the dagster.yml configuration file located in your Kedro project's conf/<ENV_NAME> directory. This file allows you to filter Kedro pipelines and assign specific executors and schedules to them.

# conf/local/dagster.yml
schedules:
  daily: # Schedule name
    cron_schedule: "0 0 * * *" # Schedule parameters

executors: # Executor name
  sequential: # Executor parameters
    in_process:

  multiprocess:
    multiprocess:
      max_concurrent: 2

jobs:
  default: # Job name
    pipeline: # Pipeline filter parameters
      pipeline_name: __default__
    executor: sequential

  parallel_data_processing:
    pipeline:
      pipeline_name: data_processing
      node_names:
      - preprocess_companies_node
      - preprocess_shuttles_node
    schedule: daily
    executor: multiprocess

  data_science:
    pipeline:
      pipeline_name: data_science
    schedule: daily
    executor: sequential
  1. Launch the Dagster UI

Start the Dagster UI to monitor and manage your pipelines using the following command:

kedro dagster dev --env <ENV_NAME>

The Dagster UI will be available at http://127.0.0.1:3000.

For a concrete use-case, see the Kedro-Dagster example repository.

How do I use Kedro-Dagster?

Full documentation is available at https://kedro-dagster.readthedocs.io/.

Can I contribute?

We welcome contributions, feedback, and questions:

If you are interested in becoming a maintainer or taking a more active role, please reach out to Guillaume Tauzin on GitHub Discussions.

Where can I learn more?

For questions and discussions, you can also open a discussion.

License

This project is licensed under the terms of the Apache-2.0 License.

Acknowledgements

This project is maintained by stateful-y, an ML consultancy specializing in MLOps and data science & engineering. If you're interested in collaborating or learning more about our services, please visit our website.

Made by stateful-y

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

kedro_dagster-0.7.0.tar.gz (3.7 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

kedro_dagster-0.7.0-py3-none-any.whl (66.2 kB view details)

Uploaded Python 3

File details

Details for the file kedro_dagster-0.7.0.tar.gz.

File metadata

  • Download URL: kedro_dagster-0.7.0.tar.gz
  • Upload date:
  • Size: 3.7 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for kedro_dagster-0.7.0.tar.gz
Algorithm Hash digest
SHA256 a52d63fc3571ed399db7eed2877eb8a9e1cf5f44a821a7ad4cd233608757b782
MD5 e7c59af8035df477a7fc81aa1c23c2f8
BLAKE2b-256 c52f23262a54a01a4dd7100bf53d5f945256da4e9a446beb6fb895e8fb45a254

See more details on using hashes here.

Provenance

The following attestation bundles were made for kedro_dagster-0.7.0.tar.gz:

Publisher: publish-release.yml on stateful-y/kedro-dagster

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file kedro_dagster-0.7.0-py3-none-any.whl.

File metadata

  • Download URL: kedro_dagster-0.7.0-py3-none-any.whl
  • Upload date:
  • Size: 66.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for kedro_dagster-0.7.0-py3-none-any.whl
Algorithm Hash digest
SHA256 0c4d8cbac9d336217be6e017dd0dd60ab91c36614cb5716ed63ad08b1ccd6274
MD5 fe0322682fd0594e9c8aaed5b41862fe
BLAKE2b-256 35c206d5f5ca90fae2b69780a73582ee7744ec292da0d5ba2fd9311754c01f2f

See more details on using hashes here.

Provenance

The following attestation bundles were made for kedro_dagster-0.7.0-py3-none-any.whl:

Publisher: publish-release.yml on stateful-y/kedro-dagster

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page