Skip to main content

pytask is a workflow management system that facilitates reproducible data analyses.

Project description

pytask


PyPI PyPI - Python Version image image PyPI - License image image image pre-commit.ci status Ruff

pytask is a workflow management system that facilitates reproducible data analyses. Its features include:

  • Automatic discovery of tasks.
  • Lazy evaluation. If a task, its dependencies, and its products have not changed, do not execute it.
  • Debug mode. Jump into the debugger if a task fails, get feedback quickly, and be more productive.
  • Repeat a task with different inputs. Loop over task functions to run the same task with different inputs.
  • Select tasks via expressions. Run only a subset of tasks with expressions and marker expressions.
  • Easily extensible with plugins. pytask is built on pluggy, a plugin management framework that allows you to adjust pytask to your needs. Plugins are available for parallelization, LaTeX, R, and Stata and more can be found here. Learn more about plugins in this tutorial.

Installation

pytask is available on PyPI and on conda-forge. Install the package with

$ uv add pytask

or

$ pixi add pytask

Color support is automatically available on non-Windows platforms. On Windows, please, use Windows Terminal, which can be, for example, installed via the Microsoft Store.

To quickly set up a new project, use the cookiecutter-pytask-project template or start from other templates or example projects.

Usage

A task is a function that is detected if the module and the function name are prefixed with task_. Here is an example.

# Content of task_hello.py.

from pathlib import Path

from pytask import Product
from typing import Annotated


def task_hello_earth(path: Annotated[Path, Product] = Path("hello_earth.txt")):
    path.write_text("Hello, earth!")
  • The purpose of the task is to create the file hello_earth.txt and add some content.

  • To tell pytask that hello_earth.txt is a product and not an input, use the Product annotation.

    (If you are not used to type annotations, do not worry. pytask also offers simpler interfaces without type annotations.)

  • Since you pass a pathlib.Path to the function, pytask will check whether the file exists after the function is executed.

To execute the task, enter pytask on the command-line

image

Documentation

You find the documentation https://pytask-dev.readthedocs.io/en/stable with tutorials and guides for best practices.

Changes

Consult the release notes to find out about what is new.

License

pytask is distributed under the terms of the MIT license.

Acknowledgment

The license also includes a copyright and permission notice from pytest since some modules, classes, and functions are copied from pytest. Not to mention how pytest has inspired the development of pytask in general. Without the excellent work of Holger Krekel and pytest's many contributors, this project would not have been possible. Thank you!

pytask owes its beautiful appearance on the command line to rich, written by Will McGugan.

Repeating tasks in loops is inspired by ward written by Darren Burns.

Citation

If you rely on pytask to manage your research project, please cite it with the following key to help others to discover the tool.

@Unpublished{Raabe2020,
    Title  = {A Python tool for managing scientific workflows.},
    Author = {Tobias Raabe},
    Year   = {2020},
    Url    = {https://github.com/pytask-dev/pytask}
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pytask-0.6.0.tar.gz (121.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pytask-0.6.0-py3-none-any.whl (155.7 kB view details)

Uploaded Python 3

File details

Details for the file pytask-0.6.0.tar.gz.

File metadata

  • Download URL: pytask-0.6.0.tar.gz
  • Upload date:
  • Size: 121.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.11.8 {"installer":{"name":"uv","version":"0.11.8","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for pytask-0.6.0.tar.gz
Algorithm Hash digest
SHA256 c7c9f0f68e95646fade14bada5ef89cc8a48abf47fe5d47ab9b671d06abe8f93
MD5 0f170f5d425d8a3c277832f0b18d3606
BLAKE2b-256 f5367a58333979ce598d0c81de5c0bd215dd37230aed26f77a88247fa064086a

See more details on using hashes here.

File details

Details for the file pytask-0.6.0-py3-none-any.whl.

File metadata

  • Download URL: pytask-0.6.0-py3-none-any.whl
  • Upload date:
  • Size: 155.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.11.8 {"installer":{"name":"uv","version":"0.11.8","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for pytask-0.6.0-py3-none-any.whl
Algorithm Hash digest
SHA256 cc4c31ead39f5c64be037640f7bf589b68bd0e87ea9e1a049ba86ceab42c9d13
MD5 db2b03cd3bcfb25febd959b8a43d64e5
BLAKE2b-256 d754c30cb1d08258612ece1dfa72c6918998bebecb916c54fca6d806bc780f2b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page