Skip to main content

In its highest aspirations, pytask tries to be pytest as a build system.

Project description

pytask
PyPI PyPI - Python Version https://img.shields.io/conda/vn/conda-forge/pytask.svg https://img.shields.io/conda/pn/conda-forge/pytask.svg PyPI - License https://readthedocs.org/projects/pytask-dev/badge/?version=latest https://img.shields.io/github/workflow/status/pytask-dev/pytask/Continuous%20Integration%20Workflow/main https://codecov.io/gh/pytask-dev/pytask/branch/main/graph/badge.svg pre-commit.ci status https://img.shields.io/badge/code%20style-black-000000.svg

In its highest aspirations, pytask tries to be pytest as a build system. It’s main purpose is to facilitate reproducible research by automating workflows in research projects. Its features include:

  • Automatic discovery of tasks.

  • Lazy evaluation. If a task, its dependencies, and its products have not changed, do not execute it.

  • Debug mode. Jump into the debugger if a task fails, get feedback quickly, and be more productive.

  • Select tasks via expressions. Run only a subset of tasks with expressions and marker expressions known from pytest.

  • Easily extensible with plugins. pytask’s architecture is based on pluggy, a plugin management framework, so that you can adjust pytask to your needs. Plugins are, for example, available for parallelization, LaTeX, R, and Stata. Read here how you can use plugins.

New Features

  • Create a visualization of the DAG with pytask dag. (Tutorial)

  • Show a profile of all tasks (duration, size of products) with pytask profile.

Installation

pytask is available on PyPI for Python >= 3.6.1 and on Anaconda.org for Python >= 3.7. Install the package with

$ pip install pytask

# or

$ conda install -c conda-forge pytask

Color support is automatically available on non-Windows platforms. On Windows, please, use Windows Terminal which can be, for example, installed via the Microsoft Store.

Usage

A task is a function which is detected if the module and the function name are prefixed with task_. Here is an example.

# Content of task_hello.py.

import pytask


@pytask.mark.produces("hello_earth.txt")
def task_hello_earth(produces):
    produces.write_text("Hello, earth!")

Here are some details:

  • Dependencies and products of a task are tracked via markers. For dependencies use @pytask.mark.depends_on and for products use @pytask.mark.produces. Use strings and pathlib.Path to specify the location. Pass multiple dependencies or products as a list or a dictionary for positional or key-based access.

  • With produces (and depends_on) as function arguments, you get access to the dependencies and products inside the function via pathlib.Path objects. Here, produces holds the path to "hello_earth.txt".

To execute the task, type the following command on the command-line

$ pytask
========================= Start pytask session =========================
Platform: linux -- Python 3.x.y, pytask 0.x.y, pluggy 0.x.y
Root: xxx
Collected 1 task(s).

.
====================== 1 succeeded in 1 second(s) ======================

Demo

The demo walks you through the following steps.

  1. Write an executable script which produces a text file like you would normally do without pytask.

  2. Rewrite the script to a pytask task.

  3. Execute the task.

  4. Add a task which produces a second text file and another task which merges both text files.

  5. Execute all three tasks.

https://github.com/pytask-dev/misc/raw/main/gif/workflow.gif

Documentation

The documentation can be found under https://pytask-dev.readthedocs.io/en/latest with tutorials and guides for best practices.

Changes

Consult the release notes to find out about what is new.

License

pytask is distributed under the terms of the MIT license.

Acknowledgment

The license also includes a copyright and permission notice from pytest since some modules, classes, and functions are copied from pytest. Not to mention how pytest has inspired the development of pytask in general. Without the amazing work of Holger Krekel and pytest’s many contributors, this project would not have been possible. Thank you!

Citation

If you rely on pytask to manage your research project, please cite it with the following key to help others to discover the tool.

@Unpublished{Raabe2020,
    Title  = {A Python tool for managing scientific workflows.},
    Author = {Tobias Raabe},
    Year   = {2020},
    Url    = {https://github.com/pytask-dev/pytask}
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pytask-0.0.15.tar.gz (64.0 kB view details)

Uploaded Source

Built Distribution

pytask-0.0.15-py3-none-any.whl (72.4 kB view details)

Uploaded Python 3

File details

Details for the file pytask-0.0.15.tar.gz.

File metadata

  • Download URL: pytask-0.0.15.tar.gz
  • Upload date:
  • Size: 64.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/4.5.0 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.61.1 CPython/3.9.5

File hashes

Hashes for pytask-0.0.15.tar.gz
Algorithm Hash digest
SHA256 9eda27fdd44eed0ca4a3105b02fed4ccee6dcd90ca8e9f9a72af5f5734a085c6
MD5 45bf088e86d0d495eb0646c9dc8524bd
BLAKE2b-256 1b94d8f25ba67f67284d0c6209a2505e03cd30339eb4f5d0140b9617e4c2cb11

See more details on using hashes here.

File details

Details for the file pytask-0.0.15-py3-none-any.whl.

File metadata

  • Download URL: pytask-0.0.15-py3-none-any.whl
  • Upload date:
  • Size: 72.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/4.5.0 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.61.1 CPython/3.9.5

File hashes

Hashes for pytask-0.0.15-py3-none-any.whl
Algorithm Hash digest
SHA256 c92013551a6313a95077573bd6422bb1764319a92cbbe2ce9997368e02308645
MD5 b154c1bc5e420328d642a59624b851de
BLAKE2b-256 17d368b29a85e122394f7ddd2e734c92b8e2cfff4fb8cad4af2e2edd3c6892ce

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page