Skip to main content

Parallel pipelines for Python

Project description

Description

PaPy, which stands for parallel pipelines in Python, is a highly flexible framework that addresses the problem of creating scalable workflows to process or generate data. A workflow is created from Python functions(nodes) with well-defined call/return semantics, connected by pipes(edges) into a directed acyclic graph. Given the topology and input data, these functions are composed into nested higher-order maps, which are transparently and robustly evaluated in parallel on a single computer or remote hosts. The local and remote computational resources can be flexibly pooled and assigned to functional nodes. This allows to easily load-balance a pipeline and optimize the throughput. Data traverses the graph in batches of adjustable size: a trade-off between lazy-evaluation, parallelism and memory consuption. The simplicity and flexibility of distributed workflows using PaPy bridges the gap between desktop and grid.

Installation

The easiest way to get PaPy is if you have setuptools installed:

easy_install papy

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

papy-1.0b1.tar.gz (1.8 MB view details)

Uploaded Source

File details

Details for the file papy-1.0b1.tar.gz.

File metadata

  • Download URL: papy-1.0b1.tar.gz
  • Upload date:
  • Size: 1.8 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for papy-1.0b1.tar.gz
Algorithm Hash digest
SHA256 390608b28c29abbdb5910f4b7d22a8cdaf1ca8bc83d6d31d29b7e6467aeb89f6
MD5 2cfa9ac9791eea16fdf8e3c34940e924
BLAKE2b-256 c29a3e8f225dfbc211c87cbc9da2f3158c15b66a8394ddd36cc7db1bfbc8aa9f

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page