Skip to main content

The easiest way to create data pipelines in Python.

Project description

prism logo

linux tests badge macos tests badge windows tests badge style tests badge

Prism is currently in alpha.

:wave: Welcome to Prism!

Prism is the easiest way to create data pipelines in Python.

Introduction

Data projects often require multiple steps that need to be executed in sequence (think extract-transform-load, data wrangling, etc.). With Prism, users can break down their project into modular tasks, manage dependencies, and execute complex computations in sequence.

Here are some of Prism's main features:

  • Modules as tasks: Unlike other orchestration platforms, Prism allows tasks to live within their own modules. This not only helps with readability and QC, but also enables users to build powerful, complex pipelines that scale alongside their project.
  • Real-time dependency declaration: With Prism, analysts can declare dependencies using a simple function call. No need to explicitly keep track of the pipeline order — at runtime, Prism automatically parses the function calls and builds the dependency graph.
  • Flexible CLI: Analysts can instantiate, compile, and run projects using a simple command-line interface.
  • Integrations: Prism integrates with several tools that are popular in the data community, including dbt, Snowflake, PySpark and Google BigQuery. We're adding more integrations every day, so let us know what you'd like to see!

Getting Started

Prism can be installed via pip. Prism requires Python >= 3.7.

pip install prism-ds

To get started with Prism projects, check out our documentation. Some sections of interest include:

In addition, check out some example projects.

Product Roadmap

We're always looking to improve our product. Here's what we're working on at the moment:

  • DAG visualizer: a clean UI for visualizing the data flow between tasks
  • Python API: an API that packages projects into a single class that can be called in other programs
  • Additional adapters: Postgres, Redshift, and more!
  • Cloud deployment: deploying projects on Amazon EMR clusters, Docker containers, Databricks clusters, and more!

Let us know if you'd like to see another feature!

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

prism-ds-0.1.3a0.tar.gz (70.5 kB view hashes)

Uploaded Source

Built Distribution

prism_ds-0.1.3a0-py3-none-any.whl (155.1 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page