The easiest way to create data pipelines in Python.
Reason this release was yanked:
broken dependency declarations
Project description
Prism is currently in alpha.
:wave: Welcome to Prism!
Prism is the easiest way to create data pipelines in Python.
Introduction
Data projects often require multiple steps that need to be executed in sequence (think extract-transform-load, data wrangling, etc.). With Prism, users can break down their project into modular tasks, manage dependencies, and execute complex computations in sequence.
Here are some of Prism's main features:
- Modules as tasks: Unlike other orchestration platforms, Prism allows tasks to live within their own modules. This not only helps with readability and QC, but also enables users to build powerful, complex pipelines that scale alongside their project.
- Real-time dependency declaration: With Prism, analysts can declare dependencies using a simple function call. No need to explicitly keep track of the pipeline order — at runtime, Prism automatically parses the function calls and builds the dependency graph.
- Flexible CLI: Analysts can instantiate, compile, and run projects using a simple command-line interface.
- Integrations: Prism integrates with several tools that are popular in the data community, including Snowflake, Google BigQuery, Redshift, PySpark, and dbt. We're adding more integrations every day, so let us know what you'd like to see!
Getting Started
Prism can be installed via pip
. Prism requires Python >= 3.7.
pip install --upgrade pip
pip install prism-ds
Prism also supports a variety of adapters, including Snowflake, Google BigQuery, Google BigQuery, PySpark, and dbt. These can be installed via pip
:
pip install --upgrade pip
pip install "prism-ds[snowflake]"
pip install "prism-ds[bigquery]"
pip install "prism-ds[redshift]"
pip install "prism-ds[pyspark]"
pip install "prism-ds[dbt]"
To get started with Prism projects, check out our documentation. Some sections of interest include:
- :key: Fundamentals
- :seedling: CLI
- :electric_plug: Integrations
- :bulb: Use Cases
In addition, check out some example projects.
Product Roadmap
We're always looking to improve our product. Here's what we're working on at the moment:
- DAG visualizer: a clean UI for visualizing the data flow within and between tasks
- Additional adapters: MySQL, Postgres, Redis, Slack, and more!
- Cloud deployment: deploying and managing projects on Amazon EMR clusters, Docker containers, Databricks clusters, and more!
Let us know if you'd like to see another feature!
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for prism_ds-0.1.4a0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 6f7197168ce6edbd7d93645075d39dd6e12e13babd3da384a38e7a874d0e4ea3 |
|
MD5 | f689201477f0db21177163b30bf3ebae |
|
BLAKE2b-256 | 1d715379f7083dfee75ece1af2616867625e3c1b776493920af114eb079f9546 |