Build modular data pipelines running inside the postgres database

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

snorkysnark

Project description

Ralsei

Ralsei is a Python framework for building modular data pipelines running inside the postgres database.

It was built with use cases such as web scraping in mind, with the philosophy that all artifacts should be stored in the database: from downloaded html to the parsed results

Tests - Status

Features

Based on the jinja-psycopg library, combining type-safe SQL formatting with jinja's template language
Declarative with minimal boilerplate
Resumable pipelines, both at row-level and table-level granularity - no need to re-compute or re-download what has already been processed

Installation

pip install ralsei

Tip: consider using PDM or Poetry for project-based dependency management

Quick Start

First, create a script from the following template:

from ralsei import RalseiCli

def make_pipeline(args):
    return {} #  Declare your pipeline in the format "name": Task(...)

if __name__ == "__main__":
    cli = RalseiCli() # Here you can add custom arguments
    cli.run(make_pipeline)

To see some example pipelines, take a look at the Builtin Tasks section of the documentation

Alternatives

DBT - jinja + SQL based
more suitable for processing data that you already have
Kedro - python based, more suitable for processing data that you already have

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

snorkysnark

Release history Release notifications | RSS feed

3.1.0.post3

Oct 9, 2024

3.1.0.post2

Oct 8, 2024

3.1.0.post1

Oct 7, 2024

3.1.0

Oct 7, 2024

3.0.0.dev5 pre-release

Oct 7, 2024

3.0.0.dev4 pre-release

Oct 7, 2024

This version

3.0.0.dev3 pre-release

Sep 6, 2024

2.2.0

Nov 7, 2023

2.1.4

Sep 25, 2023

2.1.3

Sep 5, 2023

2.1.2

Sep 3, 2023

2.1.1

Sep 3, 2023

2.0.0rc1 pre-release

Aug 18, 2023

0.1.0

Jun 30, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ralsei-3.0.0.dev3.tar.gz (28.2 kB view hashes)

Uploaded Sep 6, 2024 Source

Built Distribution

ralsei-3.0.0.dev3-py3-none-any.whl (38.9 kB view hashes)

Uploaded Sep 6, 2024 Python 3

Hashes for ralsei-3.0.0.dev3.tar.gz

Hashes for ralsei-3.0.0.dev3.tar.gz
Algorithm	Hash digest
SHA256	`8789be80aa7e8ba9962a7e6e7b9f47bb85c9752b6a6a8432a15afc722e320b99`
MD5	`304918ba9dd0a84ef33be854081617f7`
BLAKE2b-256	`eda8454ca797cbe5fc36bf476f9687f9e88c6e4d5b3665f2c191b087c6eb99a5`

Hashes for ralsei-3.0.0.dev3-py3-none-any.whl

Hashes for ralsei-3.0.0.dev3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`017dc76b9c5131efc1cbe169fe86ed26395458ac2cd95618fefa926e0fc43422`
MD5	`00ab48cd843cc96a1f2bc7fe627af74c`
BLAKE2b-256	`5ea4816388d6c3d318c6e7e68c0973923045d4bc4eb45c59485f124996113e25`