Seroflow is a Python package designed to help users build and execute efficient data pipelines.

Seroflow Package Guide

Welcome to Seroflow! This repository contains the source code, documentation, and examples to help you get started and contribute to this project.

Overview

Seroflow is a powerful Python package designed to help users build and execute efficient ETL (or ELT) data pipelines. ETL stands for Extract, Transform, and Load — a process that extracts data from various sources, transforms it into a suitable format, and then loads it into a destination system.

With Seroflow, each phase of your data pipeline is broken down into concrete steps: Extractor steps for data extraction, Transformation steps for data manipulation, and Loader steps for data loading. Think of it like assembling a Lego set: Seroflow provides the essential bricks, and you add the step objects you need, in order, to a Seroflow object, then call seroflow.execute() to run the entire pipeline.
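
The step-by-step pattern described above can be sketched in plain Python. This is only an illustration of the idea: the Pipeline class, add_step method, and the extract, transform, and load functions below are hypothetical stand-ins written for this example, not Seroflow's actual classes or signatures. Consult the package documentation for the real API.

```python
# Illustrative stand-ins only: none of these names come from Seroflow itself.
from typing import Callable, Dict, List

Rows = List[Dict[str, object]]


class Pipeline:
    """Toy pipeline that runs its steps in the order they were added."""

    def __init__(self) -> None:
        self.steps: List[Callable[[Rows], Rows]] = []

    def add_step(self, step: Callable[[Rows], Rows]) -> "Pipeline":
        self.steps.append(step)
        return self  # returning self allows chained add_step calls

    def execute(self) -> Rows:
        rows: Rows = []
        for step in self.steps:
            rows = step(rows)  # each step receives the previous step's output
        return rows


def extract(_rows: Rows) -> Rows:
    """Extractor step: produce raw records (hard-coded for the example)."""
    return [{"name": " Ada ", "age": "36"}, {"name": "Linus", "age": "54"}]


def transform(rows: Rows) -> Rows:
    """Transformation step: trim names and convert ages to integers."""
    return [{"name": str(r["name"]).strip(), "age": int(str(r["age"]))} for r in rows]


loaded: Rows = []  # stands in for a destination system


def load(rows: Rows) -> Rows:
    """Loader step: append the records to the in-memory destination."""
    loaded.extend(rows)
    return rows


pipeline = Pipeline().add_step(extract).add_step(transform).add_step(load)
result = pipeline.execute()
print(loaded)
```

Because every step takes rows in and hands rows back, steps can be reordered or swapped without changing the pipeline object itself, which is the "bricks" idea the paragraph describes.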

Additionally, the package supports creating custom Extractors, Loaders, and Transformations through its interfaces, along with features such as logging, caching, and chunking. It also comes preloaded with more than 70 predefined transformations.

To get started with Seroflow, head over to our Get Started page.

Prerequisites

Before installing the package, ensure you have the following:

  • Python Version: Python 3.10 or higher.
  • pip: The latest version of pip. You can upgrade pip with:
python -m pip install --upgrade pip
  • Virtual Environment (Recommended): Use venv or virtualenv to create an isolated environment, then activate it:
python -m venv venv_name

venv_name\Scripts\activate     # Windows
source venv_name/bin/activate  # macOS and Linux
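
To confirm that the active interpreter meets the Python 3.10 requirement listed above, a small check along these lines can be run inside the virtual environment. The helper name meets_minimum is made up for this sketch; only the standard library is used.

```python
import sys

MIN_VERSION = (3, 10)  # minimum Python version stated in the prerequisites


def meets_minimum(version=sys.version_info, minimum=MIN_VERSION) -> bool:
    """Return True if the (major, minor) part of version is at least minimum."""
    return tuple(version[:2]) >= minimum


if meets_minimum():
    print("Python version is supported.")
else:
    found = ".".join(str(part) for part in sys.version_info[:3])
    print(f"Python {MIN_VERSION[0]}.{MIN_VERSION[1]} or higher is required, found {found}.")
```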

Installation

Using pip

To install the package from PyPI, run the following command:

pip install seroflow
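
After installation, one way to check that the package is visible to the current environment is to query its installed metadata. The sketch below uses only the standard library's importlib.metadata; the helper name installed_version is an assumption for this example.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(package: str) -> Optional[str]:
    """Return the installed version of package, or None if it is not installed."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


found = installed_version("seroflow")
if found is None:
    print("seroflow is not installed in this environment.")
else:
    print(f"seroflow {found} is installed.")
```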

Installing from Source

If you want to install the package directly from the source code, follow these steps:

  1. Clone the repository:
git clone https://github.com/.../seroflow.git
cd seroflow
  2. Install dependencies:
pip install -r requirements.txt

Documentation

Below are some links to the documentation provided for the various components in the Seroflow package. It is recommended that users review some of these components prior to using the package.

License

This project is licensed under the MIT License. Please review the license for more details.

Contributing

We welcome contributions! Please read our CONTRIBUTING.md for guidelines on how to contribute to the project.

Download files

Source Distribution

seroflow-1.0.0.tar.gz (72.0 kB view details)

Uploaded Source

Built Distribution

seroflow-1.0.0-py3-none-any.whl (94.2 kB view details)

Uploaded Python 3

File details

Details for the file seroflow-1.0.0.tar.gz.

File metadata

  • Download URL: seroflow-1.0.0.tar.gz
  • Upload date:
  • Size: 72.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.5

File hashes

Hashes for seroflow-1.0.0.tar.gz
Algorithm Hash digest
SHA256 43342cc73eb09a9eebc6e1cb568a325b321654634d2023a5e699151f1e72f112
MD5 4444f4405d0297f4a4c0874ce421f772
BLAKE2b-256 439e752c4889b294401e0b876c5c5d547e2233676af86742972d5dc38e2ce78a
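
A downloaded archive can be checked against the SHA256 digest listed above. The following is a minimal sketch using the standard library's hashlib; the local file path is an assumption (it presumes the archive was saved to the current directory), and the helper name sha256_of is made up for this example.

```python
import hashlib
from pathlib import Path

# SHA256 digest copied from the listing above for seroflow-1.0.0.tar.gz.
EXPECTED_SHA256 = "43342cc73eb09a9eebc6e1cb568a325b321654634d2023a5e699151f1e72f112"


def sha256_of(path, chunk_size: int = 65536) -> str:
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with Path(path).open("rb") as handle:
        for block in iter(lambda: handle.read(chunk_size), b""):
            digest.update(block)
    return digest.hexdigest()


archive = Path("seroflow-1.0.0.tar.gz")  # assumed download location
if archive.exists():
    matches = sha256_of(archive) == EXPECTED_SHA256
    print("Digest matches." if matches else "Digest MISMATCH: do not install this file.")
else:
    print(f"{archive} not found; download it first.")
```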

File details

Details for the file seroflow-1.0.0-py3-none-any.whl.

File metadata

  • Download URL: seroflow-1.0.0-py3-none-any.whl
  • Upload date:
  • Size: 94.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.5

File hashes

Hashes for seroflow-1.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 bdcf2717fd101d0c59b0d5a8a86c271fbaf89051345f8fa8a5d1666d964e26c6
MD5 f0eef9eeafe311f6e1c0da6d8465c5c8
BLAKE2b-256 4e0e42651044658317c7deb79446befba360ab0dcd585096386aa1870e060e5d
