mabel

Python Data Libraries

Project description

mabel is a fully-portable Data Engineering platform designed to run on low-spec compute nodes.

There is no server component, mabel just runs when you need it, where you want it.

Documentation GitHub Wiki
Bug Reports GitHub Issues
Feature Requests GitHub Issues
Source Code GitHub
Discussions GitHub Discussions

Focus on What Matters

We've built mabel to enable Data Analysts to write complex data engineering tasks quickly and easily, so they could get on with doing what they do best.

from mabel import operator
from mabel.operators import EndOperator

@operator
def say_hello(name):
    print(F"Hello, {name}!")

flow = say_hello > EndOperator()
with flow as runner:
    runner("world")  # Hello, world!

Key Features

Programatically define data pipelines
Treats datasets as immutable
On-the-fly compression
Automatic version tracking of processing operations
Trace messages through the pipeline (random sampling)
Automatic retry of failed operations
Low-memory requirements, even with terabytes of data
Indexing and partitioning of data for fast reads (beta)
Cursors for tracking reading position (beta)
SQL Query support (alpha)
Schema and data_expectations validation

Note:

alpha features are subject to change and are not recommended for production systems
beta features may change to resolve issues during testing

Installation

From PyPI (recommended)

pip install --upgrade mabel

From GitHub

pip install --upgrade git+https://github.com/mabel-dev/mabel

Guides

How to Write a Flow
How to Read Data

Dependencies

dateutil is used to convert dates received as strings
mmh3 is used for non-cryptographic hashing
pydantic is used to define internal data models
UltraJSON (AKA ujson) is used where orjson is not available. (1)
zstandard is used for real-time compression

There are a number of optional dependencies which are usually only required for specific features and functionality. These are listed in the requirements.txt file in the tests folder which is used for testing. The exception is orjson which is the preferred JSON library but not available on all platforms.

Integrations

mabel comes with adapters for the following services:

	Service	Support
	Google Cloud Storage	Read/Write
	MinIO	Read/Write
	S3	Read/Write

MongoDB and MQTT Readers are included in the base library but are not supported.

Deployment and Execution

mabel supports running on a range of platforms:

	Platform
	Docker
	Kubernetes
	Raspberry Pi (1)
	Windows (2)
	Linux (3)

MacOS also supported.

Adapters for other data services can be written.

Notice 1 - Raspbian fully functional with ujson.
Notice 2 - Multi-Processing not available on Windows. Alternate indexing libraries may be used on Windows.
Notice 3 - Tested on Debian and Ubuntu.

How Can I Contribute?

All contributions, bug reports, bug fixes, documentation improvements, enhancements, and ideas are welcome.

If you have a suggestion for an improvement or a bug, raise a ticket or start a discussion.

Want to help build mabel? See the contribution guidance.

License

Apache 2.0

Project details

Release history Release notifications | RSS feed

0.6.24

Jun 27, 2024

0.6.23

Jun 27, 2024

0.6.22

Jun 18, 2024

0.6.21

May 12, 2024

0.6.20

Apr 7, 2024

0.6.19

Jan 24, 2024

0.6.18

Jan 17, 2024

0.6.17

Nov 30, 2023

0.6.16

Nov 20, 2023

0.6.15

Nov 16, 2023

0.6.15a4 pre-release

Oct 21, 2023

0.6.15a3 pre-release

Oct 18, 2023

0.6.15a2 pre-release

Oct 17, 2023

0.6.15a1 pre-release

Oct 15, 2023

0.6.13

Jun 26, 2023

0.6.12

Dec 19, 2022

0.6.11

Dec 2, 2022

0.6.10

Nov 24, 2022

0.6.9

Oct 27, 2022

0.6.8

Oct 27, 2022

0.6.7

Sep 13, 2022

0.6.6

Sep 3, 2022

0.6.5

Aug 10, 2022

0.6.4

Aug 9, 2022

0.6.3

Aug 9, 2022

0.6.2

Aug 5, 2022

0.6.1

Aug 5, 2022

0.6.0

Aug 5, 2022

0.5.35

Jul 4, 2022

0.5.34

Jul 3, 2022

0.5.33

Jul 1, 2022

0.5.32

Jun 30, 2022

0.5.31

Jun 6, 2022

0.5.30

Jun 3, 2022

0.5.29

Jun 1, 2022

0.5.28

May 28, 2022

0.5.26

May 11, 2022

0.5.25

May 6, 2022

0.5.24

Apr 1, 2022

0.5.23

Mar 30, 2022

0.5.22

Mar 30, 2022

0.5.21

Mar 29, 2022

0.5.20

Mar 27, 2022

0.5.19

Mar 18, 2022

0.5.18

Mar 10, 2022

0.5.17

Mar 7, 2022

0.5.16

Mar 6, 2022

0.5.15

Mar 5, 2022

0.5.14

Mar 3, 2022

0.5.13

Mar 3, 2022

0.5.12

Feb 24, 2022

0.5.11

Feb 24, 2022

0.5.10

Feb 17, 2022

0.5.9

Feb 17, 2022

0.5.8

Feb 16, 2022

0.5.7

Feb 8, 2022

0.5.6

Feb 8, 2022

0.5.5

Jan 28, 2022

0.5.4

Jan 28, 2022

0.5.3

Jan 20, 2022

0.5.2

Jan 17, 2022

0.5.1

Jan 17, 2022

0.5.0

Jan 15, 2022

0.4.85

Nov 21, 2021

0.4.84

Nov 2, 2021

0.4.83

Oct 29, 2021

0.4.82

Oct 5, 2021

0.4.81

Sep 29, 2021

0.4.80

Sep 27, 2021

0.4.79

Sep 27, 2021

0.4.78

Sep 13, 2021

0.4.77

Sep 8, 2021

0.4.76

Sep 6, 2021

0.4.75

Sep 2, 2021

0.4.74

Aug 31, 2021

0.4.72

Aug 17, 2021

0.4.71

Aug 16, 2021

0.4.70

Aug 15, 2021

0.4.69

Jul 29, 2021

0.4.68

Jul 25, 2021

0.4.67

Jul 22, 2021

0.4.66

Jul 21, 2021

0.4.65

Jul 18, 2021

This version

0.4.64

Jul 17, 2021

0.4.63

Jul 15, 2021

0.4.62

Jul 15, 2021

0.4.61

Jul 12, 2021

0.4.60

Jul 9, 2021

0.4.59

Jul 3, 2021

0.4.58

Jun 25, 2021

0.4.57

Jun 19, 2021

0.4.55

Jun 15, 2021

0.4.54

Jun 14, 2021

0.4.53

Jun 14, 2021

0.4.52

Jun 14, 2021

0.4.51

Jun 13, 2021

0.4.50

Jun 12, 2021

0.4.49

Jun 11, 2021

0.4.48

Jun 11, 2021

0.4.47

Jun 10, 2021

0.4.46

Jun 9, 2021

0.4.45

Jun 8, 2021

0.4.44

Jun 7, 2021

0.4.43

Jun 4, 2021

0.4.42

May 31, 2021

0.4.41

May 31, 2021

0.4.40

May 29, 2021

0.4.39

May 29, 2021

0.4.38

May 26, 2021

0.4.37

May 25, 2021

0.4.36

May 24, 2021

0.4.35

May 23, 2021

0.4.34

May 22, 2021

0.4.33

May 22, 2021

0.4.32

May 21, 2021

0.4.31

May 21, 2021

0.4.30

May 20, 2021

0.4.29

May 20, 2021

0.4.28

May 20, 2021

0.4.27

May 19, 2021

0.4.26

May 19, 2021

0.4.25

May 16, 2021

0.4.24

May 14, 2021

0.4.23

May 14, 2021

0.4.22

May 13, 2021

0.4.21

May 12, 2021

0.4.20

May 12, 2021

0.4.19

May 11, 2021

0.4.18

May 10, 2021

0.4.17

May 8, 2021

0.4.16

May 8, 2021

0.4.15

May 5, 2021

0.4.14

May 3, 2021

0.4.13

May 3, 2021

0.4.12

May 1, 2021

0.4.11

Apr 29, 2021

0.4.10 yanked

Apr 29, 2021

Reason this release was yanked:

dependency issues

0.4.9

Apr 28, 2021

0.4.8

Apr 27, 2021

0.4.7

Apr 26, 2021

0.4.6 yanked

Apr 26, 2021

Reason this release was yanked:

installer issue

0.4.5 yanked

Apr 24, 2021

Reason this release was yanked:

installer issue

0.4.4

Apr 20, 2021

0.4.3

Apr 16, 2021

0.4.2

Apr 15, 2021

0.4.1

Apr 2, 2021

0.3.0a0 pre-release

Aug 20, 2021

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mabel-0.4.64.tar.gz (74.7 kB view hashes)

Uploaded Jul 17, 2021 Source

Built Distribution

mabel-0.4.64-py3-none-any.whl (105.1 kB view hashes)

Uploaded Jul 17, 2021 Python 3

Hashes for mabel-0.4.64.tar.gz

Hashes for mabel-0.4.64.tar.gz
Algorithm	Hash digest
SHA256	`7abff1700d0b553fbbd1d674b42c7b250e7d41c884970cea50abc170afd2073a`
MD5	`97fb0b4b88aaa10827db6ecf1dd33a43`
BLAKE2b-256	`3b25b158002a99f9e75a4d92d73113de78ed29eb1c5375c22d52512e9ac1f70a`

Hashes for mabel-0.4.64-py3-none-any.whl

Hashes for mabel-0.4.64-py3-none-any.whl
Algorithm	Hash digest
SHA256	`8ff9aececebb219346b9f31fd3ca10a74963b8105e11ca6068d70ba7e8b4e221`
MD5	`5491e3d930d3f9b0756d1b3a3a56b5e2`
BLAKE2b-256	`0402274593332eb2aea100a6806714f898d22888c35295b16668d784ebb2eb1d`