Package for working with pandas DataFrames, with specialized functions used for Energinet

Project description

Datamazing

The Datamazing package provides an interface for common data transformations (filtering, aggregation, merging, etc.).

Interface

The interface is similar to that of most DataFrame tools (pandas, pyspark, SQL, etc.). For example, a group-by is written as group(df, by=["..."]), and a merge as merge([df1, df2], on=["..."], how="inner"). So why not just use native pandas, pyspark, etc.? There are two reasons:

Parts of the native libraries have an awkward interface (for example, pandas' inconsistent use of indexing).
The package can add custom operations used specifically in the Energinet domain.
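
To make the indexing point concrete, here is a minimal sketch in plain pandas. It does not use or reproduce datamazing's code. The helpers group_mean and merge_all are hypothetical names chosen for illustration; they only mimic the shape of the group(df, by=[...]) and merge([df1, df2], on=[...], how="inner") calls mentioned above.

```python
from functools import reduce

import pandas as pd

ages = pd.DataFrame({"animal": ["cat", "cat", "dog"], "age": [1.0, 3.0, 5.0]})
weights = pd.DataFrame({"animal": ["cat", "dog"], "weight": [4.0, 20.0]})

# Native pandas: the group key "animal" moves into the index of the result,
# so it is no longer an ordinary column.
indexed = ages.groupby("animal").agg({"age": "mean"})


def group_mean(frame: pd.DataFrame, by: list) -> pd.DataFrame:
    """Hypothetical helper: group-by mean that keeps the keys as columns."""
    return frame.groupby(by, as_index=False).mean(numeric_only=True)


def merge_all(frames: list, on: list, how: str = "inner") -> pd.DataFrame:
    """Hypothetical helper: merge a list of frames pairwise, left to right."""
    return reduce(lambda left, right: left.merge(right, on=on, how=how), frames)


flat = group_mean(ages, by=["animal"])
combined = merge_all([flat, weights], on=["animal"], how="inner")
print(combined)
```

In this sketch every result keeps its keys as regular columns, so the output of one step can be passed straight into the next without resetting an index.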

Backends

The package contains methods with the same interface for different backends. Currently, two backends are supported: pandas and pyspark (though not all methods are available for both). For example, when working with pandas DataFrames, one would use

import pandas as pd
import datamazing.pandas as pdz

# Sample data: one row per animal and timestamp
df = pd.DataFrame([
    {"animal": "cat", "time": pd.Timestamp("2020-01-01"), "age": 1.0},
    {"animal": "cat", "time": pd.Timestamp("2020-01-02"), "age": 3.0},
    {"animal": "dog", "time": pd.Timestamp("2020-01-01"), "age": 5.0},
])

# Group by animal, resample each group to a 12-hour resolution,
# and interpolate the resampled values
result = (
    pdz.group(df, by="animal")
    .resample(on="time", resolution=pd.Timedelta(hours=12))
    .agg("interpolate")
)

whereas, when working with pyspark DataFrames, one would instead use

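import datetime as dt
import pandas as pd  # needed for pd.Timedelta below
import pyspark.sql as ps
import datamazing.pyspark as psz

# Assumes a Spark session is already active (for example in a notebook);
# getActiveSession() returns None otherwise
spark = ps.SparkSession.getActiveSession()

df = spark.createDataFrame([
    {"animal": "cat", "time": dt.datetime(2020, 1, 1), "age": 1.0},
    {"animal": "cat", "time": dt.datetime(2020, 1, 2), "age": 3.0},
    {"animal": "dog", "time": dt.datetime(2020, 1, 1), "age": 5.0},
])

# Same chain as in the pandas example, using the pyspark backend
result = (
    psz.group(df, by="animal")
    .resample(on="time", resolution=pd.Timedelta(hours=12))
    .agg("interpolate")
)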
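
For readers who want to see what the chain above amounts to on the sample data, the following is a rough native-pandas sketch. It assumes that "interpolate" means linear interpolation between the known points of each group; that is an assumption about datamazing's behavior, not something stated above. The helper name resample_interpolate is hypothetical.

```python
import pandas as pd

df = pd.DataFrame([
    {"animal": "cat", "time": pd.Timestamp("2020-01-01"), "age": 1.0},
    {"animal": "cat", "time": pd.Timestamp("2020-01-02"), "age": 3.0},
    {"animal": "dog", "time": pd.Timestamp("2020-01-01"), "age": 5.0},
])


def resample_interpolate(
    frame: pd.DataFrame, by: str, on: str, resolution: pd.Timedelta
) -> pd.DataFrame:
    """Hypothetical helper: per-group upsampling with linear interpolation."""
    parts = []
    for key, part in frame.groupby(by):
        # Index the group by time so that it can be resampled
        indexed = part.drop(columns=[by]).set_index(on)
        # Insert the missing timestamps (as NaN), then fill them linearly
        upsampled = indexed.resample(resolution).asfreq().interpolate(method="linear")
        # Restore the group key as an ordinary column
        upsampled[by] = key
        parts.append(upsampled.reset_index())
    return pd.concat(parts, ignore_index=True)


result = resample_interpolate(
    df, by="animal", on="time", resolution=pd.Timedelta(hours=12)
)
print(result)
```

The explicit loop, index juggling, and final concatenation are the kind of bookkeeping that a single group/resample/agg chain hides.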

Development

To set up the Python environment, run

$ pip install poetry
$ poetry install

To run the tests locally, Java is required. It can be installed with:

$ sudo apt install default-jdk

To execute unit tests, run

$ pytest .

Project details

Release history

Version	Release date
9.0.1	May 1, 2026
9.0.0	Apr 30, 2026
8.0.8	Apr 28, 2026
8.0.7	Apr 27, 2026
8.0.6	Apr 20, 2026
8.0.5	Apr 17, 2026
8.0.4	Apr 15, 2026
8.0.3	Apr 9, 2026
8.0.2	Mar 25, 2026
8.0.1	Mar 20, 2026
8.0.0	Mar 20, 2026
7.0.2	Mar 5, 2026
7.0.1	Feb 24, 2026
7.0.0	Feb 18, 2026
6.0.2	Jan 27, 2026
6.0.1	Jan 27, 2026
6.0.0	Jan 26, 2026
5.2.0 (this version)	Jan 7, 2026
5.1.6	Jul 24, 2025
5.1.5	Nov 22, 2024
5.1.4	Nov 19, 2024
5.1.3	Nov 15, 2024
5.1.2	Oct 10, 2024
5.1.1	Oct 4, 2024
5.1.0	Sep 27, 2024
5.0.3	Sep 9, 2024
5.0.2	Aug 14, 2024
5.0.1	Jul 30, 2024
5.0.0	Jul 24, 2024
4.3.4	Jun 17, 2024
4.3.3	May 2, 2024
4.3.1	Apr 30, 2024
4.3.0	Apr 22, 2024
4.2.0	Apr 8, 2024
4.1.2	Mar 20, 2024
4.1.1	Mar 6, 2024
4.1.0	Feb 21, 2024
4.0.6	Feb 15, 2024
4.0.5	Feb 13, 2024
4.0.4	Feb 8, 2024
4.0.3	Jan 29, 2024
4.0.2	Jan 23, 2024
4.0.1	Jan 8, 2024
4.0.0	Jan 3, 2024
3.1.2	Nov 20, 2023
3.1.1	Nov 20, 2023
3.1.0	Nov 14, 2023
3.0.7	Nov 3, 2023
3.0.6	Oct 30, 2023
3.0.5	Oct 27, 2023
3.0.4	Oct 26, 2023
3.0.3	Oct 23, 2023
3.0.2	Oct 13, 2023
3.0.1	Aug 17, 2023
3.0.0	Aug 17, 2023
2.2.0	Aug 16, 2023
2.1.2	Aug 9, 2023
2.0.2	Jul 26, 2023
2.0.1	Jul 21, 2023
2.0.0	Jul 18, 2023
1.0.3	Jul 18, 2023
1.0.2	Jul 17, 2023
1.0.1	Jul 14, 2023
1.0.0	Jul 14, 2023
0.0.11	Jul 13, 2023
0.0.10	Jul 12, 2023
0.0.9	Jul 12, 2023
0.0.8	Jul 11, 2023
0.0.7	Jul 10, 2023
0.0.6	Jun 23, 2023
0.0.5	Jun 23, 2023
0.0.4	Jun 23, 2023
0.0.3	Jun 20, 2023
0.0.2	Jun 19, 2023

Download files

Source Distribution

datamazing-5.2.0.tar.gz (13.6 kB)

Uploaded Jan 7, 2026 Source

Built Distribution

datamazing-5.2.0-py3-none-any.whl (21.7 kB)

Uploaded Jan 7, 2026 Python 3

File details

Details for the file datamazing-5.2.0.tar.gz.

File metadata

Download URL: datamazing-5.2.0.tar.gz
Upload date: Jan 7, 2026
Size: 13.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: poetry/2.2.1 CPython/3.10.19 Linux/6.11.0-1018-azure

File hashes

Hashes for datamazing-5.2.0.tar.gz
Algorithm	Hash digest
SHA256	`930e439df9adf62b6c43dceacb1bf8d3acc9242b6de72d9ce66c6d1a53fe7c29`
MD5	`7cbd3578197e2e3d32d7750395e824ea`
BLAKE2b-256	`5ba6e75f26d876195d3652bded27aa572e1e0defb31ee5aabac0bbe71765909c`

File details

Details for the file datamazing-5.2.0-py3-none-any.whl.

File metadata

Download URL: datamazing-5.2.0-py3-none-any.whl
Upload date: Jan 7, 2026
Size: 21.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: poetry/2.2.1 CPython/3.10.19 Linux/6.11.0-1018-azure

File hashes

Hashes for datamazing-5.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`7eddd2680d35915bd0dc9e997046489cfa58410d8be45e5b352ae0ed1945a9f5`
MD5	`cdf50278c318474e36b9c8e88e5df245`
BLAKE2b-256	`7dc9baa0aed1c7300adac47f5594fc8a5fde8ef2b3b2be0902b114e94ea8fa77`
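
The SHA256 digests listed above can be checked against a downloaded file. The following is a minimal sketch using only the Python standard library; the local file path is an assumption (the file is expected in the current working directory), and the helper names sha256_of and matches_published_hash are illustrative.

```python
import hashlib
from pathlib import Path

# SHA256 digest published above for datamazing-5.2.0-py3-none-any.whl
EXPECTED_SHA256 = "7eddd2680d35915bd0dc9e997046489cfa58410d8be45e5b352ae0ed1945a9f5"


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path: Path, expected: str) -> bool:
    """Compare a file's digest with a published one, ignoring letter case."""
    return sha256_of(path) == expected.lower()


# Assumed download location; nothing happens if the file is not present
wheel = Path("datamazing-5.2.0-py3-none-any.whl")
if wheel.exists():
    print("hash matches:", matches_published_hash(wheel, EXPECTED_SHA256))
```

The same helper works for the source distribution by passing its file name and the corresponding digest from the first table.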
