Skip to main content

dynamic, declarative data transformations with automatic code generation

Project description

convtools

convtools is a specialized Python library designed for defining data transformations dynamically using a declarative approach. It automatically generates custom Python code for the user in the background.

ChatGPT's take on it :)

Introducing ConvTools, the revolutionary Python library for dynamic data
transformations! With its declarative approach, transforming data has never
been easier. Say goodbye to manual coding, ConvTools automatically
generates the code for you, saving you time and effort. Whether you're
working with large datasets or need to make real-time updates, ConvTools is
the solution you've been searching for. Get ready to experience the power
of efficient and effortless data transformation!

License codecov Tests status Docs status PyPI Twitter Downloads Python versions


Installation

pip install convtools

Documentation

convtools.readthedocs.io

Group by example

from convtools import conversion as c

input_data = [
    {"a": 5, "b": "foo"},
    {"a": 10, "b": "foo"},
    {"a": 10, "b": "bar"},
    {"a": 10, "b": "bar"},
    {"a": 20, "b": "bar"},
]

conv = (
    c.group_by(c.item("b"))
    .aggregate(
        {
            "b": c.item("b"),
            "a_first": c.ReduceFuncs.First(c.item("a")),
            "a_max": c.ReduceFuncs.Max(c.item("a")),
        }
    )
    .pipe(
        c.aggregate({
            "b_values": c.ReduceFuncs.Array(c.item("b")),
            "mode_a_first": c.ReduceFuncs.Mode(c.item("a_first")),
            "median_a_max": c.ReduceFuncs.Median(c.item("a_max")),
        })
    )
    .gen_converter()
)

assert conv(input_data) == {
    'b_values': ['foo', 'bar'],
    'mode_a_first': 10,
    'median_a_max': 15.0
}
Built-in reducers like c.ReduceFuncs.First
* Sum
* SumOrNone
* Max
* MaxRow
* Min
* MinRow
* Count
* CountDistinct
* First
* Last
* Average
* Median
* Percentile
* Mode
* TopK
* Array
* ArrayDistinct
* ArraySorted

DICT REDUCERS ARE IN FACT AGGREGATIONS THEMSELVES, BECAUSE VALUES GET REDUCED.
* Dict
* DictArray
* DictSum
* DictSumOrNone
* DictMax
* DictMin
* DictCount
* DictCountDistinct
* DictFirst
* DictLast

AND LASTLY YOU CAN DEFINE YOUR OWN REDUCER BY PASSING ANY REDUCE FUNCTION
OF TWO ARGUMENTS TO ``c.reduce``.

What's the point if there are tools like Pandas / Polars?

  • convtools doesn't need to wrap data in a container to provide functionality, it simply runs the python code it generates on any input
  • convtools is lightweight (though optional black is highly recommended for pretty-printing generated code out of curiosity)
  • convtools fosters building pipelines on top of iterators, allowing for stream processing
  • convtools supports nested aggregations
  • convtools is a set of primitives for code generation, so it's just different.

Reporting a Security Vulnerability

See the security policy.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

convtools-1.2.0.tar.gz (58.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

convtools-1.2.0-py3-none-any.whl (65.7 kB view details)

Uploaded Python 3

File details

Details for the file convtools-1.2.0.tar.gz.

File metadata

  • Download URL: convtools-1.2.0.tar.gz
  • Upload date:
  • Size: 58.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: python-httpx/0.23.3

File hashes

Hashes for convtools-1.2.0.tar.gz
Algorithm Hash digest
SHA256 1649b0fdf8afeafe0392cd0ae6aa57b3942da70de39881120089faae98386989
MD5 47d7b47b65a0d56011f1b53c68f494d8
BLAKE2b-256 b04a57244afe9144c7a4cfe4497fca7c400ffe88315988690a21a733753856c0

See more details on using hashes here.

File details

Details for the file convtools-1.2.0-py3-none-any.whl.

File metadata

  • Download URL: convtools-1.2.0-py3-none-any.whl
  • Upload date:
  • Size: 65.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: python-httpx/0.23.3

File hashes

Hashes for convtools-1.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 54d6ed2133319b01ed55a07aa251e381c287b85e45d29a44b2de2aee54b0d38d
MD5 37fa5951f2ba310ba00bc1df2fc78aad
BLAKE2b-256 ded7ea9ea374ccb5f155c17a000c54a31d6efc95a7014c464a8fff763cc9d167

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page