Skip to main content

dynamic, declarative data transformations with automatic code generation

Project description

convtools

convtools is a specialized Python library designed for defining data transformations dynamically using a declarative approach. It automatically generates custom Python code for the user in the background.

ChatGPT's take on it :)

Introducing ConvTools, the revolutionary Python library for dynamic data
transformations! With its declarative approach, transforming data has never
been easier. Say goodbye to manual coding, ConvTools automatically
generates the code for you, saving you time and effort. Whether you're
working with large datasets or need to make real-time updates, ConvTools is
the solution you've been searching for. Get ready to experience the power
of efficient and effortless data transformation!

License codecov Tests status Docs status PyPI Twitter Downloads Python versions


Installation

pip install convtools

Documentation

convtools.readthedocs.io

Group by example

from convtools import conversion as c

input_data = [
    {"a": 5, "b": "foo"},
    {"a": 10, "b": "foo"},
    {"a": 10, "b": "bar"},
    {"a": 10, "b": "bar"},
    {"a": 20, "b": "bar"},
]

conv = (
    c.group_by(c.item("b"))
    .aggregate(
        {
            "b": c.item("b"),
            "a_first": c.ReduceFuncs.First(c.item("a")),
            "a_max": c.ReduceFuncs.Max(c.item("a")),
        }
    )
    .pipe(
        c.aggregate({
            "b_values": c.ReduceFuncs.Array(c.item("b")),
            "mode_a_first": c.ReduceFuncs.Mode(c.item("a_first")),
            "median_a_max": c.ReduceFuncs.Median(c.item("a_max")),
        })
    )
    .gen_converter()
)

assert conv(input_data) == {
    'b_values': ['foo', 'bar'],
    'mode_a_first': 10,
    'median_a_max': 15.0
}
Built-in reducers like c.ReduceFuncs.First
* Sum
* SumOrNone
* Max
* MaxRow
* Min
* MinRow
* Count
* CountDistinct
* First
* Last
* Average
* Median
* Percentile
* Mode
* TopK
* Array
* ArrayDistinct
* ArraySorted

DICT REDUCERS ARE IN FACT AGGREGATIONS THEMSELVES, BECAUSE VALUES GET REDUCED.
* Dict
* DictArray
* DictSum
* DictSumOrNone
* DictMax
* DictMin
* DictCount
* DictCountDistinct
* DictFirst
* DictLast

AND LASTLY YOU CAN DEFINE YOUR OWN REDUCER BY PASSING ANY REDUCE FUNCTION
OF TWO ARGUMENTS TO ``c.reduce``.

What's the point if there are tools like Pandas / Polars?

  • convtools doesn't need to wrap data in a container to provide functionality, it simply runs the python code it generates on any input
  • convtools is lightweight (though optional black is highly recommended for pretty-printing generated code out of curiosity)
  • convtools fosters building pipelines on top of iterators, allowing for stream processing
  • convtools supports nested aggregations
  • convtools is a set of primitives for code generation, so it's just different.

Reporting a Security Vulnerability

See the security policy.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

convtools-1.3.0.tar.gz (58.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

convtools-1.3.0-py3-none-any.whl (66.6 kB view details)

Uploaded Python 3

File details

Details for the file convtools-1.3.0.tar.gz.

File metadata

  • Download URL: convtools-1.3.0.tar.gz
  • Upload date:
  • Size: 58.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: python-httpx/0.23.3

File hashes

Hashes for convtools-1.3.0.tar.gz
Algorithm Hash digest
SHA256 51b10c28bd79b577811f82813412e7564151ac41ca04790daa0bf8bcd2f4bbd2
MD5 041042a086a56197e610af1bfb54f129
BLAKE2b-256 0054c66619ac4c2dcd0635c97247991c8ee516db8dd756e4d4ed5439fe35918f

See more details on using hashes here.

File details

Details for the file convtools-1.3.0-py3-none-any.whl.

File metadata

  • Download URL: convtools-1.3.0-py3-none-any.whl
  • Upload date:
  • Size: 66.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: python-httpx/0.23.3

File hashes

Hashes for convtools-1.3.0-py3-none-any.whl
Algorithm Hash digest
SHA256 db359e2c634c64658587f5a5b8707ce6dc59d9dd64021d7ecd4041b196bac979
MD5 fc80931f94ab0f0a909f9b8de6ee255a
BLAKE2b-256 dc1df9adc9dd344d4ccfd9ca4fe62f0668caac962212a0d8ce520afe0985b479

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page