Get DB aggregations using Django ORM

These details have not been verified by PyPI

Project links

Homepage

Development Status
- 4 - Beta
Framework
- Django
Intended Audience
- Developers
License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python
- Python :: 3

Project description

Django Rest Framework Aggregation

DRF Mixin for getting aggregations

Key features:

can get multiple aggregations at once
can calculate percentile and percent (must be enabled separately)
grouping by multiple fields
time series (except SQLite)
limiting the number of displayed records

Installing

For installing use pip

    pip install drf-aggregation

Usage

Register mixin

The simplest variant of usage is to create a ViewSet with the provided mixin

from drf_aggregation import AggregationMixin


class TicketViewSet(AggregationMixin, GenericViewSet):
    queryset = Ticket.objects.all()
    serializer_class = TicketSerializer

urlpatterns = [
    path("aggregation/ticket", TicketViewSet.as_view({"post": "aggregation"})),
]

After that you can use it

POST /aggregation/ticket
Content-Type: application/json
{
    "group_by": "service",
    "limit": 5,
    "order_by": "-total_tasks",
    "aggregations": {
        "total_tasks": {
            "type": "count"
        },
        "average_execution_time": {
            "type": "average",
            "field": "execution_time",
        }
    }
}

Usage in code

Almost all a mixin does is call a function that you can use directly at your way

from drf_aggregation import get_aggregations

result = get_aggregations(
    queryset=Ticket.objects.all(),
    aggregations={
        "total_tasks": {
            "type": "count"
        },
    }
)

Available params

aggregations - dictionary with aggregations to obtain
- key - the key under which the aggregation result will be returned
- value - dictionary with aggregation settings
  - type - aggregation type
  - index_by_group - add an index relative to the specified field for further sorting by it
  - field - required for aggregations: sum, average, minimum, maximum, percentile
  - percentile - from 0 to 1, required for percentile
  - additional_filter - filter parser is used from package drf-complex-filter, required for percent
group_by - list of fields to group the result
order_by - list of fields to sort the result
limit - number of groups to return or dictionary with settings:
- limit - number of groups to return
- offset - shift start of returned groups
- by_group - which group to limit the result by, by default - the first field for grouping
- by_aggregation - which aggregation to limit the result by, by default - the first declared aggregation
- show_other - return the remaining records as one additional group
- other_label - label of additional group with recordings beyond the limit

Supported field types

IntegerField
FloatField
DateField (only minimum and maximum)
DateTimeField (only minimum and maximum)
DurationField

Extend aggregation types

By default, only these aggregations are enabled: count, distinct, sum, average, minimum, maximum

Package provide two more aggregations - percent and percentile. But to use them, you need to enable them manually:

# in settings.py
DRF_AGGREGATION_SETTINGS = {
    "AGGREGATION_CLASSES": [
        "drf_aggregation.aggregations.common.CommonAggregations",

        # need to install additional package "drf-complex-filter"
        "drf_aggregation.aggregations.percent.PercentAggregation",

        # works only on PostgreSQL
        "drf_aggregation.aggregations.percentile.PercentileAggregation",
    ],
}

You can also create your own aggregations. To do this, create a class with static methods that will be available as an aggregation type

class MyAwesomeAggregations:
    @staticmethod
    def my_aggregation(aggregation: Aggregation, queryset: models.QuerySet):
        name = aggregation.get("name")
        return {f"{name}": models.Count("id")}

# in settings.py
DRF_AGGREGATION_SETTINGS = {
    "AGGREGATION_CLASSES": [
        "drf_aggregation.aggregations.common.CommonAggregations",
        "path.to.MyAwesomeAggregations",
    ],
}

result = get_aggregations(
    queryset=Ticket.objects.all(),
    aggregations={
        "value": {
            "type": "my_aggregation"
        },
    }
)

Usage examples

Grouping results

To group the result, a comma-separated list of required fields is passed

result = get_aggregations(
    queryset=Ticket.objects.all(),
    aggregations={
        "total_tasks": {
            "type": "count"
        },
    },
    group_by=["field1", "field2"]
)

Sorting the result

When grouping by one field, it is enough to pass a list of fields by which you need to sort the result

result = get_aggregations(
    queryset=Ticket.objects.all(),
    aggregations={
        "total_tasks": {
            "type": "count"
        },
    },
    group_by="field1",
    order_by="field1"
)

The requested aggregations can be used as a sorting key

result = get_aggregations(
    queryset=Ticket.objects.all(),
    aggregations={
        "total_tasks": {
            "type": "count"
        },
    },
    group_by="field1",
    order_by="-total_tasks"
)

When grouping by multiple fields, you can add an index for the desired group and aggregation pair, after which you can use this index for sorting.

result = get_aggregations(
    queryset=Ticket.objects.all(),
    aggregations={
        "total_tasks": {
            "type": "count",
            "index_by_group": "field1"
        },
    },
    group_by=["field1", "field2"],
    order_by="-field1__total_tasks"
)

Limiting the number of displayed groups

If you have a large number of categories or you need to display only top-N, it is possible to limit the number of returned records

result = get_aggregations(
    queryset=Ticket.objects.all(),
    aggregations={
        "total_tasks": {
            "type": "count",
        },
    },
    group_by="field1",
    order_by="-total_tasks",
    limit=2
)

It is also possible to display all other groups as one additional category

result = get_aggregations(
    queryset=Ticket.objects.all(),
    aggregations={
        "total_tasks": {
            "type": "count",
        },
    },
    group_by="field1",
    order_by="-total_tasks",
    limit={
        "limit": 2,
        "show_other": true
    }
)

Other parameters to limit:

by_group - field for selecting the values that will remain, if not passed, the first field for grouping is used
by_aggregation
show_other - if true, all groups not included in the top will be displayed as one additional category
other_label - label for additional category, default "Other"

Time series

Warning! Doesn't work on SQLite because it doesn't have date / time fields.

To get an aggregation for a time series, you must first annotate your queryset with a truncated date field, and then use that field for grouping.

truncate_rules = { "created_at": "day" }
queryset = truncate_date(Ticket.objects.all(), truncate_rules)

result = get_aggregations(
    queryset=queryset,
    aggregations={
        "total_tasks": {
            "type": "count",
        },
    },
    group_by="created_at__trunc__day",
)

If you use AggregationMixin, you just need to pass truncate_rules in the request body.

POST /aggregation/ticket
Content-Type: application/json
{
    "truncate_rules": { "created_at": "day" },
    "group_by": "created_at__trunc__day",
    "aggregations": {
        "total_tasks": {
            "type": "count"
        },
    }
}

Available truncations:

year
quarter
month
week
day
hour
minute
second

For mo details about truncations read Django Docs

Project details

These details have not been verified by PyPI

Project links

Homepage

Development Status
- 4 - Beta
Framework
- Django
Intended Audience
- Developers
License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python
- Python :: 3

Release history Release notifications | RSS feed

2.1.0

Oct 15, 2024

This version

2.0.1

Oct 15, 2024

2.0.0

Oct 15, 2024

1.1.2

Mar 29, 2022

1.1.1

Oct 30, 2021

1.1.0

Oct 18, 2021

1.0.1

Oct 18, 2021

1.0.0

Jul 27, 2021

0.8.2

Jul 15, 2021

0.8.1

Jul 15, 2021

0.8.0

Jul 14, 2021

0.7.3

Jan 15, 2021

0.7.2

Dec 30, 2020

0.7.1

Dec 2, 2020

0.7.0

Dec 2, 2020

0.6.0

Nov 30, 2020

0.5.2

Nov 26, 2020

0.5.1

Nov 23, 2020

0.5.0

Nov 22, 2020

0.4.1

Nov 21, 2020

0.4.0

Nov 21, 2020

0.3.0

Nov 21, 2020

0.2.0

Nov 15, 2020

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

drf_aggregation-2.0.1.tar.gz (18.8 kB view hashes)

Uploaded Oct 15, 2024 Source

Built Distribution

drf_aggregation-2.0.1-py3-none-any.whl (14.2 kB view hashes)

Uploaded Oct 15, 2024 Python 3

Hashes for drf_aggregation-2.0.1.tar.gz

Hashes for drf_aggregation-2.0.1.tar.gz
Algorithm	Hash digest
SHA256	`5c4139ecd15b06e9993eb56d6fbe4bd625a64aaf64483bcd66cdaadfded939d8`
MD5	`29168ea4c9731ffea9f95d457b84d003`
BLAKE2b-256	`829be22a86747cd87eabb2e085a6962f5fd6e29f3a76f86db65f8ba6cfdff702`

Hashes for drf_aggregation-2.0.1-py3-none-any.whl

Hashes for drf_aggregation-2.0.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`3c25aa4de3a426bb59129f31b7abe7018af784a657b0d72b96ce2ec686930630`
MD5	`07027242eedd3b096e62315522ac0c17`
BLAKE2b-256	`82f9e773f9cc31092ed8d4051522296c02b8271fd7ca399c88e3ecce90f87c0d`