Skip to main content

Utilities for deduping Django model instances

Project description

Build status codecov Python version

A collection of classes and utilities to aid in de-duping Django model instances.

Requirements

  • Python 3.6

  • Django 1.11

Install

pip install django-super-deduper

Usage

Merging Duplicate Instances

By default any empty values on the primary object will take the value from the duplicates. Additionally, any related one-to-one, one-to-many, and many-to-many related objects will be updated to reference the primary object.

> from django_super_deduper.merge import MergedModelInstance
> primary_object = Model.objects.create(attr_A=None, attr_B='')
> alias_object_1 = Model.objects.create(attr_A=X)
> alias_object_2 = Model.objects.create(attr_B=Y)
> merged_object = MergedModelInstance.create(primary_object, [alias_object_1, alias_object_2])
> merged_object.attr_A
X
> merged_object.attr_B
Y

Improvements

  • Support multiple merging strategies

  • Recursive merging of related one-to-one objects

Logging

This package does have some rudimentary logging for debugging purposes. Add this snippet to your Django logging settings to enable it:

LOGGING = {
    'loggers': {
        'django_super_deduper': {
            'handlers': ['console'],
            'level': 'DEBUG',
        },
    },
}

References

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

django-super-deduper-0.1.4.tar.gz (9.7 kB view hashes)

Uploaded Source

Built Distribution

django_super_deduper-0.1.4-py3-none-any.whl (5.6 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page