Skip to main content

Generate gap-less sequences of integer values.

Project description

The problem

Django’s default, implicit primary keys aren’t guaranteed to be sequential.

If a transaction inserts a row and then is rolled back, the sequence counter isn’t rolled back for performance reasons, creating a gap in primary keys.

This can cause compliance issues for some use cases such as accounting.

This risk isn’t well known. Since most transactions succeed, values look sequential. Gaps will only be revealed by audits.

The solution

django-sequences provides just one function, get_next_value, which is designed to be used as follows:

from django.db import transaction

from sequences import get_next_value

from invoices.models import Invoice

with transaction.atomic():
    Invoice.objects.create(number=get_next_value('invoice_numbers'))

The guarantees of django-sequences only apply if you call get_next_value and save its return value to the database within the same transaction!

Installation

Install django-sequences:

$ pip install django-sequences

Add it to the list of applications in your project’s settings:

INSTALLED_APPS += ['sequences.apps.SequencesConfig']

Run migrations:

$ django-admin migrate

API

get_next_value generates a gap-less sequence of integer values:

>>> get_next_value()
1
>>> get_next_value()
2
>>> get_next_value()
3

It supports multiple independent sequences:

>>> get_next_value('cases')
1
>>> get_next_value('cases')
2
>>> get_next_value('invoices')
1
>>> get_next_value('invoices')
2

The first value defaults to 1. It can be customized:

>>> get_next_value('customers', initial_value=1000)  # pro growth hacking

The initial_value parameter only matters when get_next_value is called for the first time for a given sequence — assuming the corresponding database transaction gets committed; as discussed above, if the transaction is rolled back, the generated value isn’t consumed. It’s also possible to initialize a sequence in a data migration and not use initial_value in actual code.

Database transactions that call get_next_value for a given sequence are serialized. In other words, when you call get_next_value in a database transaction, other callers which attempt to get a value from the same sequence will block until the transaction completes, either with a commit or a rollback. You should keep such transactions short to minimize the impact on performance.

Passing nowait=True will cause get_next_value to raise an exception instead of blocking. This will rarely be useful. Also it doesn’t work for the first call. (Arguably this is a bug. Patches welcome.)

Calls to get_next_value for distinct sequences don’t interact with one another.

Finally, passing using='...' allows selecting the database on which the current sequence value is stored. When this parameter isn’t provided, the current value is stored in the default database for writing to models of the sequences application. See below for details.

To sum up, the complete signature of get_next_value is:

get_next_value(sequence_name='default', initial_value=1,
               *, nowait=False, using=None)

Under the hood, it relies on the database’s transactional integrity to guarantee that each value will be returned exactly once.

Contributing

You can run tests with:

$ make test

If you’d like to contribute, please open an issue or a pull request on GitHub!

Other databases

INTEGER PRIMARY KEY AUTOINCREMENT fields on SQLite don’t have this problem.

The author doesn’t know if this problem can happens on MySQL or Oracle. If it does, then the current implementation of django-sequences should work. If you test this, please open an issue on GitHub to report your findings. Note that MySQL won’t support the nowait parameter.

Multiple databases

Since django-sequences relies on the database to guarantee transactional integrity, the current value for a given sequence must be stored in the same database as models containing generated values.

In a project that uses multiple databases, you must write a suitable database router to create tables for the sequences application on all databases storing models containing sequential numbers.

Each database has its own namespace: a sequence with the same name stored in two databases will have independent counters in each database.

Changelog

2.0

  • Add support for multiple databases.

  • Add translations.

  • nowait becomes keyword-only argument.

  • Drop support for Python 2.

1.0

  • Initial stable release.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

django-sequences-2.0.tar.gz (11.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

django_sequences-2.0-py2.py3-none-any.whl (29.2 kB view details)

Uploaded Python 2Python 3

File details

Details for the file django-sequences-2.0.tar.gz.

File metadata

File hashes

Hashes for django-sequences-2.0.tar.gz
Algorithm Hash digest
SHA256 2322bbdfdc452129614bcb2e6511274868f63d4f6b37cf1cda9d1d39f623d34d
MD5 abe4cf4968979d3077e8c26ea0ae6846
BLAKE2b-256 9c9686ca6128f94fdd4048e08f7848bcf082591737da5563f5a18a7547ec09b2

See more details on using hashes here.

File details

Details for the file django_sequences-2.0-py2.py3-none-any.whl.

File metadata

File hashes

Hashes for django_sequences-2.0-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 62aff435caad7d8c1b15ebd59c4d9b46140dd2ba9ebd3640a3e558e4396257cf
MD5 aa60901518cc6094091412c12af3bc58
BLAKE2b-256 0ecdbc2c029a5b5bbce8ed7b505cb63c6b3892775e5daf381e96a57bcb9925b2

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page