Skip to main content

Minimalistic implementation of the ULID specification for python (https://github.com/ulid/spec). No integrated type or specification checks but system integrity check.

Project description

Tool for Unique Lexicographically Identifier (ULID)

[Original title] Unique Lexicographically Sortable Identifier

ULID is a widely used specification for identifiers which, unlike UUID, aims for readability, anonymity, compatibility, and practical use through sortability. (Original specification: https://github.com/ulid/spec)

By modifying the original specification, this implementation enables the generation of guaranteed unique ULIDs within a file system.

The ULID specification does not achieve guaranteed cross-system uniqueness. Even the widely used UUID versions cannot achieve absolute uniqueness, even though they are considered unique in practice. This project deals with the highest possible uniqueness in time and space with the highest possible anonymity.

Statute

  • This module implements the original specification and is extended by modifications that follow the idea and properties.
  • The module contains utils as separate modules/functions/methods for the creation or handling of ULIDs.
  • The module contains system integrity checks for the consistent creation of ULIDs.
  • Python type checks during runtime are not performed.

Properties and basic concept

UUID vs ULID

Properties of a UUID

Depending on the version, a UUID can consist of

  • Timestamp

  • The Mac address

    The use of a MAC address can be considered sensitive information about the creator that is being disclosed.

  • Random values

    Random values can never be considered absolutely unique. Even though the probability of two identical UUIDv4s occurring is practically zero. Furthermore, random values naturally do not carry any information.

  • A centrally managed seed

    A central, generally recognized authority is required.

Properties of a ULID

  • 128-bit compatibility with UUID
  • 1.21e+24 unique ULIDs per millisecond
  • Lexicographically sortable
  • Canonically encoded as a 26 character string, as opposed to the 36 character UUID
  • Uses Crockford's base32 for better efficiency and readability (5 bits per character)
  • Case insensitive
  • No special characters (URL safe)
  • Monotonic sort order (correctly detects and handles the same millisecond)

Binary Layout (original specification)

A ULID has a fixed length of 128 bits coded in 16 bytes/octets and consists of 2 components. Each component is coded in big-endian format (network byte order).

      0                   1                   2                   3
      0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1 2 3 4 5 6 7 8 9 0 1
     |               |               |               |               |
     +---------------------------------------------------------------+
     | ms since epoch                                                |
     + - - - - - - - - - - - - - - - +-------------------------------+
     | ...                           | randomnes                     |
     +-------------------------------+ - - - - - - - - - - - - - - - +
     | ...                                                           |
     + - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - +
     | ...                                                           |
     +---------------------------------------------------------------+

Variations

ULID.runtime_lexical

Replaces the random part with a monotonic counter that counts up from 0 within a runtime and automatically restarts when the space is used up. Not thread-safe.

ULID.local_lexical

Similar to runtime_lexical. Reads the last status of the counter from the file system and writes the status when the Python interpreter terminates. Not thread-safe.

ULID.env_lexical

Here, the random part consists of 72 bits of a monotonic counter + 8 bits from a one-time read ID per interpreter start. Not sub-thread or sub-process safe. However, for up to 256 simultaneously running main processes.

ULID.thread_env_lexical

Structured like env_lexical. Here, the thread ID is used for system-wide uniqueness. Thread-safe.

ULID.short_env_lexical

Heavily modified version. The random part consists of only 8 bits. 4 bits from a monotonic counter + 4 bits from a uniquely read ID per interpreter start. Not sub-thread or sub-process safe. However, for up to 16 simultaneously running main processes.

Module usages

if bypass_checks:
    # By default, several system integrity checks are performed during import. 
    # To bypass this, the following code can be used
    import ulid_tool
    ulid_tool.SYSTEM_CHECKS = False
    from ulid_tool.ulid import ULID
else:
    from ulid_tool.ulid import ULID

# ULID of ORIGINAL SPECIFICATION
ulid = ULID()

# VARIATIONS
ULID.runtime_lexical()
ULID.local_lexical()
ULID.env_lexical()
ULID.thread_env_lexical()
ULID.short_env_lexical()

# INTERFACES

## CONVERSIONS
ulid.bytes
ulid.int
ulid.str
ulid.codec
ulid.repr
ulid.hex
ulid.oct
ulid.bin

## PARTS
ulid.randomness
ulid.timestamp
ulid.prime

## CREATE FROM
ulid.from_interfaces(ulid.timestamp, ulid.randomness)
ulid.from_codec(ulid.codec)
ulid.from_bytes(ulid.bytes)
ulid.from_str(ulid.str)
ulid.from_int(ulid.int)
ulid.from_hex(ulid.hex)
ulid.from_oct(ulid.oct)
ulid.from_bin(ulid.bin)
ulid.from_repr(ulid.repr)
### UTIL
ulid.prime_from_type_of(ulid.oct)

## PROGRESSION
assert ulid.copy().this_forwards(n=1) == ulid.next
assert ulid.copy().this_backwards(n=1) == ulid.previous
ulid.igenerator(ni=3)
reversed(ulid.igenerator(ni=3))

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ulid_tool-4.tar.gz (17.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

ulid_tool-4-py3-none-any.whl (14.6 kB view details)

Uploaded Python 3

File details

Details for the file ulid_tool-4.tar.gz.

File metadata

  • Download URL: ulid_tool-4.tar.gz
  • Upload date:
  • Size: 17.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.10.12

File hashes

Hashes for ulid_tool-4.tar.gz
Algorithm Hash digest
SHA256 f99d76bd7cb8fc4592d83edc7adc3adb02173468eb84a6d4c7e8a354a78f010d
MD5 2d7d8a3ac8d89d891813ce3206d23364
BLAKE2b-256 1b74c8690b250fcae027aaa6895b2247e0a79611660a5c1b7682800a579c6091

See more details on using hashes here.

File details

Details for the file ulid_tool-4-py3-none-any.whl.

File metadata

  • Download URL: ulid_tool-4-py3-none-any.whl
  • Upload date:
  • Size: 14.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.10.12

File hashes

Hashes for ulid_tool-4-py3-none-any.whl
Algorithm Hash digest
SHA256 149e467cfc3d94f2629edd169cce7f2df1d435136e98160ab154ceaaf29179d1
MD5 23df702fbe7061864ed8845e1a8a45bd
BLAKE2b-256 2fa0c14196562edcd3ac4a2f65365b11fa5704ad3b9937067570561b50c08506

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page