Strict typed UUID identifier base class with exact runtime subtype preservation and optional Pydantic v2 support.
Project description
base-typed-id
base_typed_id is a small Python library for building domain-specific UUID identifier types that remain real str objects at runtime.
Strict typed UUID identifier base class with exact runtime subtype preservation and optional Pydantic v2 support.
Why
Sometimes an identifier is semantically important enough to deserve its own type, but operationally it should still behave like a normal string.
Examples:
UserIdOrderIdExternalEventIdWorkspaceIdIntegrationId
Using plain str everywhere loses domain meaning.
Using plain uuid.UUID changes runtime behavior.
Using wrappers adds interoperability friction.
Using NewType helps only static typing.
base_typed_id gives you a middle ground:
domain-specific UUID-backed identifiers with validation, while keeping real str behavior at runtime.
What it guarantees
- accepts valid UUID strings and
uuid.UUIDvalues - supports auto-generation when called without an explicit value
- validates UUID format at construction time
- preserves the exact subclass type at construction and validation boundaries
- behaves like normal
str - normal string operations return plain
str - preserves subtype through pickle roundtrip
- supports Pydantic v2, but does not require it
- serializes and exports as plain string
- generates OpenAPI
type: stringandformat: uuid - ships
py.typed
What it intentionally does not do
- no built-in domain rules beyond UUID parsing and optional version checks
- no normalization layer
- no non-UUID identifier support
- no domain-specific methods
This package is intentionally minimal.
Domain rules should live in your subclasses or in your application layer.
Why not plain str / uuid.UUID / NewType / Annotated / custom wrapper?
Why not plain str?
Because plain str does not communicate domain intent.
def get_user(user_id: str, workspace_id: str) -> None:
...
This is easy to misuse:
- parameters can be swapped accidentally
- type annotations do not explain domain meaning
- static analysis cannot distinguish semantic identifier types
With typed subclasses:
def get_user(user_id: UserId, workspace_id: WorkspaceId) -> None:
...
the intent is explicit.
Why not uuid.UUID?
uuid.UUID validates structure, but it is not the same thing as a domain identifier type.
from uuid import UUID
def get_user(user_id: UUID, workspace_id: UUID) -> None:
...
This still loses semantic distinction:
user_idandworkspace_idare the same runtime type- static analysis cannot distinguish one UUID-based domain identifier from another
- annotations do not preserve domain meaning
- exported JSON often requires explicit conversion to string
- many integrations expect plain
str, notUUID - exact domain subtype identity such as
type(user_id) is UserIdis not available
base_typed_id keeps UUID validation while preserving domain-specific type identity and plain string interoperability.
Why not typing.NewType?
NewType is a static typing tool, not a runtime type.
from typing import NewType
UserId = NewType("UserId", str)
user_id: UserId = UserId("123e4567-e89b-42d3-a456-426614174000")
assert type(user_id) is str
assert isinstance(user_id, str)
This means:
- runtime values are still plain
str - there is no real subclass at runtime
- runtime boundaries cannot preserve a concrete semantic subtype
- introspection and runtime behavior cannot distinguish
UserIdfrom plainstr - UUID validation is not part of construction
base_typed_id creates a real runtime subtype with UUID validation.
Why not Annotated[str, ...]?
Annotated can attach metadata for validators and frameworks, but it still does not create a distinct runtime type.
That means:
- runtime values are still plain
str type(value)is not your domain identifier type- exact subtype identity is not preserved inside Python objects
If you need runtime identity such as type(user_id) is UserId, Annotated[str, ...] is not enough.
Why not a custom wrapper class?
A wrapper can model a domain value, but it stops being a real string.
Typical trade-offs:
isinstance(value, str)becomesFalse- JSON serialization often needs custom handling
- many libraries expect plain
str, not wrapper objects - you often need explicit
.valueextraction - interoperability becomes noisier
A wrapper is useful when you want rich behavior and strict encapsulation.
base_typed_id is for the opposite case:
keep the identifier operationally identical to str, while still having a named domain type with UUID guarantees.
When base_typed_id is the right choice
Use it when you want:
- semantic identifier types in annotations
- UUID validation at construction and validation boundaries
- real
strbehavior at runtime - plain string serialization
- clean interoperability with Python and library code
- Pydantic / OpenAPI compatibility
Do not use it when you need:
- heavy domain logic on the identifier itself
- mutable state
- multiple fields
- non-UUID runtime representation
- non-UUID identifiers
Installation
Base package
pip install base-typed-id
With Pydantic v2 support
pip install "base-typed-id[pydantic]"
For development
pip install "base-typed-id[dev]"
Quick start
from base_typed_id import BaseTypedId
class UserId(BaseTypedId):
pass
user_id: UserId = UserId("123e4567-e89b-42d3-a456-426614174000")
generated_user_id: UserId = UserId()
assert user_id == "123e4567-e89b-42d3-a456-426614174000"
assert isinstance(user_id, str)
assert isinstance(user_id, UserId)
assert type(user_id) is UserId
assert type(generated_user_id) is UserId
How to use it in your project
Create a module for your domain identifier types.
For example, create a file named domain_identifiers.py:
from base_typed_id import BaseTypedId
class UserId(BaseTypedId):
"""User identifier."""
class WorkspaceId(BaseTypedId):
"""Workspace identifier."""
Then use these types in your application code:
from .domain_identifiers import UserId, WorkspaceId
def get_user(user_id: UserId, workspace_id: WorkspaceId) -> None:
print(user_id, workspace_id)
This gives you:
- domain-specific names in type annotations
- UUID validation at boundaries
- real
strvalues at runtime - plain string serialization behavior
- reconstruction through validation layers such as Pydantic
Runtime behavior
BaseTypedId is a real str subclass backed by UUID validation.
from base_typed_id import BaseTypedId
class UserId(BaseTypedId):
pass
user_id: UserId = UserId("123e4567-e89b-42d3-a456-426614174000")
assert isinstance(user_id, str)
assert isinstance(user_id, UserId)
assert type(user_id) is UserId
assert user_id == "123e4567-e89b-42d3-a456-426614174000"
Normal string operations return plain str
from base_typed_id import BaseTypedId
class UserId(BaseTypedId):
pass
user_id: UserId = UserId("123e4567-e89b-42d3-a456-426614174000")
uppercased_value: str = user_id.upper()
concatenated_value: str = user_id + "_debug"
replaced_value: str = user_id.replace("-", "_")
assert type(uppercased_value) is str
assert type(concatenated_value) is str
assert type(replaced_value) is str
This behavior is intentional.
The typed subtype is preserved at construction and validation boundaries, not across ordinary string operations.
Constructor rules
Valid inputs are:
- no argument
- UUID string
uuid.UUID
Calling the constructor without an argument auto-generates a UUID when uuid_version is 4 or None.
from uuid import UUID
from base_typed_id import BaseTypedId
class UserId(BaseTypedId):
pass
value_from_string: UserId = UserId("123e4567-e89b-42d3-a456-426614174000")
value_from_uuid: UserId = UserId(UUID("123e4567-e89b-42d3-a456-426614174000"))
generated_value: UserId = UserId()
Invalid input raises BaseTypedIdInvalidInputValueError.
from base_typed_id import BaseTypedId, BaseTypedIdInvalidInputValueError
class UserId(BaseTypedId):
pass
try:
UserId("not-a-uuid")
except BaseTypedIdInvalidInputValueError:
pass
try:
UserId(123)
except BaseTypedIdInvalidInputValueError:
pass
UUID version control
By default, BaseTypedId enforces UUID v4.
from base_typed_id import BaseTypedId
class UserId(BaseTypedId):
pass
generated_user_id: UserId = UserId()
Use another version explicitly:
from base_typed_id import BaseTypedId
class ExternalEventId(BaseTypedId):
uuid_version = 5
Disable version restriction:
from base_typed_id import BaseTypedId
class FlexibleId(BaseTypedId):
uuid_version = None
When uuid_version is not 4 or None, auto-generation from None is intentionally rejected.
Deterministic identifiers
For idempotent identifiers, the package provides deterministically_from_words.
from base_typed_id import BaseTypedId, deterministically_from_words
class ExternalEventId(BaseTypedId):
uuid_version = 5
event_id: ExternalEventId = deterministically_from_words(
ExternalEventId,
words=[
"workspace:house-of-ai",
"provider:telegram",
"event:message-created",
"message:42",
],
)
Properties:
- same words -> same identifier
- order matters
- deterministic generation requires
uuid_version = 5oruuid_version = None
Pydantic v2 support
When used as a Pydantic field type:
- validation accepts UUID objects and UUID strings
- runtime model values preserve the exact subtype
- exported payloads are plain strings
- generated schema keeps
type: stringandformat: uuid
from pydantic import BaseModel
from base_typed_id import BaseTypedId
class UserId(BaseTypedId):
pass
class UserModel(BaseModel):
user_id: UserId
user_model: UserModel = UserModel.model_validate(
{
"user_id": "123e4567-e89b-42d3-a456-426614174000",
}
)
assert type(user_model.user_id) is UserId
dumped_python: dict[str, object] = user_model.model_dump()
assert dumped_python == {
"user_id": "123e4567-e89b-42d3-a456-426614174000",
}
assert type(dumped_python["user_id"]) is str
Important boundary
Inside the validated model, the exact subtype is preserved.
After serialization or export, values intentionally become plain strings.
This is a feature, not a bug.
Pickle support
Pickle roundtrip preserves the exact subtype.
import pickle
from base_typed_id import BaseTypedId
class UserId(BaseTypedId):
pass
source_user_id: UserId = UserId("123e4567-e89b-42d3-a456-426614174000")
serialized_user_id: bytes = pickle.dumps(source_user_id)
restored_user_id: object = pickle.loads(serialized_user_id)
assert restored_user_id == "123e4567-e89b-42d3-a456-426614174000"
assert type(restored_user_id) is UserId
JSON behavior
Since BaseTypedId inherits from str, standard JSON serialization naturally produces plain JSON strings.
import json
from base_typed_id import BaseTypedId
class UserId(BaseTypedId):
pass
value: UserId = UserId("123e4567-e89b-42d3-a456-426614174000")
serialized_value: str = json.dumps(value)
restored_value: object = json.loads(serialized_value)
assert serialized_value == '"123e4567-e89b-42d3-a456-426614174000"'
assert restored_value == "123e4567-e89b-42d3-a456-426614174000"
assert type(restored_value) is str
This behavior is intentional.
JSON is a plain data boundary.
The exact runtime subtype exists only inside Python objects. After serialization, values become plain strings and do not carry subtype information.
Public API
from base_typed_id import BaseTypedId
from base_typed_id import BaseTypedIdError
from base_typed_id import BaseTypedIdInvalidInputValueError
from base_typed_id import BaseTypedIdInvariantViolationError
from base_typed_id import deterministically_from_words
Exceptions
BaseTypedIdError
Root exception for all package-specific errors.
BaseTypedIdInvalidInputValueError
Raised when an invalid UUID input value is provided.
BaseTypedIdInvariantViolationError
Raised when an internal invariant or contract is violated.
Design notes
BaseTypedId is intended for projects that want domain-specific UUID identifier names without giving up normal str runtime behavior.
This is especially useful when you have many semantic identifiers such as:
UserIdWorkspaceIdOrderIdExternalEventIdIntegrationId
The base class stays intentionally small so that your domain layer remains explicit and predictable.
Development
Run tests
pytest
Run lint
ruff check .
Run type checking
mypy src tests
pyright
Build package
python -m build
Validate distribution metadata
twine check dist/*
Compatibility
- Python 3.10+
- CPython
- optional Pydantic v2 support
License
MIT
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file base_typed_id-0.1.1.tar.gz.
File metadata
- Download URL: base_typed_id-0.1.1.tar.gz
- Upload date:
- Size: 15.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.14.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
34a6879849546972be598da6b8690989d623a59620070b4dffe0ed15cff7bdaa
|
|
| MD5 |
21cc6fcc5d1c4d98c4556d870ce300e9
|
|
| BLAKE2b-256 |
ef6f8cd1d773587231ed07485db14b20c810c929a98c604d57c49beef515fc11
|
File details
Details for the file base_typed_id-0.1.1-py3-none-any.whl.
File metadata
- Download URL: base_typed_id-0.1.1-py3-none-any.whl
- Upload date:
- Size: 10.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.14.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
7e66dde0fced9c12aa119f6ba0aacb12c3ad741a4525ed724f02051eb54f6da1
|
|
| MD5 |
b7bb07acc0e9423172f3444de03793c1
|
|
| BLAKE2b-256 |
7ef3a6eb8d9aaff46c04ecda1e9902e264e94cbc69387115ae27679d7bb4c141
|