Flexible taxonomy management for generic items — categories, tags, and multi-parent hierarchies with pluggable storage.
Project description
taxomesh
Reusable taxonomy engine for products, content, media, or any domain object you already have.
taxomesh lets you attach categories, tags, and item relationships to existing
entities without baking taxonomy logic into your core models or re-implementing
the same validation, admin, and API workflows in every project.
Use it when "we just need categories" stops being simple:
- categories can have more than one parent
- the same item must appear in multiple branches
- ordering depends on the parent category
- your real entities already live in another system or model
- the same taxonomy rules must work from Python, CLI, Django admin, or your own API
What you get:
- multi-parent category DAGs
- per-parent sort ordering
- free-form item tags
- typed item-to-item relations
- pluggable storage backends (YAML, JSON, Django)
- one service layer with optional CLI, HTTP, and Django integrations
- typo-tolerant fuzzy search over items and categories
What Taxomesh Does
At a high level, taxomesh is a reusable taxonomy layer.
It stores and validates the structure around your entities:
- categories and subcategories
- item placement inside one or more categories
- tags
- typed relations between items
- slugs, metadata, and external IDs for integration
Your actual business objects can stay where they already are. In many projects,
taxomesh is the missing layer between "our app already has products/articles/assets"
and "we need a serious taxonomy on top of them."
Typical Use Cases
- Ecommerce catalogs where a product appears in several navigation paths
- Editorial or CMS systems with sections, topics, and reusable tagging
- Media catalogs with genre, format, collection, and related-item links
- Internal content or knowledge systems that need taxonomy without custom admin work
Status
taxomesh is currently pre-alpha (0.1.x).
API and behavior can still change between releases.
Installation
Requires Python 3.11+.
pip install taxomesh
Optional Django integration:
pip install "taxomesh[django]"
Quick Start
Example: your application already has a product, track, or article identified by an external ID, and you want to place it in a reusable taxonomy.
With no explicit repository configured, TaxomeshService() auto-discovers
taxomesh.toml; otherwise it falls back to the default YAML backend.
from taxomesh import TaxomeshService
svc = TaxomeshService()
music = svc.create_category(name="Music")
jazz = svc.create_category(name="Jazz")
formats = svc.create_category(name="Formats")
vinyl = svc.create_category(name="Vinyl")
svc.add_category_parent(jazz.category_id, music.category_id, sort_index=10)
svc.add_category_parent(vinyl.category_id, formats.category_id, sort_index=20)
album = svc.create_item(
external_id="catalog:42",
name="Kind of Blue",
slug="kind-of-blue",
)
svc.place_item_in_category(album.item_id, jazz.category_id, sort_index=1)
svc.place_item_in_category(album.item_id, vinyl.category_id, sort_index=3)
featured = svc.create_tag(name="featured")
svc.assign_tag(featured.tag_id, album.item_id)
print(album.external_id) # "catalog:42"
print([node.category.name for node in svc.get_graph().roots]) # ["Music", "Formats"]
The item still belongs to your application. taxomesh manages the taxonomy layer
around it: placement, ordering, tags, relations, slugs, and traversal.
Resolving items and categories by external_id
Use the dedicated lookup methods for point lookups by external_id:
# Correct — uses the database index directly
items = svc.get_items_by_external_id("catalog:42") # list[Item]
categories = svc.get_categories_by_external_id("solo") # list[Category]
Do not use list_items() or list_categories() with a Python filter — that
performs a full table scan and bypasses the index.
The external_id field is indexed on the Django backend but not unique.
Result length indicates the state of your data:
| Length | Meaning |
|---|---|
0 |
No match — the external ID is an orphan or was never assigned |
1 |
Unique match — the expected case for a well-maintained catalog |
>= 2 |
Duplicates exist — review and deduplicate as needed |
Fuzzy Search
search_items() and search_categories() find matches by name, slug, and external ID
with typo tolerance, accent-insensitivity, and ranked results — no extra infrastructure
required.
# Typo-tolerant: finds "Piazzolla" even with a misspelling
results = svc.search_items("piazola")
# Accent-insensitive: finds "Agustín Magaldi" without the accent
results = svc.search_items("agustin magaldi")
# Scoped to a subtree
results = svc.search_items("tango", category_id=cat.category_id, recursive=True)
# Category search, children of a specific parent only
results = svc.search_categories("orkesta tipika", parent_id=parent.category_id)
Results are sorted by match quality: exact matches first, then prefix, substring, and
fuzzy matches. Pass fuzzy=False to restrict to exact/prefix/substring matching only.
Pass enabled_only=False to include disabled items and categories.
See Python API — Fuzzy Search for the full parameter reference.
Why This Exists
Taxonomy work is usually underestimated. A simple category table becomes more complex once you need:
- multiple parents instead of a strict tree
- branch-specific ordering
- items linked to existing models by external ID
- reusable validation and errors across app code, CLI, admin, and APIs
- storage that fits both local development and production integration
taxomesh packages those concerns into a single component so they do not have to be
re-solved in each codebase.
Core Concepts
- Item: an entity in your taxonomy, usually linked to a business object through
external_id - Category: a taxonomy node with optional
name,description,metadata,external_id,enabled, and uniqueslug - Tag: a free-form label assigned to items
- ItemRelationLink: a directed, typed relation between two items such as
covers,version_of, orperformed_by - CategoryParentLink: the link from a category to one of its parents, including
sort_index - ItemParentLink: the link from an item to a category, including
sort_index - TaxomeshGraph: a read snapshot returned by
get_graph()for traversal - Repository: the storage backend used by
TaxomeshService
Documentation
| Topic | Description |
|---|---|
| What Taxomesh Solves | Product overview, common use cases, and why taxonomy gets complex |
| Python API | Categories, Items, Tags, Graph, slug and external-ID lookups |
| Django integration | Django ORM + admin setup, model bridging |
| HTTP API integration | Reuse request models, handlers, and error mapping in your existing web app |
| Repositories | YAML, JSON, and Django storage backends; custom backends |
| Configuration | taxomesh.toml reference |
| CLI reference | Command-line interface for categories, items, tags, and graph |
Design
taxomesh keeps a stable application-facing shape while letting storage and integration
details vary:
- Service layer:
TaxomeshServiceis the main entry point for application code - Domain rules: taxonomy validation, including DAG constraints and typed errors
- Repositories: YAML, JSON, Django, or a custom backend behind the same service API
- Optional integrations: CLI, Django admin + ORM, and framework-agnostic HTTP helpers
Development
uv sync --dev
uv run pytest
uv run ruff check .
uv run mypy .
Contributing
Contributions are welcome.
This project follows a spec-first workflow. Please align implementation PRs with the specs/ directory.
License
MIT.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file taxomesh-0.1.0a23.tar.gz.
File metadata
- Download URL: taxomesh-0.1.0a23.tar.gz
- Upload date:
- Size: 747.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: uv/0.10.10 {"installer":{"name":"uv","version":"0.10.10","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
a7a556eb6218aec310c37c21b8685217ae862e0d116bbc38c5e4cedb0cd0de62
|
|
| MD5 |
b435a0f3cb107fc333253c00388008ed
|
|
| BLAKE2b-256 |
9b8b78d1d57a58da92d66e4d0f1ee9282ee4213641d588fe3a2fc17dbf19defc
|
File details
Details for the file taxomesh-0.1.0a23-py3-none-any.whl.
File metadata
- Download URL: taxomesh-0.1.0a23-py3-none-any.whl
- Upload date:
- Size: 92.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: uv/0.10.10 {"installer":{"name":"uv","version":"0.10.10","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
fc4e0f4efd61bbad8abdf479e4514594ef321d54d309516171814807ef2ac49b
|
|
| MD5 |
67ed450d8c736d7b8abe640e2a85b45d
|
|
| BLAKE2b-256 |
0a0f5627835325371d4d3cd32bc655168eea9c38739540ca2fa9d98e40171f48
|