ULMEN: The number one serialization format across size, tokens, speed, and memory

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

makroumi

These details have not been verified by PyPI

Project description

ULMEN V1

Ultra Lightweight Minimal Encoding Notation

ULMEN is a serialization format engineered to be the smallest, fastest, and most token-efficient way to move structured data between services, into storage, and through language model context windows.

It ships as a pure Python library with an optional Rust acceleration layer that is drop-in compatible and byte-identical in output.

Benchmarks
At a Glance
Surfaces
Installation
Quick Start
API Reference
Wire Format Constants
Utilities
Architecture
Running Tests
Format Specification
Versioning

Benchmarks

Measured on 1,000 records, 10 mixed-type columns (int, float, str, bool). Speed = median of 50 runs, full construction included (pool build + encode). Machine: x86_64 Linux, Python 3.12, rustc 1.92.

Size

Format	Bytes	vs JSON
JSON	145,664	100.0%
Pickle protocol 4	62,177	42.7%
CSV	61,717	42.4%
ULMEN text	57,403	39.4%
ULMEN text	46,779	32.1%
ULMEN binary	32,701	22.4%
ULMEN zlib-6	2,453	1.7%
ULMEN zlib-9	2,450	1.7%

Python and Rust produce byte-identical output.

Speed - Encode (median ms, 1,000 records)

Format	Encode ms	Decode ms
JSON	2.137	2.771
Pickle protocol 4	0.644	0.807
ULMEN text (Python)	19.707	-
ULMEN binary (Python)	22.008	0.736
ULMEN zlib-6 (Python)	22.251	-
ULMEN (Python)	2.433	1.405
ULMEN text (Rust)	1.726	-
ULMEN binary (Rust)	1.721	0.738
ULMEN zlib-6 (Rust)	2.109	-
ULMEN (Rust)	5.697	1.397

Rust acceleration: 12.8x faster binary encode, 11.4x faster text encode vs Python.

ULMEN binary (Rust) encode is comparable to JSON encode while producing output 4.5x smaller.

Streaming (median 50 runs, 1,000 records)

Surface	Encode ms	Decode ms	MB/s
`stream_encode`	2.298	0.726	14.2
`stream_encode_windowed` (ws=100)	1.660	—	—

Wire format identical to batch encode. Rust backend selected automatically.

At a Glance

	ULMEN binary	ULMEN text	ULMEN	JSON
Size vs JSON	22.4%	32.1%	39.4%	100%
Zlib compressed	1.7%	-	-	-
Rust encode (ms)	1.721	1.726	5.697	2.137
Python encode (ms)	22.008	19.707	2.433	2.137
Self-describing	yes	yes	yes	yes
LLM-generatable	-	-	yes	partial
Round-trip exact	yes	yes	yes	no (NaN/inf)

Surfaces

ULMEN exposes four surfaces over a single data model:

Binary: `LUMB` prefix

Columnar binary format. Smallest on wire. Designed for storage and IPC. Supports delta encoding, bitpacking, RLE, string pooling, and zlib.

Text: `records[N]:` prefix

Line-oriented, diff-friendly, human-readable. Compatible with standard text tools. Uses the same pool and strategy system as binary.

ULMEN: `L|` prefix

LLM-native CSV surface. Every payload is self-describing via a typed header line. Language models can read and generate ULMEN without special training or prompt engineering.

Streaming: `UlmenStreamEncoder` / `stream_encode`

Zero-materialisation streaming encode surface. Feed records one at a time or in batches, then flush to an iterator of bytes chunks. The Rust backend is selected automatically. Wire format is identical to batch binary encode — every chunk is independently decodable. For truly unbounded streams use stream_encode_windowed which encodes fixed-size windows into independent sub-payloads, each decodable standalone.

ULMEN-AGENT: `ULMEN-AGENT v1` prefix

Structured protocol for agentic AI communication. Typed record schemas for messages, tool calls, results, plans, observations, errors, memory, RAG chunks, hypotheses, and chain-of-thought steps.

Extended capabilities:

Extended header fields: payload_id, parent_payload_id, agent_id, session_id, schema_version, context_window, context_used, meta_fields
Meta fields appended to every row: parent_id, from_agent, to_agent, priority
Context compression: completed_sequences, keep_types, sliding_window
Priority-based retention: MUST_KEEP, KEEP_IF_ROOM, COMPRESSIBLE
Unlimited context via chunk_payload, merge_chunks, build_summary_chain
LLM output auto-repair via parse_llm_output
Exact BPE token counting via count_tokens_exact (cl100k_base)
Multi-agent routing via AgentRouter
Cross-payload thread tracking via ThreadRegistry
Append-only audit trail via ReplayLog
Programmatic system prompt generation via generate_system_prompt
ULMEN bridge: convert_agent_to_ulmen, convert_ulmen_to_agent
Structured validation errors via ValidationError
Context budget enforcement via ContextBudgetExceededError
Streaming decode via decode_agent_stream
Subgraph extraction by thread, step range, type
Memory deduplication via dedup_mem, get_latest_mem
MessagePack compatibility via encode_msgpack, decode_msgpack

Installation

From source (with Rust acceleration)

git clone https://github.com/makroumi/ulmen
cd ulmen
pip install maturin
maturin develop --release

Python only (no Rust required)

pip install -e .

The library detects automatically whether the Rust extension is available and falls back to the pure Python implementation silently.

Quick Start

from ulmen import UlmenDict, UlmenDictRust, encode_ulmen_llm, decode_ulmen_llm

records = [
    {"id": 1, "name": "Alice", "city": "London", "score": 98.5, "active": True},
    {"id": 2, "name": "Bob",   "city": "London", "score": 91.0, "active": False},
    {"id": 3, "name": "Carol", "city": "Paris",  "score": 87.3, "active": True},
]

# Binary (smallest)
ld     = UlmenDict(records)
binary = ld.encode_binary_pooled()
zlib_  = ld.encode_binary_zlib()

# Text (human-readable)
text = ld.encode_text()

# ULMEN (LLM-native)
ulmen = encode_ulmen_llm(records)
back  = decode_ulmen_llm(ulmen)

# Rust acceleration (drop-in, byte-identical)
ld_rust = UlmenDictRust(records)
binary  = ld_rust.encode_binary_pooled()
text    = ld_rust.encode_text()
ulmen   = ld_rust.encode_ulmen_llm()

ULMEN-AGENT

from ulmen import (
    encode_agent_payload,
    decode_agent_payload,
    decode_agent_payload_full,
    validate_agent_payload,
    compress_context,
    chunk_payload,
    merge_chunks,
    build_summary_chain,
    parse_llm_output,
    count_tokens_exact,
    AgentRouter,
    ThreadRegistry,
    ReplayLog,
    generate_system_prompt,
    convert_agent_to_ulmen,
    convert_ulmen_to_agent,
    dedup_mem,
    get_latest_mem,
    estimate_context_usage,
    extract_subgraph,
    extract_subgraph_payload,
    make_validation_error,
    AgentHeader,
    ValidationError,
    ContextBudgetExceededError,
)

records = [
    {
        "type": "msg", "id": "m1", "thread_id": "t1", "step": 1,
        "role": "user", "turn": 1, "content": "Hello", "tokens": 5,
        "flagged": False,
    },
    {
        "type": "tool", "id": "tc1", "thread_id": "t1", "step": 2,
        "name": "search", "args": '{"q":"ulmen"}', "status": "pending",
    },
    {
        "type": "res", "id": "tc1", "thread_id": "t1", "step": 3,
        "name": "search", "data": "ULMEN is fast", "status": "done",
        "latency_ms": 42,
    },
]

# Encode with extended header fields
payload = encode_agent_payload(
    records,
    thread_id="t1",
    context_window=8000,
    payload_id="uuid-abc",
    parent_payload_id="uuid-prev",
    agent_id="agent-alpha",
    session_id="sess-001",
    schema_version="1.0.0",
    auto_context=True,
    auto_payload_id=False,
    enforce_budget=False,
)

# Decode (records only)
decoded = decode_agent_payload(payload)

# Decode (records + parsed header)
records_out, header = decode_agent_payload_full(payload)
print(header.payload_id)
print(header.context_used)

# Validate
ok, err = validate_agent_payload(payload)

# Validate with structured error object
ok, err = validate_agent_payload(payload, structured=True)
if not ok:
    print(err.message, err.row, err.field, err.suggestion)

# Stream decode one record at a time
from ulmen import decode_agent_stream
for rec in decode_agent_stream(iter(payload.splitlines(keepends=True))):
    print(rec["type"])

# Context compression
from ulmen.core._agent import COMPRESS_COMPLETED_SEQUENCES
compressed = compress_context(
    records,
    strategy=COMPRESS_COMPLETED_SEQUENCES,
    preserve_cot=True,
)

# Memory deduplication
clean = dedup_mem(records)
latest = get_latest_mem(records, key="user_pref")

# Context usage estimation
usage = estimate_context_usage(records)
print(usage["tokens"], usage["by_type"])

# Chunking for unlimited context
chunks = chunk_payload(records, token_budget=2000, thread_id="t1", overlap=1)
merged = merge_chunks(chunks)

# Summary chain for unlimited context
chain = build_summary_chain(records, token_budget=2000, thread_id="t1")

# LLM output auto-repair
repaired = parse_llm_output(raw_llm_text)
repaired = parse_llm_output(raw_llm_text, strict=True)

# Exact token counting
n_tokens = count_tokens_exact(payload)

# Subgraph extraction
filtered = extract_subgraph(records, thread_id="t1", step_min=2, types=["tool","res"])
filtered_payload = extract_subgraph_payload(payload, types=["cot"])

# Multi-agent routing
router = AgentRouter()
router.register("planner", "executor", lambda rec: print(rec))
router.dispatch(records)

# Cross-payload thread tracking
registry = ThreadRegistry()
registry.add_payload("pid-1", records)

# Audit trail
log = ReplayLog()
log.append({"event": "encode", "payload_id": "pid-1"})

# System prompt generation
prompt = generate_system_prompt(include_examples=True, include_validation=True)

# ULMEN bridge
ulmen   = convert_agent_to_ulmen(payload)
payload2 = convert_ulmen_to_agent(ulmen, thread_id="t1")

# Validation error payload
err_payload = make_validation_error("bad step", thread_id="t1")

# Context budget enforcement
try:
    encode_agent_payload(records, context_window=10, enforce_budget=True)
except ContextBudgetExceededError as e:
    print(e.overage)

API Reference

UlmenDict

Pure Python record container. Zero runtime dependencies.

ld = UlmenDict(records)

ld.encode_text()               # str   ULMEN text format
ld.encode_binary()             # bytes raw binary
ld.encode_binary_pooled()      # bytes binary with full strategy selection
ld.encode_binary_zlib(level=6) # bytes binary + zlib, level 0-9
ld.encode_ulmen_llm()          # str   ULMEN format

ld.decode_text(text)           # UlmenDict
ld.decode_binary(data)         # UlmenDict
ld.decode_ulmen_llm(text)      # UlmenDict

ld.to_json()                   # str standard JSON (NaN/inf replaced with null)
ld.append(record)              # mutate, rebuilds pool, invalidates cache

len(ld)                        # number of records
ld.pool_size                   # number of interned strings
ld[0]                          # direct index access

UlmenDictRust

Extended pool variant. Strategies always enabled.

ldf = UlmenDictFull(records, pool_size_limit=256)
ldf.encode_binary()
ldf.encode_text()
ldf.encode_ulmen_llm()

UlmenDictRust / UlmenDictFullRust

Rust-accelerated drop-in replacements. Byte-identical output.

from ulmen import UlmenDictRust, UlmenDictFullRust, RUST_AVAILABLE

print(RUST_AVAILABLE)
ld = UlmenDictRust(records, optimizations=False, pool_size_limit=64)
ld.encode_text()
ld.encode_binary_pooled()
ld.encode_binary_zlib(level=6)
ld.encode_ulmen_llm()

Streaming encode

See ulmen.core._streaming for full API.

from ulmen import UlmenStreamEncoder, stream_encode, stream_encode_windowed

# One-shot
for chunk in stream_encode(records, chunk_size=65536):
    socket.sendall(chunk)

# Stateful
enc = UlmenStreamEncoder(pool_size_limit=64, chunk_size=65536)
enc.feed(record)
enc.feed_many(records)
for chunk in enc.flush():
    sink.write(chunk)
print(enc.rust_backed)  # True when Rust extension active

# Unbounded windowed
for chunk in stream_encode_windowed(records, window_size=1000):
    decode_binary_records(chunk)

Model-level encode/decode

from ulmen import (
    encode_ulmen_llm,
    decode_ulmen_llm,
    encode_binary_records,
    decode_binary_records,
    encode_text_records,
    decode_text_records,
    build_pool,
    detect_column_strategy,
)

ULMEN-AGENT core

from ulmen import (
    encode_agent_payload,
    decode_agent_payload,
    decode_agent_payload_full,
    decode_agent_record,
    encode_agent_record,
    decode_agent_stream,
    validate_agent_payload,
    make_validation_error,
    extract_subgraph,
    extract_subgraph_payload,
    AgentHeader,
    ValidationError,
    ContextBudgetExceededError,
)

'encode_agent_payload' parameters:

Parameter	Type	Description
records	list[dict]	Records to encode
thread_id	str or None	Written to header
context_window	int or None	Token budget declared in header
meta_fields	tuple	Extra fields appended to every row
auto_context	bool	Compute context_used automatically
enforce_budget	bool	Raise ContextBudgetExceededError if over budget
payload_id	str or None	Unique ID for this payload
parent_payload_id	str or None	Links to prior payload in chain
agent_id	str or None	ID of the producing agent
session_id	str or None	Session this payload belongs to
schema_version	str or None	Protocol version for negotiation
auto_payload_id	bool	Generate a UUID payload_id automatically

Context compression

from ulmen import compress_context, dedup_mem, get_latest_mem, estimate_context_usage
from ulmen.core._agent import (
    COMPRESS_COMPLETED_SEQUENCES,
    COMPRESS_KEEP_TYPES,
    COMPRESS_SLIDING_WINDOW,
    PRIORITY_MUST_KEEP,
    PRIORITY_KEEP_IF_ROOM,
    PRIORITY_COMPRESSIBLE,
)

compressed = compress_context(
    records,
    strategy=COMPRESS_COMPLETED_SEQUENCES,
    keep_priority=PRIORITY_KEEP_IF_ROOM,
    preserve_cot=True,
)

clean  = dedup_mem(records)
latest = get_latest_mem(records, key="pref")
usage  = estimate_context_usage(records)

Strategies:

completed_sequences: replace completed tool+res pairs with mem summaries
keep_types: keep only specified record types
sliding_window: keep recent records verbatim, summarize older ones

Unlimited context

from ulmen import chunk_payload, merge_chunks, build_summary_chain

chunks = chunk_payload(
    records,
    token_budget=4000,
    thread_id="t1",
    overlap=2,
    parent_payload_id="prev-id",
    session_id="sess-1",
)
merged = merge_chunks(chunks)

chain = build_summary_chain(
    records,
    token_budget=4000,
    thread_id="t1",
    session_id="sess-1",
)

LLM output repair

from ulmen import parse_llm_output

repaired = parse_llm_output(raw_text)
repaired = parse_llm_output(raw_text, thread_id="t1", strict=True)

Uses cl100k_base BPE (GPT-4 / Claude compatible). Falls back to character estimate when tiktoken is unavailable.

Multi-agent routing

from ulmen import AgentRouter, validate_routing_consistency

router = AgentRouter()
router.register("agent_a", "agent_b", handler_fn)
router.dispatch(records)
router.dispatch_one(record)

ok, err = validate_routing_consistency(records)

Cross-payload thread tracking

from ulmen import ThreadRegistry, merge_threads

registry = ThreadRegistry()
registry.add_payload("pid-1", records)
threads  = registry.get_threads()

merged = merge_threads([payload1_records, payload2_records])

Audit trail

from ulmen import ReplayLog

log    = ReplayLog()
log.append({"event": "encode", "ts": 1234})
events = log.all()

System prompt generation

from ulmen import generate_system_prompt

prompt = generate_system_prompt(
    include_examples=True,
    include_validation=True,
)

ULMEN bridge

from ulmen import convert_agent_to_ulmen, convert_ulmen_to_agent

ulmen   = convert_agent_to_ulmen(agent_payload)
payload = convert_ulmen_to_agent(ulmen, thread_id="t1")

MessagePack compatibility

from ulmen.core._msgpack_compat import encode_msgpack, decode_msgpack

packed   = encode_msgpack(records)
unpacked = decode_msgpack(packed)

Wire Format Constants

from ulmen import (
    MAGIC,    # b'LUMB'
    VERSION,  # bytes([3, 3])
    T_STR_TINY, T_STR, T_INT, T_FLOAT, T_BOOL, T_NULL,
    T_LIST, T_MAP, T_POOL_DEF, T_POOL_REF, T_MATRIX,
    T_DELTA_RAW, T_BITS, T_RLE,
    S_RAW, S_DELTA, S_RLE, S_BITS, S_POOL,
    AGENT_MAGIC,   # "ULMEN-AGENT v1"
    AGENT_VERSION, # "1.0.0"
    RECORD_TYPES,  # frozenset of 10 type tags
    FIELD_COUNTS,  # dict[type -> total field count per row including common fields]
    META_FIELDS,   # ("parent_id", "from_agent", "to_agent", "priority")
    COMPRESS_COMPLETED_SEQUENCES,
    COMPRESS_KEEP_TYPES,
    COMPRESS_SLIDING_WINDOW,
    PRIORITY_MUST_KEEP,    # 1
    PRIORITY_KEEP_IF_ROOM, # 2
    PRIORITY_COMPRESSIBLE, # 3
)

Utilities

from ulmen import (
    estimate_tokens,   # rough LLM token count (chars / 4)
    deep_size,         # recursive memory footprint in bytes
    deep_eq,           # structural equality handling NaN and inf
    fnv1a, fnv1a_str,  # FNV-1a 32-bit hash
)

Architecture

ulmen/
├── Cargo.lock
├── Cargo.toml
├── pyproject.toml
├── README.md
├── SPEC.md
├── src/
│   └── lib.rs
├── ulmen/
│   ├── __init__.py
│   ├── core.py
│   └── core/
│       ├── __init__.py
│       ├── _constants.py
│       ├── _primitives.py
│       ├── _strategies.py
│       ├── _text.py
│       ├── _binary.py
│       ├── _ulmen_llm.py
│       ├── _agent.py
│       ├── _api.py
│       ├── _repair.py
│       ├── _replay.py
│       ├── _routing.py
│       ├── _threading.py
│       ├── _tokens.py
│       ├── _msgpack_compat.py
│       └── _streaming.py
├── tests/
│   ├── conftest.py
│   ├── smoke_test_comprehensive.py
│   ├── integration/
│   │   ├── test_edge_cases.py
│   │   ├── test_init_coverage.py
│   │   └── test_rust_layer.py
│   ├── perf/
│   │   ├── test_benchmark.py
│   │   ├── test_size.py
│   │   └── test_speed.py
│   └── unit/
│       ├── test_agent.py
│       ├── test_core_coverage.py
│       ├── test_encoders.py
│       ├── test_ulmendict.py
│       ├── test_ulmen_llm.py
│       ├── test_msgpack_compat.py
│       ├── test_primitives.py
│       ├── test_repair.py
│       ├── test_replay.py
│       ├── test_routing.py
│       ├── test_strategies.py
│       ├── test_streaming.py
│       ├── test_threading.py
│       └── test_tokens.py
└── docs/
    ├── index.md
    ├── getting-started/
    │   ├── installation.md
    │   └── quickstart.md
    ├── guides/
    │   ├── binary-format.md
    │   ├── text-format.md
    │   ├── ulmen.md
    │   └── compression.md
    ├── reference/
    │   ├── api.md
    │   ├── constants.md
    │   ├── primitives.md
    │   └── benchmarks.md
    ├── agent/
    │   ├── overview.md
    │   ├── spec.md
    │   └── system-prompt.md
    └── internals/
        ├── architecture.md
        └── wire-format.md

Design principle: the Python layer is the normative specification. The Rust layer is an optimization producing identical output at higher speed. All encode results are cached after the first call and invalidated on mutation.

Running Tests

pytest tests/ -v
pytest tests/ --cov=ulmen --cov-report=term-missing

1,364 tests across unit, integration, performance, and smoke suites. 100% statement coverage across all modules. All tests pass with and without the Rust extension.

Format Specification

See SPEC.md for the complete wire format specification including all tag values, encoding rules, strategy selection logic, and full ULMEN and ULMEN-AGENT protocol details.

Versioning

1.0.0

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

makroumi

These details have not been verified by PyPI

Release history Release notifications | RSS feed

1.0.2

Apr 15, 2026

This version

1.0.1

Apr 15, 2026

1.0.0

Apr 15, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ulmen-1.0.1.tar.gz (81.0 kB view details)

Uploaded Apr 15, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

ulmen-1.0.1-cp310-abi3-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (402.0 kB view details)

Uploaded Apr 15, 2026 CPython 3.10+manylinux: glibc 2.17+ x86-64

File details

Details for the file ulmen-1.0.1.tar.gz.

File metadata

Download URL: ulmen-1.0.1.tar.gz
Upload date: Apr 15, 2026
Size: 81.0 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for ulmen-1.0.1.tar.gz
Algorithm	Hash digest
SHA256	`0bd03e4eb2e7cf3e51599475cd019b65d8ebb967c33947b67a95c021297354fd`
MD5	`be2e82bfb535c9c6b437c8b12fd83a91`
BLAKE2b-256	`7c3573cc5d6e6ed21efd4ab144be411769d8a686a2017df18458cd8cbd5213fb`

See more details on using hashes here.

Provenance

The following attestation bundles were made for ulmen-1.0.1.tar.gz:

Publisher: ci.yml on makroumi/ulmen

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: ulmen-1.0.1.tar.gz
- Subject digest: 0bd03e4eb2e7cf3e51599475cd019b65d8ebb967c33947b67a95c021297354fd
- Sigstore transparency entry: 1302472615
- Sigstore integration time: Apr 15, 2026
Source repository:
- Permalink: makroumi/ulmen@3023381663c53565e09a42c8bbdc8d2d323e9ea7
- Branch / Tag: refs/heads/main
- Owner: https://github.com/makroumi
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: ci.yml@3023381663c53565e09a42c8bbdc8d2d323e9ea7
- Trigger Event: push

File details

Details for the file ulmen-1.0.1-cp310-abi3-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

Download URL: ulmen-1.0.1-cp310-abi3-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Upload date: Apr 15, 2026
Size: 402.0 kB
Tags: CPython 3.10+, manylinux: glibc 2.17+ x86-64
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for ulmen-1.0.1-cp310-abi3-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm	Hash digest
SHA256	`e20c21a3fd04fe32696495f5bdb70bbdaf02b7f777e151167203548e048bda99`
MD5	`6718ea10f718119381eb1b03676f4a9d`
BLAKE2b-256	`20c22d08356a8079d2168911e5e59abe5083a604098d3a3dcb8dcc8823d24474`

See more details on using hashes here.

Provenance

The following attestation bundles were made for ulmen-1.0.1-cp310-abi3-manylinux_2_17_x86_64.manylinux2014_x86_64.whl:

Publisher: ci.yml on makroumi/ulmen

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: ulmen-1.0.1-cp310-abi3-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
- Subject digest: e20c21a3fd04fe32696495f5bdb70bbdaf02b7f777e151167203548e048bda99
- Sigstore transparency entry: 1302472699
- Sigstore integration time: Apr 15, 2026
Source repository:
- Permalink: makroumi/ulmen@3023381663c53565e09a42c8bbdc8d2d323e9ea7
- Branch / Tag: refs/heads/main
- Owner: https://github.com/makroumi
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: ci.yml@3023381663c53565e09a42c8bbdc8d2d323e9ea7
- Trigger Event: push

ulmen 1.0.1

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

ULMEN V1

Table of Contents

Benchmarks

Size

Speed - Encode (median ms, 1,000 records)

Streaming (median 50 runs, 1,000 records)

At a Glance

Surfaces

Binary: LUMB prefix

Text: records[N]: prefix

ULMEN: L| prefix

Streaming: UlmenStreamEncoder / stream_encode

ULMEN-AGENT: ULMEN-AGENT v1 prefix

Installation

From source (with Rust acceleration)

Python only (no Rust required)

Quick Start

ULMEN-AGENT

API Reference

UlmenDict

UlmenDictRust

UlmenDictRust / UlmenDictFullRust

Streaming encode

Model-level encode/decode

ULMEN-AGENT core

Context compression

Unlimited context

LLM output repair

Multi-agent routing

Cross-payload thread tracking

Audit trail

System prompt generation

ULMEN bridge

MessagePack compatibility

Wire Format Constants

Utilities

Architecture

Running Tests

Format Specification

Versioning

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance

Binary: `LUMB` prefix

Text: `records[N]:` prefix

ULMEN: `L|` prefix

Streaming: `UlmenStreamEncoder` / `stream_encode`

ULMEN-AGENT: `ULMEN-AGENT v1` prefix