json permutation library

These details have not been verified by PyPI

Project links

Project description

J-Perm

A composable JSON transformation DSL with a powerful, extensible architecture.

J-Perm lets you describe data transformations as executable specifications — a list of steps that can be applied to input documents. It supports JSON Pointer addressing with slicing (arrays and strings), template interpolation with ${...} syntax, special constructs ($ref, $eval, $cast, $raw), logical and comparison operators ($and, $or, $not), comparison operators (6 operators plus $in and $exists), mathematical operations (6 operators), comprehensive string manipulation (11 operations), regular expressions (5 operations), user-defined functions ($def, $func, $raise) with loop/function control flow ($break, $continue, $return), error handling (try-except-finally), and a rich set of built-in operations — all with configurable security limits to prevent DoS attacks.

Quick Example

from j_perm import build_default_engine

engine = build_default_engine()

# Source data
source = {
    "users": [
        {"name": "Alice", "age": "17"},
        {"name": "Bob", "age": "22"}
    ]
}

# Transformation spec using foreach and the &: prefix for the loop variable
spec = {
    "op": "foreach",
    "in": "/users",
    "as": "item",
    "do": {
        "op": "if",
        "cond": "${?args.item.age >= `18`}",
        "then": {"/adults[]": "&:/item"},
    },
}

result = engine.apply(spec, source=source, dest={})
# → {"adults": [{"name": "Bob", "age": "22"}]}

Installation

pip install j-perm

(or copy the package into your project)

Architecture Overview

J-Perm is built on a pipeline architecture with two main levels:

┌─────────────────────────────────────────────────────────┐
│  spec (user input)                                      │
│    │                                                    │
│    ▼                                                    │
│  ┌──────────────────────────────────────────────────┐   │
│  │ STAGES (batch preprocessing, priority order)     │   │
│  │  • ShorthandExpansion → expand ~delete, etc      │   │
│  │  • YourCustomStage                               │   │
│  └──────────────────────────────────────────────────┘   │
│    │                                                    │
│    ▼                                                    │
│  List[step]                                             │
│    │                                                    │
│    ▼  for each step:                                    │
│  ┌──────────────────────────────────────────────────┐   │
│  │ MIDDLEWARES (per-step, priority order)           │   │
│  │  • Validation, logging, etc.                     │   │
│  └──────────────────────────────────────────────────┘   │
│    │                                                    │
│    ▼                                                    │
│  ┌──────────────────────────────────────────────────┐   │
│  │ REGISTRY (hierarchical dispatch tree)            │   │
│  │  • SetHandler, CopyHandler, ForeachHandler, ...  │   │
│  └──────────────────────────────────────────────────┘   │
│    │                                                    │
│    │  handlers call ctx.engine.process_value(...)       │
│    └─────────────────────────────────────┐              │
│                                          ▼              │
│  ┌──────────────────────────────────────────────────┐   │
│  │ VALUE PIPELINE (stabilization loop)              │   │
│  │  • SpecialResolveHandler ($ref, $eval)           │   │
│  │  • TemplSubstHandler (${...})                    │   │
│  │  • RecursiveDescentHandler (containers)          │   │
│  │  • IdentityHandler (scalars)                     │   │
│  └──────────────────────────────────────────────────┘   │
└─────────────────────────────────────────────────────────┘

Core Components

Component	Purpose
Engine	Orchestrates pipelines, manages context, runs stabilization loop
Pipeline	Runs stages → middlewares → registry dispatch for each step
StageRegistry	Tree of batch preprocessors (run-all, priority order)
ActionTypeRegistry	Tree of action handlers (first-match or run-all)
ValueResolver	Abstraction for addressing (JSON Pointer implementation)

Core API

Building an Engine

from j_perm import build_default_engine

# Default engine with all built-ins and default security limits
engine = build_default_engine()

# Custom specials (None = use defaults: $ref, $eval, $cast, $and, $or, $not, comparison, math, string, regex)
engine = build_default_engine(
    specials={"$ref": my_ref_handler, "$custom": my_handler},
    casters={"int": lambda x: int(x), "json": lambda x: json.loads(x)},  # Used in ${type:...} AND $cast
    jmes_options=jmespath.Options(custom_functions=CustomFunctions())
)

# Custom security limits (see Security and Limits section)
engine = build_default_engine(
    max_operations=10_000,
    max_function_recursion_depth=50,
    max_loop_iterations=1_000,
    regex_timeout=1.0,
    pow_max_exponent=100,
    # ... see factory.py for all available limits
)

# Logging / debugging (see Logging and Debugging section)
engine = build_default_engine(
    trace_logging=True,      # DEBUG-log every executed step
    trace_repr_max=None,     # show steps without truncation (default: 200)
)

Applying Transformations

result = engine.apply(
    spec,  # DSL script (dict or list)
    source=source,  # Source context (for pointers, templates)
    dest=dest,  # Initial destination (default: {})
)

Returns: Deep copy of the final dest after all transformations.

Security and Limits

J-Perm includes comprehensive protection against DoS attacks through configurable limits. All limits can be customized via build_default_engine() parameters.

Global Limits

Parameter	Default	Description
`max_operations`	1,000,000	Maximum total operations across entire transformation
`max_function_recursion_depth`	100	Maximum depth for recursive function calls

Example: Preventing infinite recursion

engine = build_default_engine(max_function_recursion_depth=50)

# This will raise RuntimeError if recursion exceeds 50 levels
spec = [
    {"$def": "factorial", "params": ["n"], "body": [
        {"op": "if", "cond": {"$eq": [{"$ref": "&:/n"}, 0]},
         "then": [{"/result": 1}],
         "else": [{"/result": {"$mul": [
             {"$ref": "&:/n"},
             {"$func": "factorial", "args": [{"$sub": [{"$ref": "&:/n"}, 1]}]}
         ]}}]}
    ], "return": "/result"},
    {"/output": {"$func": "factorial", "args": [100]}}  # Too deep!
]

Loop and Iteration Limits

Parameter	Default	Description
`max_loop_iterations`	10,000	Maximum iterations for `while` loops
`max_foreach_items`	100,000	Maximum items to process in `foreach`

Example: Preventing infinite loops

engine = build_default_engine(max_loop_iterations=1000)

# This will raise RuntimeError if loop exceeds 1000 iterations
spec = {
    "op": "while",
    "cond": {"$lt": [{"$ref": "@:/counter"}, 999999]},  # Never stops!
    "do": [{"/counter": {"$add": [{"$ref": "@:/counter"}, 1]}}]
}

Mathematical Operation Limits

Parameter	Default	Description
`pow_max_base`	1,000,000	Maximum base value for `$pow`
`pow_max_exponent`	1,000	Maximum exponent value for `$pow`
`mul_max_operand`	1,000,000,000	Maximum numeric operand in `$mul`
`mul_max_string_result`	1,000,000	Maximum string length from `$mul` (e.g., `"x" * n`)
`add_max_number_result`	1e15	Maximum numeric result from `$add`
`add_max_string_result`	100,000,000	Maximum string length from `$add` (concatenation)
`sub_max_number_result`	1e15	Maximum numeric result from `$sub`

Example: Preventing CPU exhaustion

engine = build_default_engine(
    pow_max_base=1000,
    pow_max_exponent=10
)

# This will raise ValueError: exponent exceeds limit
spec = {"/result": {"$pow": [2, 1000]}}  # 2^1000 would consume massive CPU

# This will raise ValueError: base exceeds limit
spec = {"/result": {"$pow": [999999, 2]}}

String Operation Limits

Parameter	Default	Description
`str_max_split_results`	100,000	Maximum results from `$str_split`
`str_max_join_result`	10,000,000	Maximum length of `$str_join` result
`str_max_replace_result`	10,000,000	Maximum length of `$str_replace` result

Example: Preventing memory exhaustion

engine = build_default_engine(str_max_split_results=1000)

# This will raise ValueError if split produces more than 1000 results
spec = {"/words": {"$str_split": {"string": "${/large_text}", "delimiter": " "}}}

Regex Protection (ReDoS Prevention)

Parameter	Default	Description
`regex_timeout`	2.0	Timeout in seconds for regex operations
`regex_allowed_flags`	None	Bitmask of allowed regex flags (None = default safe flags: IGNORECASE, MULTILINE, DOTALL, VERBOSE, ASCII; -1 = all flags allowed)

Example: Preventing ReDoS attacks

engine = build_default_engine(regex_timeout=1.0)

# This will raise TimeoutError if regex takes more than 1 second
spec = {
    "/result": {
        "$regex_match": {
            "pattern": "(a+)+b",  # Catastrophic backtracking pattern
            "string": "aaaaaaaaaaaaaaaaaaaaaaaac"  # No match, tries all combinations
        }
    }
}

Restricting regex flags:

import re

# Only allow case-insensitive and multiline flags
engine = build_default_engine(
    regex_allowed_flags=re.IGNORECASE | re.MULTILINE
)

# This will raise ValueError: prohibited regex flags
spec = {
    "/result": {
        "$regex_match": {
            "pattern": "test",
            "string": "TEST",
            "flags": re.DOTALL  # Not allowed!
        }
    }
}

# Allow all flags (not recommended for untrusted input)
engine = build_default_engine(regex_allowed_flags=-1)

Customizing Limits

All limits can be configured when building the engine:

from j_perm import build_default_engine

# Conservative limits for untrusted input
secure_engine = build_default_engine(
    max_operations=10_000,
    max_function_recursion_depth=10,
    max_loop_iterations=100,
    max_foreach_items=1_000,
    regex_timeout=0.5,
    pow_max_exponent=100,
    str_max_join_result=100_000,
)

# Relaxed limits for trusted environments
permissive_engine = build_default_engine(
    max_operations=10_000_000,
    max_function_recursion_depth=1000,
    max_loop_iterations=1_000_000,
    regex_timeout=10.0,
)

Best practices:

Use conservative limits when processing untrusted user input
Use permissive limits for internal data transformations
Monitor max_operations counter to detect suspicious activity
Test your transformations with realistic data sizes
Tune limits based on your specific use case

Logging and Debugging

J-Perm uses Python's standard logging module under the logger name j_perm.

Error Logging (Language Call Stack)

When an unhandled exception escapes Engine.apply(), j-perm automatically logs the language-level call stack at ERROR level — showing exactly which operations were executing when the error occurred, without Python internals.

import logging
logging.basicConfig(level=logging.ERROR)

engine = build_default_engine()

engine.apply(
    spec=[
        {"op": "foreach", "in": "/users", "as": "user", "do": [
            {"op": "if", "cond": True, "then": [
                {"op": "set", "path": "/result/-", "value": {"$ref": "/missing/path"}}
            ]}
        ]}
    ],
    source={"users": ["Alice"]},
    dest={},
)

Output:

ERROR j_perm: j-perm execution failed: KeyError: 'missing'
Language call stack (innermost last):
  #1   {'op': 'foreach', 'in': '/users', 'as': 'user', 'do': [1 items]}
  #2   {'op': 'if', 'cond': True, 'then': [1 items]}
  #3   {'op': 'set', 'path': '/result/-', 'value': {'$ref': '/missing/path'}}

Important: Errors caught by {"op": "try", ...} inside the spec are not logged — only errors that propagate all the way out of apply() appear in the log. Control flow signals ($break, $continue, $return) are never treated as errors.

The call stack is also attached to the exception itself for programmatic access:

try:
    engine.apply(spec, source=src, dest={})
except Exception as e:
    stack = getattr(e, "_j_perm_lang_stack", None)
    if stack:
        for i, frame in enumerate(stack, 1):
            print(f"  #{i} {frame}")

Trace Logging (Full Execution Log)

To log every step as it executes — even without errors — enable trace_logging:

import logging
logging.basicConfig(level=logging.DEBUG)

engine = build_default_engine(trace_logging=True)

engine.apply(
    spec=[
        {"op": "set", "path": "/name", "value": "Alice"},
        {"op": "foreach", "in": "/tags", "as": "tag", "do": [
            {"op": "set", "path": "/out/-", "value": {"$ref": "&:/tag"}}
        ]},
    ],
    source={"tags": ["x", "y"]},
    dest={},
)

Output (each step indented by nesting depth):

DEBUG j_perm: → {'op': 'set', 'path': '/name', 'value': 'Alice'}
DEBUG j_perm: → {'op': 'foreach', 'in': '/tags', 'as': 'tag', 'do': [1 items]}
DEBUG j_perm:   → {'op': 'set', 'path': '/out/-', 'value': {'$ref': '&:/tag'}}
DEBUG j_perm:   → {'op': 'set', 'path': '/out/-', 'value': {'$ref': '&:/tag'}}

Controlling Step Representation Length

By default, each step is truncated to 200 characters in the call stack and trace output. Use trace_repr_max to change this:

# Increase the limit
engine = build_default_engine(trace_logging=True, trace_repr_max=500)

# Disable truncation — show every step in full
engine = build_default_engine(trace_logging=True, trace_repr_max=None)

Parameter	Default	Description
`trace_logging`	`False`	Emit `DEBUG` log for every executed step
`trace_repr_max`	`200`	Max characters per step representation. `None` = no limit

Value Resolution Tracing

To see how each value is resolved through the value pipeline (template substitution, $ref, $cast, etc.), enable the j_perm.values sub-logger at DEBUG level:

import logging
logging.basicConfig(level=logging.DEBUG)

engine = build_default_engine(trace_logging=True)  # also enable step trace for full picture

engine.apply(
    spec=[{"op": "set", "path": "/greeting", "value": "Hello, ${/name}!"}],
    source={"name": "Alice"},
    dest={},
)

Output — j_perm shows the step, j_perm.values shows each transformation:

DEBUG j_perm:        → {'op': 'set', 'path': '/greeting', 'value': 'Hello, ${/name}!'}
DEBUG j_perm.values:   'Hello, ${/name}!' → 'Hello, Alice!'

Value tracing is independent of trace_logging — you can enable it alone:

import logging

# Enable only value resolution tracing, suppress step-level trace
logging.getLogger("j_perm.values").setLevel(logging.DEBUG)
logging.getLogger("j_perm").setLevel(logging.ERROR)  # suppress step trace

Each line shows one stabilization pass: input → output. Multi-step resolution (e.g., $ref returning a template that itself gets substituted) appears as multiple lines, indented to the current call depth.

Named Pipeline Tracing

Each named pipeline gets its own logger: j_perm.pipeline.<name>. This lets you turn on tracing for all pipelines at once or zoom in on a specific one.

import logging

# All named pipelines
logging.getLogger("j_perm.pipeline").setLevel(logging.DEBUG)

# Only the "normalize" pipeline
logging.getLogger("j_perm.pipeline.normalize").setLevel(logging.DEBUG)

# Silence a specific pipeline while keeping others
logging.getLogger("j_perm.pipeline.verbose_one").setLevel(logging.WARNING)

To produce step-level output inside a named pipeline, create it with track_execution=True:

from j_perm import Pipeline, ActionTypeRegistry, ActionNode

my_reg = ActionTypeRegistry()
# ... register handlers ...
my_pipeline = Pipeline(registry=my_reg, track_execution=True)
engine.register_pipeline("normalize", my_pipeline)

When run_pipeline("normalize", ...) is called, the pipeline's logger emits a → [pipeline:normalize] entry, and if track_execution=True, each step follows indented relative to the caller's depth:

DEBUG j_perm: → {'op': 'foreach', 'in': '/items', 'as': 'item', 'do': [1 items]}
DEBUG j_perm.pipeline.normalize:   → [pipeline:normalize]
DEBUG j_perm.pipeline.normalize:   → {'op': 'set', 'path': '/value', 'value': ...}

On error, the call stack (including both the outer context and the pipeline's own steps) is logged at ERROR level to j_perm.pipeline.<name>.

Logger Hierarchy

J-Perm uses four loggers, all configurable independently via Python's standard logging module:

Logger	Level	When active
`j_perm`	`ERROR`	Unhandled error — logs language call stack
`j_perm`	`DEBUG`	Step trace (requires `trace_logging=True` on engine)
`j_perm.values`	`DEBUG`	Value resolution steps in `process_value`
`j_perm.pipeline.<name>`	`ERROR`	Named pipeline error — logs call stack
`j_perm.pipeline.<name>`	`DEBUG`	Named pipeline step trace (requires `track_execution=True` on pipeline)

All j_perm.pipeline.* loggers are children of j_perm.pipeline, which is itself a child of j_perm — so the standard Python logger hierarchy applies:

import logging

# Everything from j-perm (step trace + value trace + all pipeline traces)
logging.getLogger("j_perm").setLevel(logging.DEBUG)

# Only errors, no trace noise
logging.getLogger("j_perm").setLevel(logging.ERROR)

# Suppress all j-perm logging
logging.getLogger("j_perm").setLevel(logging.CRITICAL)

# Step trace only, no value noise
logging.getLogger("j_perm").setLevel(logging.DEBUG)
logging.getLogger("j_perm.values").setLevel(logging.WARNING)

# All named pipeline traces, but not main-pipeline step trace
logging.getLogger("j_perm").setLevel(logging.WARNING)
logging.getLogger("j_perm.pipeline").setLevel(logging.DEBUG)

# Only one specific pipeline
logging.getLogger("j_perm.pipeline").setLevel(logging.WARNING)
logging.getLogger("j_perm.pipeline.normalize").setLevel(logging.DEBUG)

Features

1. JSON Pointer Addressing

J-Perm uses RFC 6901 JSON Pointer with extensions:

from j_perm import PointerResolver

resolver = PointerResolver()

# Basic pointers
resolver.get("/users/0/name", data)  # → "Alice"

# Root references (work on scalars too!)
resolver.get(".", 42)  # → 42
resolver.get("/", "text")  # → "text"

# Parent navigation
resolver.get("/a/b/../c", data)  # → data["a"]["c"]

# Slices (work on lists and strings)
resolver.get("/items[1:3]", data)  # → [item1, item2] for lists
resolver.get("/text[0:5]", {"text": "hello world"})  # → "hello" for strings
resolver.get("/text[-5:]", {"text": "hello world"})  # → "world" (negative indices)

# Append notation
resolver.set("/items/-", data, "new")  # Append to list

Key feature: Unlike standard JSON Pointer, PointerResolver works on any type (scalars, lists, dicts) for root references.

Data Source Prefixes

J-Perm supports prefixes to specify which context to read from:

Prefix	Source	Description
`/path` or `_:/path`	source	Read from the immutable source document
`@:/path`	dest	Read from the destination being built
`&:/path`	args	Read from `temp_read_only` — function arguments, loop variables, error info
`!:/path`	temp	Read from `temp` — mutable scratch space, not in final output

The &: prefix is the standard way to access:

Function parameters inside $def bodies
Loop variables inside foreach do blocks
Error info (_error_message, _error_type) inside try except blocks

Example: Accessing dest in templates

# Build incrementally, referencing previous values
spec = [
    {"/name": "Alice"},
    {"/greeting": "Hello, ${@:/name}!"}  # Reference dest value
]

result = engine.apply(spec, source={}, dest={})
# → {"name": "Alice", "greeting": "Hello, Alice!"}

Example: Function parameters via &:

spec = [
    {
        "$def": "greet",
        "params": ["name"],
        "body": [{"/msg": "Hello, ${&:/name}!"}],
        "return": "/msg",
    },
    {"/result": {"$func": "greet", "args": ["World"]}},
]

result = engine.apply(spec, source={}, dest={})
# → {"result": "Hello, World!"}

Example: Loop variable via &:

spec = {
    "op": "foreach",
    "in": "/items",
    "as": "item",
    "do": {"/out[]": "&:/item"},
}

result = engine.apply(spec, source={"items": [1, 2, 3]}, dest={})
# → {"out": [1, 2, 3]}

2. Template Interpolation (`${...}`)

Templates are resolved by TemplSubstHandler in the value pipeline.

JSON Pointer lookup

"${/user/name}"     # → Resolve pointer from source
"${@:/total}"       # → Read from dest
"${&:/param_name}"  # → Read function argument / loop variable
"${!:/scratch}"     # → Read from temp scratch space
"${_:/user/name}"   # → Same as ${/user/name} (source alias)

Type casters (built-in)

"${int:/age}"  # → int(value)
"${float:/price}"
"${bool:/flag}"  # → bool(int(value)) if int/str, else bool(value)
"${str:/id}"

Note: Type casters can also be used via the $cast construct (see Special Constructs section).

JMESPath queries

"${?source.items[?price > `10`].name}"  # → Query source with JMESPath
"${?dest.total}"                         # → Query destination
"${?add(dest.x, source.y)}"              # → Mix source and dest
"${?args.item.age >= `18`}"              # → Query function arg / loop variable
"${?temp.scratch}"                       # → Query temp scratch space

Built-in JMESPath functions: add(a, b), subtract(a, b)

JMESPath data namespaces:

Namespace	Context field	Description
`source.*`	`ctx.source`	Source document
`dest.*`	`ctx.dest`	Destination being built
`args.*`	`ctx.temp_read_only`	Function args, loop vars, error info
`temp.*`	`ctx.temp`	Mutable scratch space

Nested templates

"${${/path_to_field}}"  # → Resolve inner template first

Escaping

$${ → ${ (literal)
$$  → $  (literal)

3. Special Constructs

Special values are resolved by SpecialResolveHandler.

`$ref` — Reference resolution

{
    "$ref": "/path/to/value",
    "$default": "fallback"
}

Resolves pointer from source context (supports all prefixes: @:, &:, !:, _:)
Returns deep copy (no aliasing)
Supports $default fallback

`$eval` — Nested evaluation

{
    "$eval": [
        {
            "op": "set",
            "path": "/x",
            "value": 1
        }
    ],
    "$select": "/x"
}

Executes nested DSL with dest={}
Optionally selects sub-path from result

`$cast` — Type casting

{
    "$cast": {
        "value": "42",
        "type": "int"
    }
}

Applies a registered type caster to a value
value — the value to cast (supports templates, $ref, etc.)
type — name of the registered caster (built-in: int, float, bool, str)
Alternative to template syntax ${type:...}

Examples:

# Cast string to int
{"/age": {"$cast": {"value": "25", "type": "int"}}}

# Cast with template substitution
{"/count": {"$cast": {"value": "${/raw_count}", "type": "int"}}}

# Cast with $ref
{"/price": {"$cast": {"value": {"$ref": "/data/price"}, "type": "float"}}}

# Dynamic type selection
{"/result": {"$cast": {"value": "123", "type": "${/target_type}"}}}

Custom casters:

# Define custom caster
def custom_upper(x):
    return str(x).upper()

engine = build_default_engine(casters={"upper": custom_upper})

# Use in spec
{"/name": {"$cast": {"value": "alice", "type": "upper"}}}  # → "ALICE"

`$raw` — Return a literal without processing

$raw has two forms:

Wrapper construct — returns the value as-is, preventing all value-pipeline evaluation:

{"$raw": {"$ref": "/not/evaluated"}}
{"$raw": "hello ${not_substituted}"}
{"$raw": [{"$ref": "/a"}, {"$ref": "/b"}]}

The wrapped value is never passed through template substitution, $ref resolution, or any other pipeline stage. Use this to store construct-shaped data as a literal.

Flag on any construct — add "$raw": true to stop the stabilisation loop after the construct resolves:

{"$ref": "/path", "$raw": true}
{"$func": "myFunc", "$raw": true}
{"$add": [1, 2], "$raw": true}

Without the flag, process_value keeps iterating until the result stabilises — so if $ref returns a value that itself contains a $ref, that too will be resolved. With "$raw": true, the loop stops after the first resolution and returns the result as-is.

Example — preventing chain resolution:

# source["/a"] contains another construct
source = {"a": {"$ref": "/b"}, "b": "final"}

# Without $raw: True — both hops resolved
spec = {"/result": {"$ref": "/a"}}
# → {"result": "final"}

# With $raw: True — only first hop resolved
spec = {"/result": {"$ref": "/a", "$raw": True}}
# → {"result": {"$ref": "/b"}}

Example — storing a construct as a literal:

spec = [
    # Store a construct literally (not evaluated)
    {"/template": {"$raw": {"$ref": "/data"}}},
    # Later retrieve it — still unevaluated
    {"/copy": {"$ref": "@:/template", "$raw": True}},
]
result = engine.apply(spec, source={"data": "value"}, dest={})
# → {"template": {"$ref": "/data"}, "copy": {"$ref": "/data"}}

`$and` — Logical AND with short-circuit

{
    "$and": [
        {"$ref": "/x"},
        {"$gt": [{"$ref": "/y"}, 10]},
        {"$eq": [{"$ref": "/status"}, "active"]}
    ]
}

Processes values in order through value pipeline
Returns last result if all are truthy
Short-circuits and returns first falsy result

Example:

# Check multiple conditions
spec = {
    "/is_valid": {
        "$and": [
            {"$ref": "/user/name"},           # truthy if name exists
            {"$gte": [{"$ref": "/user/age"}, 18]},  # age >= 18
            {"$in": ["admin", {"$ref": "/user/roles"}]}  # has admin role
        ]
    }
}

`$or` — Logical OR with short-circuit

{
    "$or": [
        {"$ref": "/x"},
        {"$ref": "/y"},
        {"$ref": "/z"}
    ]
}

Processes values in order through value pipeline
Returns first truthy result
Returns last result if all are falsy

Example:

# Provide fallback values
spec = {
    "/display_name": {
        "$or": [
            {"$ref": "/user/preferred_name"},
            {"$ref": "/user/full_name"},
            {"$ref": "/user/email"},
            "Unknown User"
        ]
    }
}

`$not` — Logical negation

{
    "$not": {"$ref": "/disabled"}
}

Processes value through value pipeline
Returns logical negation of the result

Example:

# Negate condition
spec = {
    "/is_enabled": {
        "$not": {"$ref": "/settings/disabled"}
    }
}

Comparison Operators

J-Perm provides comparison operators that work with any values:

$gt — Greater than

{"$gt": [10, 5]}  → true
{"$gt": ["${/age}", 18]}  → true if age > 18

$gte — Greater than or equal

{"$gte": [10, 10]}  → true
{"$gte": [{"$ref": "/count"}, 100]}  → true if count >= 100

$lt — Less than

{"$lt": [5, 10]}  → true
{"$lt": ["${/price}", 50]}  → true if price < 50

$lte — Less than or equal

{"$lte": [10, 10]}  → true
{"$lte": [{"$ref": "/temperature"}, 30]}  → true if temperature <= 30

$eq — Equal

{"$eq": [10, 10]}  → true
{"$eq": ["${/status}", "active"]}  → true if status == "active"

$ne — Not equal

{"$ne": [10, 5]}  → true
{"$ne": ["${/role}", "admin"]}  → true if role != "admin"

Usage in conditions:

spec = [
    {"/age": 25},
    {
        "op": "if",
        "cond": {"$gte": [{"$ref": "@:/age"}, 18]},
        "then": [{"/is_adult": True}],
        "else": [{"/is_adult": False}],
    },
]

result = engine.apply(spec, source={}, dest={})
# → {"age": 25, "is_adult": True}

Features:

All operators accept exactly 2 values in a list
Values are processed through process_value (support templates, $ref, $cast, etc.)
Can be nested and combined with logical operators

Membership and Existence Operators

$in — Python-style membership test

Works with strings (substring), lists (element), and dicts (key):

{"$in": ["world", "hello world"]}  → true (substring)
{"$in": [2, [1, 2, 3]]}             → true (element in list)
{"$in": ["key", {"key": "val"}]}    → true (key in dict)

$exists — Check if a path resolves

Returns true if the pointer can be resolved without error, false otherwise. Supports all context prefixes (@:, &:, !:, _:, or plain /).

{"$exists": "/user/name"}    → true if source has user.name
{"$exists": "@:/result"}     → true if dest has /result
{"$exists": "&:/param"}      → true if arg named 'param' was passed to the function

Example — conditional processing:

spec = {
    "op": "if",
    "cond": {"$exists": "/optional_field"},
    "then": [{"/result": "${/optional_field}"}],
    "else": [{"/result": "default"}],
}

Example — template path:

{"/ok": {"$exists": "/user/${/field_name}"}}

Mathematical Operators

J-Perm provides mathematical operators with support for 1+ operands:

$add — Addition

{"$add": [10]}           → 10
{"$add": [10, 5]}        → 15
{"$add": [1, 2, 3, 4]}   → 10  (1 + 2 + 3 + 4)

$sub — Subtraction

{"$sub": [10]}           → 10
{"$sub": [10, 5]}        → 5
{"$sub": [100, 20, 10]}  → 70  ((100 - 20) - 10)

$mul — Multiplication

{"$mul": [5]}            → 5
{"$mul": [10, 5]}        → 50
{"$mul": [2, 3, 4]}      → 24  ((2 * 3) * 4)

$div — Division

{"$div": [10]}           → 10
{"$div": [10, 5]}        → 2.0
{"$div": [100, 2, 5]}    → 10.0  ((100 / 2) / 5)

$pow — Exponentiation

{"$pow": [2]}            → 2
{"$pow": [2, 3]}         → 8
{"$pow": [2, 3, 2]}      → 64  ((2 ** 3) ** 2)

$mod — Modulo

{"$mod": [10]}           → 10
{"$mod": [10, 3]}        → 1
{"$mod": [100, 7, 3]}    → 2  ((100 % 7) % 3)

Nested expressions:

# Calculate: (price * quantity) + shipping
spec = {
    "/total": {
        "$add": [
            {"$mul": [{"$ref": "/price"}, {"$ref": "/quantity"}]},
            {"$ref": "/shipping"}
        ]
    }
}

# Complex: ((10 + 5) * 2) - 3 = 27
spec = {
    "/result": {
        "$sub": [
            {"$mul": [{"$add": [10, 5]}, 2]},
            3
        ]
    }
}

Features:

Accept 1+ operands (1 operand: returns the value itself)
2+ operands: apply operation left-to-right
Values are processed through process_value (support templates, $ref, $cast, etc.)
Can be nested to create complex expressions
Work seamlessly with comparison operators in conditions

4. String Operations

J-Perm provides comprehensive string manipulation constructs:

Split and Join

# Split string by delimiter
{"$str_split": {"string": "a,b,c", "delimiter": ","}}  → ["a", "b", "c"]
{"$str_split": {"string": "a:b:c", "delimiter": ":", "maxsplit": 1}}  → ["a", "b:c"]

# Join array into string
{"$str_join": {"array": ["a", "b", "c"], "separator": "-"}}  → "a-b-c"
{"$str_join": {"array": [1, 2, 3], "separator": ","}}  → "1,2,3"

Slicing

# Extract substring
{"$str_slice": {"string": "hello", "start": 1, "end": 4}}  → "ell"
{"$str_slice": {"string": "hello", "start": 2}}  → "llo"
{"$str_slice": {"string": "hello", "end": 3}}  → "hel"
{"$str_slice": {"string": "hello", "start": -3}}  → "llo"

Note: String slicing is also supported in JSON Pointer syntax:

{"$ref": "/text[0:5]"}    # first 5 characters
{"$ref": "/text[6:]"}     # from 6th character to end
{"$ref": "/text[-5:]"}    # last 5 characters

Case Conversion

{"$str_upper": "hello"}  → "HELLO"
{"$str_lower": "HELLO"}  → "hello"

Trimming

# Strip whitespace (default)
{"$str_strip": "  hello  "}  → "hello"
{"$str_lstrip": "  hello  "}  → "hello  "
{"$str_rstrip": "  hello  "}  → "  hello"

# Strip specific characters
{"$str_strip": {"string": "***hello***", "chars": "*"}}  → "hello"
{"$str_lstrip": {"string": "___hello", "chars": "_"}}  → "hello"
{"$str_rstrip": {"string": "hello___", "chars": "_"}}  → "hello"

Replace

{"$str_replace": {"string": "hello", "old": "ll", "new": "rr"}}  → "herro"
{"$str_replace": {"string": "aaa", "old": "a", "new": "b", "count": 2}}  → "bba"

String Checks

{"$str_contains": {"string": "hello world", "substring": "world"}}  → true
{"$str_startswith": {"string": "hello", "prefix": "he"}}  → true
{"$str_endswith": {"string": "hello", "suffix": "lo"}}  → true

5. Regular Expressions

J-Perm supports powerful regex operations using Python's re module:

Match and Search

# Check if entire string matches pattern
{"$regex_match": {"pattern": "^\\d+$", "string": "123"}}  → true
{"$regex_match": {"pattern": "^\\d+$", "string": "abc"}}  → false

# Find first occurrence
{"$regex_search": {"pattern": "\\d+", "string": "abc123def"}}  → "123"
{"$regex_search": {"pattern": "\\d+", "string": "abc"}}  → null

Find All Matches

{"$regex_findall": {"pattern": "\\d+", "string": "a1b2c3"}}  → ["1", "2", "3"]
{"$regex_findall": {"pattern": "\\d+", "string": "abc"}}  → []

Replace with Regex

# Simple replacement
{"$regex_replace": {"pattern": "\\d+", "replacement": "X", "string": "a1b2c3"}}  → "aXbXcX"

# With backreferences
{"$regex_replace": {
    "pattern": "(\\w+)@(\\w+)",
    "replacement": "\\1 AT \\2",
    "string": "user@domain"
}}  → "user AT domain"

# Limited replacements
{"$regex_replace": {"pattern": "\\d+", "replacement": "X", "string": "a1b2c3", "count": 2}}  → "aXbXc3"

Extract Capture Groups

{"$regex_groups": {"pattern": "(\\w+)@(\\w+)", "string": "user@domain"}}  → ["user", "domain"]
{"$regex_groups": {"pattern": "(\\d+)-(\\d+)", "string": "123-456"}}  → ["123", "456"]

Optional flags parameter: All regex constructs accept optional flags parameter (e.g., re.IGNORECASE = 2):

{"$regex_match": {"pattern": "^hello$", "string": "HELLO", "flags": 2}}  → true

6. Functions and Error Handling

J-Perm supports defining reusable functions and controlled error handling.

`$def` — Define a function

{
    "$def": "myFunction",
    "params": ["arg1", "arg2"],
    "body": [
        {"/result": "${&:/arg1}"},
        {"/total": "${int:${&:/arg2}}"}
    ],
    "return": "/total",
    "context": "copy",
    "on_failure": [
        {"/error": "Function failed"}
    ]
}

params — list of parameter names (optional, default: [])
body — actions to execute when function is called
return — path in local context to return (optional, default: entire dest); superseded by $return if used inside the body
context — how the function's dest is initialized (see below)
on_failure — error handler actions (optional)

Accessing parameters:

Inside the function body, parameters are available via the &: prefix:

spec = [
    {
        "$def": "greet",
        "params": ["name"],
        "body": [{"/msg": "Hello, ${&:/name}!"}],
        "return": "/msg",
    },
    {"/result": {"$func": "greet", "args": ["World"]}},
]
# → {"result": "Hello, World!"}

Accessing original source:

The original source document is always accessible via the plain / pointer (or _: alias):

spec = [
    {
        "$def": "getConfig",
        "body": [{"/cfg": {"$ref": "/config/key"}}],
        "return": "/cfg",
    },
    {"/result": {"$func": "getConfig"}},
]

result = engine.apply(spec, source={"config": {"key": "production"}}, dest={})
# → {"result": "production"}

context parameter — dest initialization mode:

Value	Behavior
`"copy"` (default)	Function body operates on a deep copy of the caller's `dest`. Mutations stay local.
`"new"`	Function body starts with an empty `dest = {}`. Cannot see the caller's dest.
`"shared"`	Function body operates on the same dest object as the caller. Mutations are visible to the caller.

# context: "copy" (default) — isolated
spec = [
    {"$def": "f", "body": [{"/internal": 99}]},
    {"/result": {"$func": "f"}},
]
# "internal" does NOT appear at the top level of the outer dest

# context: "new" — fresh slate
spec = [
    {"/outer": "hello"},
    {
        "$def": "f",
        "context": "new",
        "body": [{"/saw_outer": {"$exists": "@:/outer"}}],
        "return": "/saw_outer",
    },
    {"/result": {"$func": "f"}},
]
# → {"outer": "hello", "result": false}  (function can't see /outer)

# context: "shared" — direct mutation
spec = [
    {"$def": "f", "context": "shared", "body": [{"/shared_key": True}]},
    {"$func": "f"},
]
# → {"shared_key": true}  (mutation visible in outer dest)

`$func` — Call a function

{
    "$func": "myFunction",
    "args": [10, 20]
}

args — list of arguments to pass (optional, default: [])

Functions are stored in the execution context metadata and can be called multiple times within the same transformation.

`$raise` — Raise an error

{
    "$raise": "Invalid data: ${/error_details}"
}

Raises a JPermError with the specified message. The error can be:

Caught by on_failure handlers in function definitions
Used for validation and control flow
Combined with templates for dynamic error messages

Example with error handling:

spec = [
    {
        "$def": "validateAge",
        "params": ["age"],
        "body": [
            {
                "op": "if",
                "cond": {"$lt": [{"$ref": "&:/age"}, 0]},
                "then": [{"$raise": "Age cannot be negative"}]
            },
            {"/valid": True}
        ],
        "return": "/valid",
        "on_failure": [{"/validation_failed": True}]
    },
    {"/result": {"$func": "validateAge", "args": [25]}}
]

7. Loop Control Flow

$break, $continue, and $return are control flow commands for interrupting loops and functions. They are top-level actions (registered in the main pipeline) and work from anywhere inside a loop or function body — including inside nested if, try, or even other loops.

`$break` — Exit a loop

{"$break": null}

Stops the innermost foreach or while loop immediately. Any changes made to dest before $break in the current iteration are preserved.

spec = {
    "op": "foreach",
    "in": "/items",
    "as": "item",
    "do": [
        {
            "op": "if",
            "cond": {"$eq": [{"$ref": "&:/item"}, "stop"]},
            "then": [{"$break": None}],
        },
        {"/result[]": "&:/item"},
    ],
}

result = engine.apply(spec, source={"items": ["a", "b", "stop", "c"]}, dest={"result": []})
# → {"result": ["a", "b"]}

`$continue` — Skip to the next iteration

{"$continue": null}

Skips the remaining actions in the current iteration and moves to the next element (foreach) or re-evaluates the condition (while). Changes made before $continue are preserved.

spec = {
    "op": "foreach",
    "in": "/numbers",
    "as": "n",
    "do": [
        {
            "op": "if",
            "cond": {"$eq": [{"$mod": [{"$ref": "&:/n"}, 2]}, 0]},
            "then": [{"$continue": None}],   # skip even numbers
        },
        {"/odds[]": "&:/n"},
    ],
}

result = engine.apply(spec, source={"numbers": [1, 2, 3, 4, 5]}, dest={"odds": []})
# → {"odds": [1, 3, 5]}

`$return` — Return a value from a function

{"$return": <value>}

Exits the current function immediately, returning <value> as the function result. The value is evaluated through the value pipeline (supports $ref, templates, constructs). Use null to return None.

This supersedes the "return": "/path" parameter in $def when you need to return from multiple points in the body (early return, return from inside a loop, etc.).

spec = [
    {
        "$def": "sign",
        "params": ["x"],
        "body": [
            {
                "op": "if",
                "cond": {"$gt": [{"$ref": "&:/x"}, 0]},
                "then": [{"$return": "positive"}],
            },
            {
                "op": "if",
                "cond": {"$lt": [{"$ref": "&:/x"}, 0]},
                "then": [{"$return": "negative"}],
            },
            {"$return": "zero"},
        ],
    },
    {"/result": {"$func": "sign", "args": [-3]}},
]

result = engine.apply(spec, source={}, dest={})
# → {"result": "negative"}

Early return from inside a loop:

spec = [
    {
        "$def": "find_first",
        "params": ["items", "target"],
        "body": [
            {
                "op": "foreach",
                "in": "&:/items",
                "as": "item",
                "do": [
                    {
                        "op": "if",
                        "cond": {"$eq": [{"$ref": "&:/item"}, {"$ref": "&:/target"}]},
                        "then": [{"$return": {"$ref": "&:/item"}}],
                    },
                ],
            },
            {"$return": None},   # not found
        ],
    },
    {"/found": {"$func": "find_first", "args": [["a", "b", "c"], "b"]}},
]

result = engine.apply(spec, source={}, dest={})
# → {"found": "b"}

Interaction with `try`

Control flow signals propagate through try blocks — they are never caught by except. The finally block still runs before the signal continues propagating.

spec = [
    {
        "$def": "func",
        "body": [
            {
                "op": "try",
                "do": [{"$return": "early"}],
                "except": [{"/caught": True}],   # NOT reached
                "finally": [{"/cleanup": True}],  # always runs
            },
        ],
    },
    {"/answer": {"$func": "func"}},
]

result = engine.apply(spec, source={}, dest={})
# → {"answer": "early"}   ("caught" is never set, "cleanup" is set inside the function)

8. Shorthand Syntax

Shorthands are expanded by priority-ordered StageProcessors before execution.

`~assert`

{
    "~assert": {
        "/x": 10,
        "/y": 20
    }
}

Expands to:

[
    {
        "op": "assert",
        "path": "/x",
        "equals": 10
    },
    {
        "op": "assert",
        "path": "/y",
        "equals": 20
    }
]

`~delete`

{
    "~delete": [
        "/tmp",
        "/cache"
    ]
}

Expands to:

[
    {
        "op": "delete",
        "path": "/tmp"
    },
    {
        "op": "delete",
        "path": "/cache"
    }
]

Append notation (`field[]`)

{
    "/items[]": 123
}

Expands to:

{
    "op": "set",
    "path": "/items/-",
    "value": 123
}

Pointer assignment

When the value starts with /, @:, &:, !:, or _:, it is treated as a copy-from pointer and expands to an op: copy step:

{"/name": "/user/fullName"}

Expands to:

{
    "op": "copy",
    "from": "/user/fullName",
    "path": "/name",
    "ignore_missing": true
}

This also works with context prefixes:

{"/copy_of": "@:/existing_dest_key"}
{"/arg_val": "&:/param_name"}

Literal assignment

{
    "/status": "active"
}

Expands to:

{
    "op": "set",
    "path": "/status",
    "value": "active"
}

Priority order: ~assert (100) → ~delete (50) → pointer/literal assignment (0)

Built-in Operations

All operations are registered as ActionHandler instances in the main registry.

`set`

Write value to destination path.

{
    "op": "set",
    "path": "/target",
    "value": "...",
    "create": true,
    // Auto-create parents (default: true)
    "extend": true
    // Extend lists on append (default: true)
}

Special: path ending with /- appends to list.

`copy`

Copy value from source to destination.

{
    "op": "copy",
    "from": "/source/path",
    "path": "/dest/path",
    "ignore_missing": false,
    // Skip if missing (default: false)
    "default": "..."
    // Fallback value
}

The from pointer supports all context prefixes (@:, &:, !:, _:).

`delete`

Remove value at path.

{
    "op": "delete",
    "path": "/remove",
    "ignore_missing": true
    // Don't error if missing (default: true)
}

`foreach`

Iterate over array/mapping.

{
    "op": "foreach",
    "in": "/items",
    "as": "item",
    // Variable name (default: "item")
    "do": [
        ...
    ],
    // Nested actions
    "skip_empty": true,
    // Skip if empty (default: true)
    "default": []
    // Fallback if missing
}

Note: If source is a dict, iterates over (key, value) tuples.

Loop control: Use $break to exit the loop early or $continue to skip the rest of the current iteration (see Loop Control Flow).

Accessing the loop variable:

The loop variable is stored in temp_read_only and is accessible inside do blocks via:

&:/item — pointer syntax
${&:/item} — template syntax
{"$ref": "&:/item"} — $ref syntax
${?args.item} — JMESPath syntax

The original source is accessible via the plain / pointer as usual.

spec = {
    "op": "foreach",
    "in": "/products",
    "as": "product",
    "do": {
        "/results[]": {
            "$eval": [
                {"/name": "${&:/product/name}"},
                {"/price": "${&:/product/price}"},
                {"/tax": {"$ref": "/config/tax"}},  # from original source
            ]
        }
    }
}

result = engine.apply(
    spec,
    source={
        "products": [{"name": "A", "price": 100}],
        "config": {"tax": 1.2}
    },
    dest={}
)
# → {"results": [{"name": "A", "price": 100, "tax": 1.2}]}

`while`

Loop while condition holds.

Path mode:

{
    "op": "while",
    "path": "/counter",
    "equals": 0,
    // Or "exists": true
    "do": [
        ...
    ],
    "do_while": false
    // Execute at least once (default: false)
}

Expression mode:

{
    "op": "while",
    "cond": "${?dest.counter < `10`}",
    "do": [
        ...
    ]
}

Note: Condition is checked against destination state. Use do_while: true to execute body at least once before checking condition.

Loop control: Use $break to exit the loop early or $continue to skip the rest of the current iteration and re-evaluate the condition (see Loop Control Flow).

`if`

Conditional execution.

Path mode:

{
    "op": "if",
    "path": "/check",
    "equals": "value",
    // Optional
    "exists": true,
    // Optional
    "then": [
        ...
    ],
    // Success branch
    "else": [
        ...
    ]
    // Failure branch
}

Expression mode:

{
    "op": "if",
    "cond": "${?source.age >= `18`}",
    "then": [
        ...
    ]
}

`exec`

Execute nested script.

From source:

{
    "op": "exec",
    "from": "/script",
    "merge": false
    // Replace dest (default) or merge into it
}

Inline:

{
    "op": "exec",
    "actions": [
        ...
    ]
}

`update`

Merge mapping into target.

{
    "op": "update",
    "path": "/obj",
    "value": {
        "b": 2
    },
    // Or "from": "/source/obj"
    "deep": false
    // Recursive merge (default: false)
}

`distinct`

Remove duplicates from list.

{
    "op": "distinct",
    "path": "/items",
    "key": "/id"
    // Optional: compare by nested field
}

`assert`

Assert value existence/equality.

Basic usage:

{
    "op": "assert",
    // Check source
    "path": "/required",
    "equals": "value"
    // Optional
}

With direct value:

{
    "op": "assert",
    "value": "${?source.computed}",
    // Check computed value instead of path
    "equals": "expected"
}

With return mode:

{
    "op": "assert",
    "path": "/optional",
    "return": true,
    // Return value instead of raising error
    "to_path": "/result"
    // Optional: write result to destination
}

return: true — returns value on success, false on failure (instead of raising error)
to_path — destination path for return value
value — alternative to path, checks direct value

`try`

Execute actions with error handling (try-except-finally pattern).

Basic try-except:

{
    "op": "try",
    "do": [
        {"op": "copy", "from": "/might_not_exist", "path": "/result"}
    ],
    "except": [
        {"/error": "Failed to copy value"}
    ]
}

Access error information:

Inside the except block, error info is available via the &: prefix:

{
    "op": "try",
    "do": [
        {"$raise": "Something went wrong"}
    ],
    "except": [
        {"/error_message": "${&:/_error_message}"},
        {"/error_type": "${&:/_error_type}"}
    ]
}

With finally cleanup:

{
    "op": "try",
    "do": [
        {"/status": "processing"},
        {"op": "exec", "from": "/dangerous_operation"}
    ],
    "except": [
        {"/status": "error"},
        {"/error_msg": "${&:/_error_message}"}
    ],
    "finally": [
        {"/processed_at": "2024-01-01"},
        {"/cleanup": true}
    ]
}

Behavior:

Executes actions in do block
If error occurs:
- Error info stored in temp_read_only (_error_type, _error_message)
- If except block provided, executes it with error info accessible via &:/ prefix
- If no except, re-raises error after executing finally (if present)
finally block always executes (even on error)
Control flow signals ($break, $continue, $return) are not caught by except — they propagate through the try block; finally still runs before they propagate

Error info in except:

&:/_error_type — error class name (e.g., "JPermError")
&:/_error_message — error message string

Example: Validation with fallback

spec = {
    "op": "try",
    "do": [
        {"/age": {"$cast": {"value": "${/user_input}", "type": "int"}}},
        {
            "op": "if",
            "cond": {"$lt": [{"$ref": "@:/age"}, 0]},
            "then": [{"$raise": "Age cannot be negative"}]
        },
        {"/valid": True}
    ],
    "except": [
        {"/valid": False},
        {"/error": "${&:/_error_message}"}
    ]
}

result = engine.apply(spec, source={"user_input": "-5"}, dest={})
# → {"age": -5, "valid": False, "error": "Age cannot be negative"}

Extending J-Perm

Custom Operations

Create a new ActionHandler and register it:

from j_perm import ActionHandler, ActionNode, OpMatcher, ExecutionContext


class MyOpHandler(ActionHandler):
    def execute(self, step, ctx: ExecutionContext):
        # Your logic here
        return ctx.dest


# Register in main registry (in build_default_engine or custom factory)
registry.register(ActionNode(
    name="my_op",
    priority=10,
    matcher=OpMatcher("my_op"),
    handler=MyOpHandler(),
))

Custom Special Constructs

Add a SpecialFn to the specials dict:

def my_special(node, ctx):
    value = ctx.engine.process_value(node["$mySpecial"], ctx)
    return value.upper()


engine = build_default_engine(specials={
    "$ref": ref_handler,
    "$eval": eval_handler,
    "$mySpecial": my_special,
})

Custom Stages

Create a StageProcessor for batch preprocessing:

from j_perm import StageProcessor, StageNode, StageRegistry


class ValidateStage(StageProcessor):
    def apply(self, steps, ctx):
        # Validate/transform steps
        return steps


# Register in main pipeline stages
stages = build_default_shorthand_stages()
stages.register(StageNode(
    name="validate",
    priority=200,  # Higher = runs earlier
    processor=ValidateStage(),
))

# Use in custom engine
main_pipeline = Pipeline(stages=stages, registry=main_registry)

Custom Casters

Casters are used in both template syntax (${type:...}) and the $cast construct.

Provide custom casters via build_default_engine:

import json
from j_perm import build_default_engine

custom_casters = {
    "int": lambda x: int(x),
    "float": lambda x: float(x),
    "json": lambda x: json.loads(x),
    "upper": lambda x: str(x).upper(),
}

engine = build_default_engine(casters=custom_casters)

# Now you can use them in both ways:
spec = [
    {"/age": "${int:/raw_age}"},              # Template syntax
    {"/data": {"$cast": {"value": "{}", "type": "json"}}},  # $cast construct
    {"/name": {"$cast": {"value": "alice", "type": "upper"}}},
]

Or use the default built-in casters: int, float, bool, str.

Using Construct Groups

J-Perm provides pre-organized groups of construct handlers for convenient registration:

from j_perm import build_default_engine
from j_perm import (
    CORE_HANDLERS,                # $ref, $eval
    LOGICAL_HANDLERS,             # $and, $or, $not
    COMPARISON_HANDLERS,          # $gt, $gte, $lt, $lte, $eq, $ne, $in, $exists
    MATH_HANDLERS,                # $add, $sub, $mul, $div, $pow, $mod (with default limits)
    STRING_HANDLERS,              # All string operations (11 constructs, with default limits)
    REGEX_HANDLERS,               # All regex operations (5 constructs, with default limits)
    get_all_handlers,             # Function to get all handlers with casters
    get_all_handlers_with_limits, # Function to get all handlers with custom limits
)

# Build engine with specific groups only
engine = build_default_engine(specials={
    **CORE_HANDLERS,
    **STRING_HANDLERS,
    **REGEX_HANDLERS,
})

# Or extend default engine with additional handlers
from j_perm.casters import BUILTIN_CASTERS

all_handlers = get_all_handlers(casters=BUILTIN_CASTERS)
engine = build_default_engine(specials={
    **all_handlers,
    "$custom": my_custom_handler,
})

# Import individual handlers
from j_perm import (
    str_split_handler,
    str_join_handler,
    regex_match_handler,
    add_handler,
    exists_handler,
    # ... etc
)

Available groups:

CORE_HANDLERS — Core constructs ($ref, $eval)
LOGICAL_HANDLERS — Logical operators ($and, $or, $not)
COMPARISON_HANDLERS — Comparison operators ($gt, $gte, $lt, $lte, $eq, $ne, $in, $exists)
MATH_HANDLERS — Mathematical operators with default limits ($add, $sub, $mul, $div, $pow, $mod)
STRING_HANDLERS — String operations with default limits (11 constructs)
REGEX_HANDLERS — Regular expression operations with default limits (5 constructs)
ALL_HANDLERS_NO_CAST — All handlers except $cast
get_all_handlers(casters) — Function returning all handlers including $cast (with default limits)
get_all_handlers_with_limits(casters, **limits) — Function returning all handlers with custom limits

Example with custom limits:

from j_perm import get_all_handlers_with_limits
from j_perm.casters import BUILTIN_CASTERS

# Build handlers with conservative limits
secure_handlers = get_all_handlers_with_limits(
    casters=BUILTIN_CASTERS,
    regex_timeout=1.0,
    pow_max_exponent=100,
    str_max_join_result=100_000,
    mul_max_string_result=100_000,
    add_max_string_result=1_000_000,
    sub_max_number_result=1e10,
)

engine = build_default_engine(specials=secure_handlers)

Custom Matchers

Implement ActionMatcher or StageMatcher:

from j_perm import ActionMatcher


class PrefixMatcher(ActionMatcher):
    def __init__(self, prefix):
        self.prefix = prefix

    def matches(self, step):
        return isinstance(step, dict) and
            step.get("op", "").startswith(self.prefix)

Async Support

J-Perm provides full support for asynchronous operations through parallel async infrastructure.

Overview

All core components have async counterparts that work seamlessly with Python's async/await:

Sync pipeline (engine.apply()) - for synchronous handlers
Async pipeline (engine.apply_async()) - for async handlers and I/O operations
Mixed mode - sync and async handlers can coexist in the same pipeline

Async Base Classes

from j_perm import AsyncActionHandler, AsyncStageProcessor, AsyncMiddleware

class AsyncHttpHandler(AsyncActionHandler):
    """Async handler for HTTP requests."""

    async def execute(self, step, ctx):
        url = await ctx.engine.process_value_async(step["url"], ctx)

        async with aiohttp.ClientSession() as session:
            async with session.get(url) as response:
                data = await response.json()
                ctx.dest["response"] = data

        return ctx.dest

Using Async Engine

import asyncio
from j_perm import build_default_engine, ActionNode, OpMatcher

# Build engine as usual
engine = build_default_engine()

# Register async handler
engine.main_pipeline.registry.register(ActionNode(
    name="http",
    priority=10,
    matcher=OpMatcher("http"),
    handler=AsyncHttpHandler(),
))

# Use async apply
async def main():
    spec = [
        {"op": "http", "url": "https://api.example.com/data"},
        {"/result": "${@:/response/value}"}
    ]

    result = await engine.apply_async(spec, source={}, dest={})
    print(result)

asyncio.run(main())

Async Methods

Method	Description
`engine.apply_async()`	Async version of `apply()`
`engine.apply_to_context_async()`	Async version of `apply_to_context()`
`engine.process_value_async()`	Async value stabilization
`engine.run_pipeline_async()`	Run named pipeline asynchronously
`pipeline.run_async()`	Async pipeline execution
`registry.run_all_async()`	Async stage execution (for `StageRegistry`)

Mixing Sync and Async

The async pipeline automatically handles both sync and async components:

# Sync handler
class SyncSetHandler(ActionHandler):
    def execute(self, step, ctx):
        ctx.dest["sync"] = True
        return ctx.dest

# Async handler
class AsyncFetchHandler(AsyncActionHandler):
    async def execute(self, step, ctx):
        data = await fetch_data()  # async I/O
        ctx.dest["async"] = data
        return ctx.dest

# Both work in apply_async()
result = await engine.apply_async([
    {"op": "set", ...},      # sync handler
    {"op": "fetch", ...},    # async handler
], source={}, dest={})

When to Use Async

Use async handlers for:

Network I/O - HTTP requests, API calls, webhooks
Database operations - async DB queries
File I/O - async file reads/writes
External services - Cloud APIs, microservices
Concurrent operations - when you need to parallelize work

Sync handlers are fine for:

Pure transformations - data mapping, filtering
Simple operations - set, copy, delete
CPU-bound work - computations without I/O

Example: Async HTTP Handler

import aiohttp
from j_perm import AsyncActionHandler, ActionNode, OpMatcher

class HttpGetHandler(AsyncActionHandler):
    """Fetch data from HTTP endpoint."""

    async def execute(self, step, ctx):
        # Process URL with template support
        url = await ctx.engine.process_value_async(step["url"], ctx)
        headers = await ctx.engine.process_value_async(
            step.get("headers", {}), ctx
        )

        # Make async HTTP request
        async with aiohttp.ClientSession() as session:
            async with session.get(url, headers=headers) as response:
                response.raise_for_status()
                data = await response.json()

        # Write result to destination
        path = await ctx.engine.process_value_async(step["path"], ctx)
        ctx.engine.resolver.set(path, ctx.dest, data)

        return ctx.dest

# Register and use
engine.main_pipeline.registry.register(ActionNode(
    name="http_get",
    priority=10,
    matcher=OpMatcher("http_get"),
    handler=HttpGetHandler(),
))

# Usage
spec = {
    "op": "http_get",
    "url": "https://api.github.com/users/${/username}",
    "headers": {"Accept": "application/json"},
    "path": "/user_data"
}

result = await engine.apply_async(spec, source={"username": "octocat"}, dest={})

Async Stages and Middlewares

You can also create async stages and middlewares:

from j_perm import AsyncStageProcessor, AsyncMiddleware

class AsyncValidationStage(AsyncStageProcessor):
    """Async validation of steps."""

    async def apply(self, steps, ctx):
        # Async validation logic
        await validate_steps(steps)
        return steps

class AsyncLoggingMiddleware(AsyncMiddleware):
    """Log each step asynchronously."""

    name = "async_logger"
    priority = 10

    async def process(self, step, ctx):
        await log_step(step)  # async logging
        return step

Note: Stages and middlewares remain sync by default. Only use async versions when you have actual async I/O in preprocessing/middleware logic.

Advanced Topics

Value Stabilization Loop

When handlers call ctx.engine.process_value(value, ctx), the value pipeline runs repeatedly until:

Output equals input (stable)
value_max_depth iterations reached (default: 50)

This resolves nested templates and special constructs:

# Input: {"$ref": "/path_to_template"}
# Pass 1: {"$ref": ...} → "${/nested}"
# Pass 2: "${/nested}" → "final"
# Pass 3: "final" → "final" (stable ✓)

To stop the loop early and return the current result as-is, use the $raw: true flag or the $raw wrapper construct (see $raw).

PipelineSignal — extensible in-pipeline control flow

PipelineSignal is an abstract base class (defined in core.py) for signals that Pipeline.run intercepts. When a handler raises a PipelineSignal, Pipeline.run calls signal.handle(ctx). If handle re-raises, the signal propagates up to the caller (e.g. Engine.process_value).

This lets you add new pipeline-level behaviours without touching core.py:

from j_perm import PipelineSignal

class MySignal(PipelineSignal):
    def __init__(self, value):
        self.value = value

    def handle(self, ctx):
        ctx.dest = self.value
        raise self  # propagate to stop the stabilisation loop

Built-in signals that inherit from PipelineSignal:

Signal	Raised by	Caught by
`RawValueSignal`	`raw_handler`, `SpecialResolveHandler` (flag)	`Engine.process_value`

Control flow signals inherit from ControlFlowSignal (not PipelineSignal):

Signal	Raised by	Caught by
`BreakSignal`	`$break`	`foreach` / `while` handler
`ContinueSignal`	`$continue`	`foreach` / `while` handler
`ReturnSignal`	`$return`	`$func` call handler

ControlFlowSignal is the common base for all three. Pipeline.run treats them as non-errors and never attaches a language call stack to them. You can catch the base class if you need to intercept any control flow signal propagating out of a custom handler:

from j_perm import ControlFlowSignal

try:
    engine.apply(spec, source=src, dest={})
except ControlFlowSignal:
    # $break / $continue / $return used outside their valid scope
    ...

Hierarchical Registries

Both StageRegistry and ActionTypeRegistry support tree structures:

# Group related operations
math_registry = ActionTypeRegistry()
math_registry.register(ActionNode("add", 10, AddMatcher(), AddHandler()))
math_registry.register(ActionNode("sub", 10, SubMatcher(), SubHandler()))

# Mount as sub-tree
main_registry.register_group(
    "math",
    math_registry,
    matcher=OpMatcher("math"),
    priority=50,
)

Priority and Execution Order

Stages: All matching stages run in priority order (high → low).

Actions: First matching handler executes (unless exclusive=False).

Shorthands:

AssertShorthandProcessor (100) — extracts ~assert
DeleteShorthandProcessor (50) — extracts ~delete
AssignShorthandProcessor (0) — fallback for all remaining keys

Unescape Rules

After value stabilization, registered UnescapeRule callables strip escape sequences:

from j_perm import UnescapeRule

# Built-in: template_unescape (strips $${ → ${, $$ → $)
# Registered at priority 0

# Add custom unescape
engine.unescape_rules.append(
    UnescapeRule(name="custom", priority=10, unescape=my_unescape_fn)
)

API Reference

Core Classes

from j_perm import (
    # Core infrastructure
    ExecutionContext,
    ValueResolver,
    ValueProcessor,
    Engine,
    Pipeline,

    # Stage system
    StageProcessor,
    AsyncStageProcessor,  # Async version
    StageMatcher,
    StageNode,
    StageRegistry,

    # Action system
    ActionHandler,
    AsyncActionHandler,  # Async version
    ActionMatcher,
    ActionNode,
    ActionTypeRegistry,

    # Middleware
    Middleware,
    AsyncMiddleware,  # Async version

    # Pipeline control flow (base classes for extensible signals)
    ControlFlowSignal,  # Base for $break / $continue / $return signals
    PipelineSignal,     # Base for value-pipeline signals (e.g. RawValueSignal)

    # Unescape
    UnescapeRule,
)

Handlers

from j_perm import (
    # Value handlers
    TemplMatcher,
    TemplSubstHandler,
    SpecialMatcher,
    SpecialResolveHandler,
    ContainerMatcher,
    RecursiveDescentHandler,
    IdentityHandler,

    # Special construct functions
    raw_handler,        # $raw wrapper / literal escape
    ref_handler,
    eval_handler,
    make_cast_handler,  # Factory for $cast handler
    and_handler,
    or_handler,
    not_handler,

    # Comparison operators
    gt_handler,
    gte_handler,
    lt_handler,
    lte_handler,
    eq_handler,
    ne_handler,
    in_handler,
    exists_handler,

    # Mathematical operators
    add_handler,
    make_add_handler,     # Factory with configurable limits
    sub_handler,
    make_sub_handler,     # Factory with configurable limits
    mul_handler,
    make_mul_handler,     # Factory with configurable limits
    div_handler,
    pow_handler,
    make_pow_handler,     # Factory with configurable limits
    mod_handler,

    # String operations
    str_split_handler,
    make_str_split_handler,    # Factory with configurable limits
    str_join_handler,
    make_str_join_handler,     # Factory with configurable limits
    str_slice_handler,
    str_upper_handler,
    str_lower_handler,
    str_strip_handler,
    str_lstrip_handler,
    str_rstrip_handler,
    str_replace_handler,
    make_str_replace_handler,  # Factory with configurable limits
    str_contains_handler,
    str_startswith_handler,
    str_endswith_handler,

    # Regex operations
    regex_match_handler,
    make_regex_match_handler,   # Factory with configurable limits
    regex_search_handler,
    make_regex_search_handler,  # Factory with configurable limits
    regex_findall_handler,
    make_regex_findall_handler, # Factory with configurable limits
    regex_replace_handler,
    make_regex_replace_handler, # Factory with configurable limits
    regex_groups_handler,
    make_regex_groups_handler,  # Factory with configurable limits

    # Function handlers
    DefMatcher,
    CallMatcher,
    DefHandler,
    CallHandler,
    RaiseMatcher,
    RaiseHandler,
    JPermError,
    ReturnMatcher,
    ReturnHandler,

    # Loop control flow handlers
    BreakMatcher,
    BreakHandler,
    ContinueMatcher,
    ContinueHandler,

    # Control flow signals (exceptions)
    BreakSignal,       # raised by $break
    ContinueSignal,    # raised by $continue
    ReturnSignal,      # raised by $return (.value holds the return value)
    RawValueSignal,    # raised by $raw / $raw:True flag (.value = raw result)

    # Operation handlers
    SetHandler,
    CopyHandler,
    DeleteHandler,
    ForeachHandler,
    WhileHandler,
    IfHandler,
    ExecHandler,
    UpdateHandler,
    DistinctHandler,
    AssertHandler,
    TryHandler,
)

Utilities

from j_perm import (
    # Matchers
    OpMatcher,
    AlwaysMatcher,

    # Resolver
    PointerResolver,

    # Processor
    PointerProcessor,

    # Casters
    BUILTIN_CASTERS,  # Built-in type casters (int, float, bool, str)

    # Shorthand stages
    AssertShorthandProcessor,
    DeleteShorthandProcessor,
    AssignShorthandProcessor,

    # Factory
    build_default_engine,
    build_default_shorthand_stages,
)

Examples

Example 1: Data Filtering with foreach

spec = {
    "op": "foreach",
    "in": "/products",
    "as": "item",
    "do": {
        "op": "if",
        "cond": "${?args.item.price < `100`}",
        "then": {"/affordable[]": "&:/item"}
    }
}

Example 2: Conditional Copy with Default

spec = {
    "/result": {
        "$ref": "/maybe_missing",
        "$default": "not found"
    }
}

Example 3: Nested Evaluation

spec = {
    "/computed": {
        "$eval": [
            {"op": "set", "path": "/x", "value": "${int:/a}"},
            {"op": "set", "path": "/y", "value": "${int:/b}"}
        ],
        "$select": "${?add(dest.x, dest.y)}"
    }
}

Example 4: Mixed Shorthands

spec = [
    {"~assert": {"/user/id": 123}},
    {"~delete": "/temp"},
    {"/output": "/user/name"}
]

Example 5: Functions with Parameters and Error Handling

spec = [
    {
        "$def": "validateAge",
        "params": ["age"],
        "body": [
            {
                "op": "if",
                "cond": {"$lt": [{"$ref": "&:/age"}, 0]},
                "then": [{"$raise": "Age cannot be negative: ${&:/age}"}]
            },
            {"/valid": True}
        ],
        "return": "/valid",
        "on_failure": [
            {"/validation_failed": True},
            {"/last_error": "Validation error occurred"}
        ]
    },
    {"/user_age_valid": {"$func": "validateAge", "args": [25]}},
]

Example 6: try-except with Error Info

spec = {
    "op": "try",
    "do": [
        {"/age": {"$cast": {"value": "${/user_input}", "type": "int"}}},
        {
            "op": "if",
            "cond": {"$lt": [{"$ref": "@:/age"}, 0]},
            "then": [{"$raise": "Age cannot be negative"}]
        },
        {"/valid": True}
    ],
    "except": [
        {"/valid": False},
        {"/error": "${&:/_error_message}"}
    ]
}

result = engine.apply(spec, source={"user_input": "-5"}, dest={})
# → {"age": -5, "valid": False, "error": "Age cannot be negative"}

Example 7: Loop Control with $break and $continue

# Collect items until sentinel, skipping nulls
spec = {
    "op": "foreach",
    "in": "/stream",
    "as": "item",
    "do": [
        {
            "op": "if",
            "cond": {"$eq": [{"$ref": "&:/item"}, None]},
            "then": [{"$continue": None}],  # skip null
        },
        {
            "op": "if",
            "cond": {"$eq": [{"$ref": "&:/item"}, "END"]},
            "then": [{"$break": None}],     # stop at sentinel
        },
        {"/result[]": "&:/item"},
    ],
}

result = engine.apply(
    spec,
    source={"stream": ["a", None, "b", "END", "c"]},
    dest={"result": []},
)
# → {"result": ["a", "b"]}

Example 8: Early Return from a Function

spec = [
    {
        "$def": "first_positive",
        "params": ["nums"],
        "body": [
            {
                "op": "foreach",
                "in": "&:/nums",
                "as": "n",
                "do": [
                    {
                        "op": "if",
                        "cond": {"$gt": [{"$ref": "&:/n"}, 0]},
                        "then": [{"$return": {"$ref": "&:/n"}}],
                    },
                ],
            },
            {"$return": None},
        ],
    },
    {"/result": {"$func": "first_positive", "args": [[-3, -1, 0, 5, 8]]}},
]

result = engine.apply(spec, source={}, dest={})
# → {"result": 5}

Example 9: $exists for Optional Fields

spec = [
    {
        "op": "if",
        "cond": {"$exists": "/user/middle_name"},
        "then": [{"/display": "${/user/first_name} ${/user/middle_name} ${/user/last_name}"}],
        "else": [{"/display": "${/user/first_name} ${/user/last_name}"}],
    }
]

License

MIT (or adapt to your project as needed)

Contributing

Issues and pull requests welcome!

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

1.8.0

Jun 2, 2026

1.7.0

May 4, 2026

1.6.1

Mar 9, 2026

This version

1.6.0

Feb 20, 2026

1.5.0

Feb 19, 2026

1.4.0

Feb 19, 2026

1.3.1

Feb 17, 2026

1.3.0

Feb 16, 2026

1.2.0

Feb 14, 2026

1.1.0

Feb 13, 2026

1.0.0

Feb 6, 2026

0.2.4

Feb 4, 2026

0.1.3.1

Dec 2, 2025

0.1.3

Dec 1, 2025

0.1.2

Nov 26, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

j_perm-1.6.0.tar.gz (110.0 kB view details)

Uploaded Feb 20, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

j_perm-1.6.0-py3-none-any.whl (77.4 kB view details)

Uploaded Feb 20, 2026 Python 3

File details

Details for the file j_perm-1.6.0.tar.gz.

File metadata

Download URL: j_perm-1.6.0.tar.gz
Upload date: Feb 20, 2026
Size: 110.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.0

File hashes

Hashes for j_perm-1.6.0.tar.gz
Algorithm	Hash digest
SHA256	`8fd6e268e4283d3adbc85d66bb8d1f3aafdcfc0d689c90435b1127a7f592903f`
MD5	`c1b7558cad0ef98605835e9e902d87c4`
BLAKE2b-256	`67023ad68936a8ac9cddee78fd8641d5273cf433e506087147eccf0dcedb19e6`

See more details on using hashes here.

File details

Details for the file j_perm-1.6.0-py3-none-any.whl.

File metadata

Download URL: j_perm-1.6.0-py3-none-any.whl
Upload date: Feb 20, 2026
Size: 77.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.0

File hashes

Hashes for j_perm-1.6.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`2d4184dcd2cb36ec9f80e6752553c1db1c909e404014b424bbc951ccb5cae3bc`
MD5	`f8d8bb87c6edbb9c3b4b779e27ffbd39`
BLAKE2b-256	`32927ac31e07050d2906c693fb0bff6e1e0e805c2282613ffd0ff330f4c1c65f`

See more details on using hashes here.

j-perm 1.6.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Project description

J-Perm

Quick Example

Installation

Architecture Overview

Core Components

Core API

Building an Engine

Applying Transformations

Security and Limits

Global Limits

Loop and Iteration Limits

Mathematical Operation Limits

String Operation Limits

Regex Protection (ReDoS Prevention)

Customizing Limits

Logging and Debugging

Error Logging (Language Call Stack)

Trace Logging (Full Execution Log)

Controlling Step Representation Length

Value Resolution Tracing

Named Pipeline Tracing

Logger Hierarchy

Features

1. JSON Pointer Addressing

Data Source Prefixes

2. Template Interpolation (${...})

JSON Pointer lookup

Type casters (built-in)

JMESPath queries

Nested templates

Escaping

3. Special Constructs

$ref — Reference resolution

$eval — Nested evaluation

$cast — Type casting

$raw — Return a literal without processing

$and — Logical AND with short-circuit

$or — Logical OR with short-circuit

$not — Logical negation

Comparison Operators

Membership and Existence Operators

Mathematical Operators

4. String Operations

Split and Join

Slicing

Case Conversion

Trimming

Replace

String Checks

5. Regular Expressions

Match and Search

Find All Matches

Replace with Regex

Extract Capture Groups

6. Functions and Error Handling

$def — Define a function

$func — Call a function

$raise — Raise an error

7. Loop Control Flow

$break — Exit a loop

$continue — Skip to the next iteration

$return — Return a value from a function

Interaction with try

8. Shorthand Syntax

~assert

~delete

Append notation (field[])

Pointer assignment

Literal assignment

Built-in Operations

set

copy

2. Template Interpolation (`${...}`)

`$ref` — Reference resolution

`$eval` — Nested evaluation

`$cast` — Type casting

`$raw` — Return a literal without processing

`$and` — Logical AND with short-circuit

`$or` — Logical OR with short-circuit

`$not` — Logical negation

`$def` — Define a function

`$func` — Call a function

`$raise` — Raise an error

`$break` — Exit a loop

`$continue` — Skip to the next iteration

`$return` — Return a value from a function

Interaction with `try`

`~assert`

`~delete`

Append notation (`field[]`)

`set`

`copy`

`delete`

`foreach`

`while`

`if`

`exec`

`update`

`distinct`

`assert`

`try`