Skip to main content

json permutation library

Project description

J-Perm

A composable JSON transformation DSL with a powerful, extensible architecture.

J-Perm lets you describe data transformations as executable specifications — a list of steps that can be applied to input documents. It supports JSON Pointer addressing with slicing (arrays and strings), template interpolation with ${...} syntax, special constructs ($ref, $eval, $cast), logical and comparison operators ($and, $or, $not), comparison operators (6 operators plus $in and $exists), mathematical operations (6 operators), comprehensive string manipulation (11 operations), regular expressions (5 operations), user-defined functions ($def, $func, $raise) with loop/function control flow ($break, $continue, $return), error handling (try-except-finally), and a rich set of built-in operations — all with configurable security limits to prevent DoS attacks.


Quick Example

from j_perm import build_default_engine

engine = build_default_engine()

# Source data
source = {
    "users": [
        {"name": "Alice", "age": "17"},
        {"name": "Bob", "age": "22"}
    ]
}

# Transformation spec using foreach and the &: prefix for the loop variable
spec = {
    "op": "foreach",
    "in": "/users",
    "as": "item",
    "do": {
        "op": "if",
        "cond": "${?args.item.age >= `18`}",
        "then": {"/adults[]": "&:/item"},
    },
}

result = engine.apply(spec, source=source, dest={})
# → {"adults": [{"name": "Bob", "age": "22"}]}

Installation

pip install j-perm

(or copy the package into your project)


Architecture Overview

J-Perm is built on a pipeline architecture with two main levels:

┌─────────────────────────────────────────────────────────┐
│  spec (user input)                                      │
│    │                                                    │
│    ▼                                                    │
│  ┌──────────────────────────────────────────────────┐   │
│  │ STAGES (batch preprocessing, priority order)     │   │
│  │  • ShorthandExpansion → expand ~delete, etc      │   │
│  │  • YourCustomStage                               │   │
│  └──────────────────────────────────────────────────┘   │
│    │                                                    │
│    ▼                                                    │
│  List[step]                                             │
│    │                                                    │
│    ▼  for each step:                                    │
│  ┌──────────────────────────────────────────────────┐   │
│  │ MIDDLEWARES (per-step, priority order)           │   │
│  │  • Validation, logging, etc.                     │   │
│  └──────────────────────────────────────────────────┘   │
│    │                                                    │
│    ▼                                                    │
│  ┌──────────────────────────────────────────────────┐   │
│  │ REGISTRY (hierarchical dispatch tree)            │   │
│  │  • SetHandler, CopyHandler, ForeachHandler, ...  │   │
│  └──────────────────────────────────────────────────┘   │
│    │                                                    │
│    │  handlers call ctx.engine.process_value(...)       │
│    └─────────────────────────────────────┐              │
│                                          ▼              │
│  ┌──────────────────────────────────────────────────┐   │
│  │ VALUE PIPELINE (stabilization loop)              │   │
│  │  • SpecialResolveHandler ($ref, $eval)           │   │
│  │  • TemplSubstHandler (${...})                    │   │
│  │  • RecursiveDescentHandler (containers)          │   │
│  │  • IdentityHandler (scalars)                     │   │
│  └──────────────────────────────────────────────────┘   │
└─────────────────────────────────────────────────────────┘

Core Components

Component Purpose
Engine Orchestrates pipelines, manages context, runs stabilization loop
Pipeline Runs stages → middlewares → registry dispatch for each step
StageRegistry Tree of batch preprocessors (run-all, priority order)
ActionTypeRegistry Tree of action handlers (first-match or run-all)
ValueResolver Abstraction for addressing (JSON Pointer implementation)

Core API

Building an Engine

from j_perm import build_default_engine

# Default engine with all built-ins and default security limits
engine = build_default_engine()

# Custom specials (None = use defaults: $ref, $eval, $cast, $and, $or, $not, comparison, math, string, regex)
engine = build_default_engine(
    specials={"$ref": my_ref_handler, "$custom": my_handler},
    casters={"int": lambda x: int(x), "json": lambda x: json.loads(x)},  # Used in ${type:...} AND $cast
    jmes_options=jmespath.Options(custom_functions=CustomFunctions())
)

# Custom security limits (see Security and Limits section)
engine = build_default_engine(
    max_operations=10_000,
    max_function_recursion_depth=50,
    max_loop_iterations=1_000,
    regex_timeout=1.0,
    pow_max_exponent=100,
    # ... see factory.py for all available limits
)

Applying Transformations

result = engine.apply(
    spec,  # DSL script (dict or list)
    source=source,  # Source context (for pointers, templates)
    dest=dest,  # Initial destination (default: {})
)

Returns: Deep copy of the final dest after all transformations.


Security and Limits

J-Perm includes comprehensive protection against DoS attacks through configurable limits. All limits can be customized via build_default_engine() parameters.

Global Limits

Parameter Default Description
max_operations 1,000,000 Maximum total operations across entire transformation
max_function_recursion_depth 100 Maximum depth for recursive function calls

Example: Preventing infinite recursion

engine = build_default_engine(max_function_recursion_depth=50)

# This will raise RuntimeError if recursion exceeds 50 levels
spec = [
    {"$def": "factorial", "params": ["n"], "body": [
        {"op": "if", "cond": {"$eq": [{"$ref": "&:/n"}, 0]},
         "then": [{"/result": 1}],
         "else": [{"/result": {"$mul": [
             {"$ref": "&:/n"},
             {"$func": "factorial", "args": [{"$sub": [{"$ref": "&:/n"}, 1]}]}
         ]}}]}
    ], "return": "/result"},
    {"/output": {"$func": "factorial", "args": [100]}}  # Too deep!
]

Loop and Iteration Limits

Parameter Default Description
max_loop_iterations 10,000 Maximum iterations for while loops
max_foreach_items 100,000 Maximum items to process in foreach

Example: Preventing infinite loops

engine = build_default_engine(max_loop_iterations=1000)

# This will raise RuntimeError if loop exceeds 1000 iterations
spec = {
    "op": "while",
    "cond": {"$lt": [{"$ref": "@:/counter"}, 999999]},  # Never stops!
    "do": [{"/counter": {"$add": [{"$ref": "@:/counter"}, 1]}}]
}

Mathematical Operation Limits

Parameter Default Description
pow_max_base 1,000,000 Maximum base value for $pow
pow_max_exponent 1,000 Maximum exponent value for $pow
mul_max_operand 1,000,000,000 Maximum numeric operand in $mul
mul_max_string_result 1,000,000 Maximum string length from $mul (e.g., "x" * n)
add_max_number_result 1e15 Maximum numeric result from $add
add_max_string_result 100,000,000 Maximum string length from $add (concatenation)
sub_max_number_result 1e15 Maximum numeric result from $sub

Example: Preventing CPU exhaustion

engine = build_default_engine(
    pow_max_base=1000,
    pow_max_exponent=10
)

# This will raise ValueError: exponent exceeds limit
spec = {"/result": {"$pow": [2, 1000]}}  # 2^1000 would consume massive CPU

# This will raise ValueError: base exceeds limit
spec = {"/result": {"$pow": [999999, 2]}}

String Operation Limits

Parameter Default Description
str_max_split_results 100,000 Maximum results from $str_split
str_max_join_result 10,000,000 Maximum length of $str_join result
str_max_replace_result 10,000,000 Maximum length of $str_replace result

Example: Preventing memory exhaustion

engine = build_default_engine(str_max_split_results=1000)

# This will raise ValueError if split produces more than 1000 results
spec = {"/words": {"$str_split": {"string": "${/large_text}", "delimiter": " "}}}

Regex Protection (ReDoS Prevention)

Parameter Default Description
regex_timeout 2.0 Timeout in seconds for regex operations
regex_allowed_flags None Bitmask of allowed regex flags (None = default safe flags: IGNORECASE, MULTILINE, DOTALL, VERBOSE, ASCII; -1 = all flags allowed)

Example: Preventing ReDoS attacks

engine = build_default_engine(regex_timeout=1.0)

# This will raise TimeoutError if regex takes more than 1 second
spec = {
    "/result": {
        "$regex_match": {
            "pattern": "(a+)+b",  # Catastrophic backtracking pattern
            "string": "aaaaaaaaaaaaaaaaaaaaaaaac"  # No match, tries all combinations
        }
    }
}

Restricting regex flags:

import re

# Only allow case-insensitive and multiline flags
engine = build_default_engine(
    regex_allowed_flags=re.IGNORECASE | re.MULTILINE
)

# This will raise ValueError: prohibited regex flags
spec = {
    "/result": {
        "$regex_match": {
            "pattern": "test",
            "string": "TEST",
            "flags": re.DOTALL  # Not allowed!
        }
    }
}

# Allow all flags (not recommended for untrusted input)
engine = build_default_engine(regex_allowed_flags=-1)

Customizing Limits

All limits can be configured when building the engine:

from j_perm import build_default_engine

# Conservative limits for untrusted input
secure_engine = build_default_engine(
    max_operations=10_000,
    max_function_recursion_depth=10,
    max_loop_iterations=100,
    max_foreach_items=1_000,
    regex_timeout=0.5,
    pow_max_exponent=100,
    str_max_join_result=100_000,
)

# Relaxed limits for trusted environments
permissive_engine = build_default_engine(
    max_operations=10_000_000,
    max_function_recursion_depth=1000,
    max_loop_iterations=1_000_000,
    regex_timeout=10.0,
)

Best practices:

  • Use conservative limits when processing untrusted user input
  • Use permissive limits for internal data transformations
  • Monitor max_operations counter to detect suspicious activity
  • Test your transformations with realistic data sizes
  • Tune limits based on your specific use case

Features

1. JSON Pointer Addressing

J-Perm uses RFC 6901 JSON Pointer with extensions:

from j_perm import PointerResolver

resolver = PointerResolver()

# Basic pointers
resolver.get("/users/0/name", data)  # → "Alice"

# Root references (work on scalars too!)
resolver.get(".", 42)  # → 42
resolver.get("/", "text")  # → "text"

# Parent navigation
resolver.get("/a/b/../c", data)  # → data["a"]["c"]

# Slices (work on lists and strings)
resolver.get("/items[1:3]", data)  # → [item1, item2] for lists
resolver.get("/text[0:5]", {"text": "hello world"})  # → "hello" for strings
resolver.get("/text[-5:]", {"text": "hello world"})  # → "world" (negative indices)

# Append notation
resolver.set("/items/-", data, "new")  # Append to list

Key feature: Unlike standard JSON Pointer, PointerResolver works on any type (scalars, lists, dicts) for root references.

Data Source Prefixes

J-Perm supports prefixes to specify which context to read from:

Prefix Source Description
/path or _:/path source Read from the immutable source document
@:/path dest Read from the destination being built
&:/path args Read from temp_read_only — function arguments, loop variables, error info
!:/path temp Read from temp — mutable scratch space, not in final output

The &: prefix is the standard way to access:

  • Function parameters inside $def bodies
  • Loop variables inside foreach do blocks
  • Error info (_error_message, _error_type) inside try except blocks

Example: Accessing dest in templates

# Build incrementally, referencing previous values
spec = [
    {"/name": "Alice"},
    {"/greeting": "Hello, ${@:/name}!"}  # Reference dest value
]

result = engine.apply(spec, source={}, dest={})
# → {"name": "Alice", "greeting": "Hello, Alice!"}

Example: Function parameters via &:

spec = [
    {
        "$def": "greet",
        "params": ["name"],
        "body": [{"/msg": "Hello, ${&:/name}!"}],
        "return": "/msg",
    },
    {"/result": {"$func": "greet", "args": ["World"]}},
]

result = engine.apply(spec, source={}, dest={})
# → {"result": "Hello, World!"}

Example: Loop variable via &:

spec = {
    "op": "foreach",
    "in": "/items",
    "as": "item",
    "do": {"/out[]": "&:/item"},
}

result = engine.apply(spec, source={"items": [1, 2, 3]}, dest={})
# → {"out": [1, 2, 3]}

2. Template Interpolation (${...})

Templates are resolved by TemplSubstHandler in the value pipeline.

JSON Pointer lookup

"${/user/name}"     # → Resolve pointer from source
"${@:/total}"       # → Read from dest
"${&:/param_name}"  # → Read function argument / loop variable
"${!:/scratch}"     # → Read from temp scratch space
"${_:/user/name}"   # → Same as ${/user/name} (source alias)

Type casters (built-in)

"${int:/age}"  # → int(value)
"${float:/price}"
"${bool:/flag}"  # → bool(int(value)) if int/str, else bool(value)
"${str:/id}"

Note: Type casters can also be used via the $cast construct (see Special Constructs section).

JMESPath queries

"${?source.items[?price > `10`].name}"  # → Query source with JMESPath
"${?dest.total}"                         # → Query destination
"${?add(dest.x, source.y)}"              # → Mix source and dest
"${?args.item.age >= `18`}"              # → Query function arg / loop variable
"${?temp.scratch}"                       # → Query temp scratch space

Built-in JMESPath functions: add(a, b), subtract(a, b)

JMESPath data namespaces:

Namespace Context field Description
source.* ctx.source Source document
dest.* ctx.dest Destination being built
args.* ctx.temp_read_only Function args, loop vars, error info
temp.* ctx.temp Mutable scratch space

Nested templates

"${${/path_to_field}}"  # → Resolve inner template first

Escaping

$${ → ${ (literal)
$$  → $  (literal)

3. Special Constructs

Special values are resolved by SpecialResolveHandler.

$ref — Reference resolution

{
    "$ref": "/path/to/value",
    "$default": "fallback"
}
  • Resolves pointer from source context (supports all prefixes: @:, &:, !:, _:)
  • Returns deep copy (no aliasing)
  • Supports $default fallback

$eval — Nested evaluation

{
    "$eval": [
        {
            "op": "set",
            "path": "/x",
            "value": 1
        }
    ],
    "$select": "/x"
}
  • Executes nested DSL with dest={}
  • Optionally selects sub-path from result

$cast — Type casting

{
    "$cast": {
        "value": "42",
        "type": "int"
    }
}
  • Applies a registered type caster to a value
  • value — the value to cast (supports templates, $ref, etc.)
  • type — name of the registered caster (built-in: int, float, bool, str)
  • Alternative to template syntax ${type:...}

Examples:

# Cast string to int
{"/age": {"$cast": {"value": "25", "type": "int"}}}

# Cast with template substitution
{"/count": {"$cast": {"value": "${/raw_count}", "type": "int"}}}

# Cast with $ref
{"/price": {"$cast": {"value": {"$ref": "/data/price"}, "type": "float"}}}

# Dynamic type selection
{"/result": {"$cast": {"value": "123", "type": "${/target_type}"}}}

Custom casters:

# Define custom caster
def custom_upper(x):
    return str(x).upper()

engine = build_default_engine(casters={"upper": custom_upper})

# Use in spec
{"/name": {"$cast": {"value": "alice", "type": "upper"}}}  # → "ALICE"

$and — Logical AND with short-circuit

{
    "$and": [
        {"$ref": "/x"},
        {"$gt": [{"$ref": "/y"}, 10]},
        {"$eq": [{"$ref": "/status"}, "active"]}
    ]
}
  • Processes values in order through value pipeline
  • Returns last result if all are truthy
  • Short-circuits and returns first falsy result

Example:

# Check multiple conditions
spec = {
    "/is_valid": {
        "$and": [
            {"$ref": "/user/name"},           # truthy if name exists
            {"$gte": [{"$ref": "/user/age"}, 18]},  # age >= 18
            {"$in": ["admin", {"$ref": "/user/roles"}]}  # has admin role
        ]
    }
}

$or — Logical OR with short-circuit

{
    "$or": [
        {"$ref": "/x"},
        {"$ref": "/y"},
        {"$ref": "/z"}
    ]
}
  • Processes values in order through value pipeline
  • Returns first truthy result
  • Returns last result if all are falsy

Example:

# Provide fallback values
spec = {
    "/display_name": {
        "$or": [
            {"$ref": "/user/preferred_name"},
            {"$ref": "/user/full_name"},
            {"$ref": "/user/email"},
            "Unknown User"
        ]
    }
}

$not — Logical negation

{
    "$not": {"$ref": "/disabled"}
}
  • Processes value through value pipeline
  • Returns logical negation of the result

Example:

# Negate condition
spec = {
    "/is_enabled": {
        "$not": {"$ref": "/settings/disabled"}
    }
}

Comparison Operators

J-Perm provides comparison operators that work with any values:

$gt — Greater than

{"$gt": [10, 5]}   true
{"$gt": ["${/age}", 18]}   true if age > 18

$gte — Greater than or equal

{"$gte": [10, 10]}   true
{"$gte": [{"$ref": "/count"}, 100]}   true if count >= 100

$lt — Less than

{"$lt": [5, 10]}   true
{"$lt": ["${/price}", 50]}   true if price < 50

$lte — Less than or equal

{"$lte": [10, 10]}   true
{"$lte": [{"$ref": "/temperature"}, 30]}   true if temperature <= 30

$eq — Equal

{"$eq": [10, 10]}   true
{"$eq": ["${/status}", "active"]}   true if status == "active"

$ne — Not equal

{"$ne": [10, 5]}   true
{"$ne": ["${/role}", "admin"]}   true if role != "admin"

Usage in conditions:

spec = [
    {"/age": 25},
    {
        "op": "if",
        "cond": {"$gte": [{"$ref": "@:/age"}, 18]},
        "then": [{"/is_adult": True}],
        "else": [{"/is_adult": False}],
    },
]

result = engine.apply(spec, source={}, dest={})
# → {"age": 25, "is_adult": True}

Features:

  • All operators accept exactly 2 values in a list
  • Values are processed through process_value (support templates, $ref, $cast, etc.)
  • Can be nested and combined with logical operators

Membership and Existence Operators

$in — Python-style membership test

Works with strings (substring), lists (element), and dicts (key):

{"$in": ["world", "hello world"]}   true (substring)
{"$in": [2, [1, 2, 3]]}              true (element in list)
{"$in": ["key", {"key": "val"}]}     true (key in dict)

$exists — Check if a path resolves

Returns true if the pointer can be resolved without error, false otherwise. Supports all context prefixes (@:, &:, !:, _:, or plain /).

{"$exists": "/user/name"}     true if source has user.name
{"$exists": "@:/result"}      true if dest has /result
{"$exists": "&:/param"}       true if arg named 'param' was passed to the function

Example — conditional processing:

spec = {
    "op": "if",
    "cond": {"$exists": "/optional_field"},
    "then": [{"/result": "${/optional_field}"}],
    "else": [{"/result": "default"}],
}

Example — template path:

{"/ok": {"$exists": "/user/${/field_name}"}}

Mathematical Operators

J-Perm provides mathematical operators with support for 1+ operands:

$add — Addition

{"$add": [10]}            10
{"$add": [10, 5]}         15
{"$add": [1, 2, 3, 4]}    10  (1 + 2 + 3 + 4)

$sub — Subtraction

{"$sub": [10]}            10
{"$sub": [10, 5]}         5
{"$sub": [100, 20, 10]}   70  ((100 - 20) - 10)

$mul — Multiplication

{"$mul": [5]}             5
{"$mul": [10, 5]}         50
{"$mul": [2, 3, 4]}       24  ((2 * 3) * 4)

$div — Division

{"$div": [10]}            10
{"$div": [10, 5]}         2.0
{"$div": [100, 2, 5]}     10.0  ((100 / 2) / 5)

$pow — Exponentiation

{"$pow": [2]}             2
{"$pow": [2, 3]}          8
{"$pow": [2, 3, 2]}       64  ((2 ** 3) ** 2)

$mod — Modulo

{"$mod": [10]}            10
{"$mod": [10, 3]}         1
{"$mod": [100, 7, 3]}     2  ((100 % 7) % 3)

Nested expressions:

# Calculate: (price * quantity) + shipping
spec = {
    "/total": {
        "$add": [
            {"$mul": [{"$ref": "/price"}, {"$ref": "/quantity"}]},
            {"$ref": "/shipping"}
        ]
    }
}

# Complex: ((10 + 5) * 2) - 3 = 27
spec = {
    "/result": {
        "$sub": [
            {"$mul": [{"$add": [10, 5]}, 2]},
            3
        ]
    }
}

Features:

  • Accept 1+ operands (1 operand: returns the value itself)
  • 2+ operands: apply operation left-to-right
  • Values are processed through process_value (support templates, $ref, $cast, etc.)
  • Can be nested to create complex expressions
  • Work seamlessly with comparison operators in conditions

4. String Operations

J-Perm provides comprehensive string manipulation constructs:

Split and Join

# Split string by delimiter
{"$str_split": {"string": "a,b,c", "delimiter": ","}}   ["a", "b", "c"]
{"$str_split": {"string": "a:b:c", "delimiter": ":", "maxsplit": 1}}   ["a", "b:c"]

# Join array into string
{"$str_join": {"array": ["a", "b", "c"], "separator": "-"}}   "a-b-c"
{"$str_join": {"array": [1, 2, 3], "separator": ","}}   "1,2,3"

Slicing

# Extract substring
{"$str_slice": {"string": "hello", "start": 1, "end": 4}}   "ell"
{"$str_slice": {"string": "hello", "start": 2}}   "llo"
{"$str_slice": {"string": "hello", "end": 3}}   "hel"
{"$str_slice": {"string": "hello", "start": -3}}   "llo"

Note: String slicing is also supported in JSON Pointer syntax:

{"$ref": "/text[0:5]"}    # first 5 characters
{"$ref": "/text[6:]"}     # from 6th character to end
{"$ref": "/text[-5:]"}    # last 5 characters

Case Conversion

{"$str_upper": "hello"}   "HELLO"
{"$str_lower": "HELLO"}   "hello"

Trimming

# Strip whitespace (default)
{"$str_strip": "  hello  "}   "hello"
{"$str_lstrip": "  hello  "}   "hello  "
{"$str_rstrip": "  hello  "}   "  hello"

# Strip specific characters
{"$str_strip": {"string": "***hello***", "chars": "*"}}   "hello"
{"$str_lstrip": {"string": "___hello", "chars": "_"}}   "hello"
{"$str_rstrip": {"string": "hello___", "chars": "_"}}   "hello"

Replace

{"$str_replace": {"string": "hello", "old": "ll", "new": "rr"}}   "herro"
{"$str_replace": {"string": "aaa", "old": "a", "new": "b", "count": 2}}   "bba"

String Checks

{"$str_contains": {"string": "hello world", "substring": "world"}}   true
{"$str_startswith": {"string": "hello", "prefix": "he"}}   true
{"$str_endswith": {"string": "hello", "suffix": "lo"}}   true

5. Regular Expressions

J-Perm supports powerful regex operations using Python's re module:

Match and Search

# Check if entire string matches pattern
{"$regex_match": {"pattern": "^\\d+$", "string": "123"}}   true
{"$regex_match": {"pattern": "^\\d+$", "string": "abc"}}   false

# Find first occurrence
{"$regex_search": {"pattern": "\\d+", "string": "abc123def"}}   "123"
{"$regex_search": {"pattern": "\\d+", "string": "abc"}}   null

Find All Matches

{"$regex_findall": {"pattern": "\\d+", "string": "a1b2c3"}}   ["1", "2", "3"]
{"$regex_findall": {"pattern": "\\d+", "string": "abc"}}   []

Replace with Regex

# Simple replacement
{"$regex_replace": {"pattern": "\\d+", "replacement": "X", "string": "a1b2c3"}}   "aXbXcX"

# With backreferences
{"$regex_replace": {
    "pattern": "(\\w+)@(\\w+)",
    "replacement": "\\1 AT \\2",
    "string": "user@domain"
}}   "user AT domain"

# Limited replacements
{"$regex_replace": {"pattern": "\\d+", "replacement": "X", "string": "a1b2c3", "count": 2}}   "aXbXc3"

Extract Capture Groups

{"$regex_groups": {"pattern": "(\\w+)@(\\w+)", "string": "user@domain"}}   ["user", "domain"]
{"$regex_groups": {"pattern": "(\\d+)-(\\d+)", "string": "123-456"}}   ["123", "456"]

Optional flags parameter: All regex constructs accept optional flags parameter (e.g., re.IGNORECASE = 2):

{"$regex_match": {"pattern": "^hello$", "string": "HELLO", "flags": 2}}   true

6. Functions and Error Handling

J-Perm supports defining reusable functions and controlled error handling.

$def — Define a function

{
    "$def": "myFunction",
    "params": ["arg1", "arg2"],
    "body": [
        {"/result": "${&:/arg1}"},
        {"/total": "${int:${&:/arg2}}"}
    ],
    "return": "/total",
    "context": "copy",
    "on_failure": [
        {"/error": "Function failed"}
    ]
}
  • params — list of parameter names (optional, default: [])
  • body — actions to execute when function is called
  • return — path in local context to return (optional, default: entire dest); superseded by $return if used inside the body
  • context — how the function's dest is initialized (see below)
  • on_failure — error handler actions (optional)

Accessing parameters:

Inside the function body, parameters are available via the &: prefix:

spec = [
    {
        "$def": "greet",
        "params": ["name"],
        "body": [{"/msg": "Hello, ${&:/name}!"}],
        "return": "/msg",
    },
    {"/result": {"$func": "greet", "args": ["World"]}},
]
# → {"result": "Hello, World!"}

Accessing original source:

The original source document is always accessible via the plain / pointer (or _: alias):

spec = [
    {
        "$def": "getConfig",
        "body": [{"/cfg": {"$ref": "/config/key"}}],
        "return": "/cfg",
    },
    {"/result": {"$func": "getConfig"}},
]

result = engine.apply(spec, source={"config": {"key": "production"}}, dest={})
# → {"result": "production"}

context parameter — dest initialization mode:

Value Behavior
"copy" (default) Function body operates on a deep copy of the caller's dest. Mutations stay local.
"new" Function body starts with an empty dest = {}. Cannot see the caller's dest.
"shared" Function body operates on the same dest object as the caller. Mutations are visible to the caller.
# context: "copy" (default) — isolated
spec = [
    {"$def": "f", "body": [{"/internal": 99}]},
    {"/result": {"$func": "f"}},
]
# "internal" does NOT appear at the top level of the outer dest

# context: "new" — fresh slate
spec = [
    {"/outer": "hello"},
    {
        "$def": "f",
        "context": "new",
        "body": [{"/saw_outer": {"$exists": "@:/outer"}}],
        "return": "/saw_outer",
    },
    {"/result": {"$func": "f"}},
]
# → {"outer": "hello", "result": false}  (function can't see /outer)

# context: "shared" — direct mutation
spec = [
    {"$def": "f", "context": "shared", "body": [{"/shared_key": True}]},
    {"$func": "f"},
]
# → {"shared_key": true}  (mutation visible in outer dest)

$func — Call a function

{
    "$func": "myFunction",
    "args": [10, 20]
}
  • args — list of arguments to pass (optional, default: [])

Functions are stored in the execution context metadata and can be called multiple times within the same transformation.

$raise — Raise an error

{
    "$raise": "Invalid data: ${/error_details}"
}

Raises a JPermError with the specified message. The error can be:

  • Caught by on_failure handlers in function definitions
  • Used for validation and control flow
  • Combined with templates for dynamic error messages

Example with error handling:

spec = [
    {
        "$def": "validateAge",
        "params": ["age"],
        "body": [
            {
                "op": "if",
                "cond": {"$lt": [{"$ref": "&:/age"}, 0]},
                "then": [{"$raise": "Age cannot be negative"}]
            },
            {"/valid": True}
        ],
        "return": "/valid",
        "on_failure": [{"/validation_failed": True}]
    },
    {"/result": {"$func": "validateAge", "args": [25]}}
]

7. Loop Control Flow

$break, $continue, and $return are control flow commands for interrupting loops and functions. They are top-level actions (registered in the main pipeline) and work from anywhere inside a loop or function body — including inside nested if, try, or even other loops.

$break — Exit a loop

{"$break": null}

Stops the innermost foreach or while loop immediately. Any changes made to dest before $break in the current iteration are preserved.

spec = {
    "op": "foreach",
    "in": "/items",
    "as": "item",
    "do": [
        {
            "op": "if",
            "cond": {"$eq": [{"$ref": "&:/item"}, "stop"]},
            "then": [{"$break": None}],
        },
        {"/result[]": "&:/item"},
    ],
}

result = engine.apply(spec, source={"items": ["a", "b", "stop", "c"]}, dest={"result": []})
# → {"result": ["a", "b"]}

$continue — Skip to the next iteration

{"$continue": null}

Skips the remaining actions in the current iteration and moves to the next element (foreach) or re-evaluates the condition (while). Changes made before $continue are preserved.

spec = {
    "op": "foreach",
    "in": "/numbers",
    "as": "n",
    "do": [
        {
            "op": "if",
            "cond": {"$eq": [{"$mod": [{"$ref": "&:/n"}, 2]}, 0]},
            "then": [{"$continue": None}],   # skip even numbers
        },
        {"/odds[]": "&:/n"},
    ],
}

result = engine.apply(spec, source={"numbers": [1, 2, 3, 4, 5]}, dest={"odds": []})
# → {"odds": [1, 3, 5]}

$return — Return a value from a function

{"$return": <value>}

Exits the current function immediately, returning <value> as the function result. The value is evaluated through the value pipeline (supports $ref, templates, constructs). Use null to return None.

This supersedes the "return": "/path" parameter in $def when you need to return from multiple points in the body (early return, return from inside a loop, etc.).

spec = [
    {
        "$def": "sign",
        "params": ["x"],
        "body": [
            {
                "op": "if",
                "cond": {"$gt": [{"$ref": "&:/x"}, 0]},
                "then": [{"$return": "positive"}],
            },
            {
                "op": "if",
                "cond": {"$lt": [{"$ref": "&:/x"}, 0]},
                "then": [{"$return": "negative"}],
            },
            {"$return": "zero"},
        ],
    },
    {"/result": {"$func": "sign", "args": [-3]}},
]

result = engine.apply(spec, source={}, dest={})
# → {"result": "negative"}

Early return from inside a loop:

spec = [
    {
        "$def": "find_first",
        "params": ["items", "target"],
        "body": [
            {
                "op": "foreach",
                "in": "&:/items",
                "as": "item",
                "do": [
                    {
                        "op": "if",
                        "cond": {"$eq": [{"$ref": "&:/item"}, {"$ref": "&:/target"}]},
                        "then": [{"$return": {"$ref": "&:/item"}}],
                    },
                ],
            },
            {"$return": None},   # not found
        ],
    },
    {"/found": {"$func": "find_first", "args": [["a", "b", "c"], "b"]}},
]

result = engine.apply(spec, source={}, dest={})
# → {"found": "b"}

Interaction with try

Control flow signals propagate through try blocks — they are never caught by except. The finally block still runs before the signal continues propagating.

spec = [
    {
        "$def": "func",
        "body": [
            {
                "op": "try",
                "do": [{"$return": "early"}],
                "except": [{"/caught": True}],   # NOT reached
                "finally": [{"/cleanup": True}],  # always runs
            },
        ],
    },
    {"/answer": {"$func": "func"}},
]

result = engine.apply(spec, source={}, dest={})
# → {"answer": "early"}   ("caught" is never set, "cleanup" is set inside the function)

8. Shorthand Syntax

Shorthands are expanded by priority-ordered StageProcessors before execution.

~assert

{
    "~assert": {
        "/x": 10,
        "/y": 20
    }
}

Expands to:

[
    {
        "op": "assert",
        "path": "/x",
        "equals": 10
    },
    {
        "op": "assert",
        "path": "/y",
        "equals": 20
    }
]

~delete

{
    "~delete": [
        "/tmp",
        "/cache"
    ]
}

Expands to:

[
    {
        "op": "delete",
        "path": "/tmp"
    },
    {
        "op": "delete",
        "path": "/cache"
    }
]

Append notation (field[])

{
    "/items[]": 123
}

Expands to:

{
    "op": "set",
    "path": "/items/-",
    "value": 123
}

Pointer assignment

When the value starts with /, @:, &:, !:, or _:, it is treated as a copy-from pointer and expands to an op: copy step:

{"/name": "/user/fullName"}

Expands to:

{
    "op": "copy",
    "from": "/user/fullName",
    "path": "/name",
    "ignore_missing": true
}

This also works with context prefixes:

{"/copy_of": "@:/existing_dest_key"}
{"/arg_val": "&:/param_name"}

Literal assignment

{
    "/status": "active"
}

Expands to:

{
    "op": "set",
    "path": "/status",
    "value": "active"
}

Priority order: ~assert (100) → ~delete (50) → pointer/literal assignment (0)


Built-in Operations

All operations are registered as ActionHandler instances in the main registry.

set

Write value to destination path.

{
    "op": "set",
    "path": "/target",
    "value": "...",
    "create": true,
    // Auto-create parents (default: true)
    "extend": true
    // Extend lists on append (default: true)
}

Special: path ending with /- appends to list.


copy

Copy value from source to destination.

{
    "op": "copy",
    "from": "/source/path",
    "path": "/dest/path",
    "ignore_missing": false,
    // Skip if missing (default: false)
    "default": "..."
    // Fallback value
}

The from pointer supports all context prefixes (@:, &:, !:, _:).


delete

Remove value at path.

{
    "op": "delete",
    "path": "/remove",
    "ignore_missing": true
    // Don't error if missing (default: true)
}

foreach

Iterate over array/mapping.

{
    "op": "foreach",
    "in": "/items",
    "as": "item",
    // Variable name (default: "item")
    "do": [
        ...
    ],
    // Nested actions
    "skip_empty": true,
    // Skip if empty (default: true)
    "default": []
    // Fallback if missing
}

Note: If source is a dict, iterates over (key, value) tuples.

Loop control: Use $break to exit the loop early or $continue to skip the rest of the current iteration (see Loop Control Flow).

Accessing the loop variable:

The loop variable is stored in temp_read_only and is accessible inside do blocks via:

  • &:/item — pointer syntax
  • ${&:/item} — template syntax
  • {"$ref": "&:/item"} — $ref syntax
  • ${?args.item} — JMESPath syntax

The original source is accessible via the plain / pointer as usual.

spec = {
    "op": "foreach",
    "in": "/products",
    "as": "product",
    "do": {
        "/results[]": {
            "$eval": [
                {"/name": "${&:/product/name}"},
                {"/price": "${&:/product/price}"},
                {"/tax": {"$ref": "/config/tax"}},  # from original source
            ]
        }
    }
}

result = engine.apply(
    spec,
    source={
        "products": [{"name": "A", "price": 100}],
        "config": {"tax": 1.2}
    },
    dest={}
)
# → {"results": [{"name": "A", "price": 100, "tax": 1.2}]}

while

Loop while condition holds.

Path mode:

{
    "op": "while",
    "path": "/counter",
    "equals": 0,
    // Or "exists": true
    "do": [
        ...
    ],
    "do_while": false
    // Execute at least once (default: false)
}

Expression mode:

{
    "op": "while",
    "cond": "${?dest.counter < `10`}",
    "do": [
        ...
    ]
}

Note: Condition is checked against destination state. Use do_while: true to execute body at least once before checking condition.

Loop control: Use $break to exit the loop early or $continue to skip the rest of the current iteration and re-evaluate the condition (see Loop Control Flow).


if

Conditional execution.

Path mode:

{
    "op": "if",
    "path": "/check",
    "equals": "value",
    // Optional
    "exists": true,
    // Optional
    "then": [
        ...
    ],
    // Success branch
    "else": [
        ...
    ]
    // Failure branch
}

Expression mode:

{
    "op": "if",
    "cond": "${?source.age >= `18`}",
    "then": [
        ...
    ]
}

exec

Execute nested script.

From source:

{
    "op": "exec",
    "from": "/script",
    "merge": false
    // Replace dest (default) or merge into it
}

Inline:

{
    "op": "exec",
    "actions": [
        ...
    ]
}

update

Merge mapping into target.

{
    "op": "update",
    "path": "/obj",
    "value": {
        "b": 2
    },
    // Or "from": "/source/obj"
    "deep": false
    // Recursive merge (default: false)
}

distinct

Remove duplicates from list.

{
    "op": "distinct",
    "path": "/items",
    "key": "/id"
    // Optional: compare by nested field
}

assert

Assert value existence/equality.

Basic usage:

{
    "op": "assert",
    // Check source
    "path": "/required",
    "equals": "value"
    // Optional
}

With direct value:

{
    "op": "assert",
    "value": "${?source.computed}",
    // Check computed value instead of path
    "equals": "expected"
}

With return mode:

{
    "op": "assert",
    "path": "/optional",
    "return": true,
    // Return value instead of raising error
    "to_path": "/result"
    // Optional: write result to destination
}
  • return: true — returns value on success, false on failure (instead of raising error)
  • to_path — destination path for return value
  • value — alternative to path, checks direct value

try

Execute actions with error handling (try-except-finally pattern).

Basic try-except:

{
    "op": "try",
    "do": [
        {"op": "copy", "from": "/might_not_exist", "path": "/result"}
    ],
    "except": [
        {"/error": "Failed to copy value"}
    ]
}

Access error information:

Inside the except block, error info is available via the &: prefix:

{
    "op": "try",
    "do": [
        {"$raise": "Something went wrong"}
    ],
    "except": [
        {"/error_message": "${&:/_error_message}"},
        {"/error_type": "${&:/_error_type}"}
    ]
}

With finally cleanup:

{
    "op": "try",
    "do": [
        {"/status": "processing"},
        {"op": "exec", "from": "/dangerous_operation"}
    ],
    "except": [
        {"/status": "error"},
        {"/error_msg": "${&:/_error_message}"}
    ],
    "finally": [
        {"/processed_at": "2024-01-01"},
        {"/cleanup": true}
    ]
}

Behavior:

  • Executes actions in do block
  • If error occurs:
    • Error info stored in temp_read_only (_error_type, _error_message)
    • If except block provided, executes it with error info accessible via &:/ prefix
    • If no except, re-raises error after executing finally (if present)
  • finally block always executes (even on error)
  • Control flow signals ($break, $continue, $return) are not caught by except — they propagate through the try block; finally still runs before they propagate

Error info in except:

  • &:/_error_type — error class name (e.g., "JPermError")
  • &:/_error_message — error message string

Example: Validation with fallback

spec = {
    "op": "try",
    "do": [
        {"/age": {"$cast": {"value": "${/user_input}", "type": "int"}}},
        {
            "op": "if",
            "cond": {"$lt": [{"$ref": "@:/age"}, 0]},
            "then": [{"$raise": "Age cannot be negative"}]
        },
        {"/valid": True}
    ],
    "except": [
        {"/valid": False},
        {"/error": "${&:/_error_message}"}
    ]
}

result = engine.apply(spec, source={"user_input": "-5"}, dest={})
# → {"age": -5, "valid": False, "error": "Age cannot be negative"}

Extending J-Perm

Custom Operations

Create a new ActionHandler and register it:

from j_perm import ActionHandler, ActionNode, OpMatcher, ExecutionContext


class MyOpHandler(ActionHandler):
    def execute(self, step, ctx: ExecutionContext):
        # Your logic here
        return ctx.dest


# Register in main registry (in build_default_engine or custom factory)
registry.register(ActionNode(
    name="my_op",
    priority=10,
    matcher=OpMatcher("my_op"),
    handler=MyOpHandler(),
))

Custom Special Constructs

Add a SpecialFn to the specials dict:

def my_special(node, ctx):
    value = ctx.engine.process_value(node["$mySpecial"], ctx)
    return value.upper()


engine = build_default_engine(specials={
    "$ref": ref_handler,
    "$eval": eval_handler,
    "$mySpecial": my_special,
})

Custom Stages

Create a StageProcessor for batch preprocessing:

from j_perm import StageProcessor, StageNode, StageRegistry


class ValidateStage(StageProcessor):
    def apply(self, steps, ctx):
        # Validate/transform steps
        return steps


# Register in main pipeline stages
stages = build_default_shorthand_stages()
stages.register(StageNode(
    name="validate",
    priority=200,  # Higher = runs earlier
    processor=ValidateStage(),
))

# Use in custom engine
main_pipeline = Pipeline(stages=stages, registry=main_registry)

Custom Casters

Casters are used in both template syntax (${type:...}) and the $cast construct.

Provide custom casters via build_default_engine:

import json
from j_perm import build_default_engine

custom_casters = {
    "int": lambda x: int(x),
    "float": lambda x: float(x),
    "json": lambda x: json.loads(x),
    "upper": lambda x: str(x).upper(),
}

engine = build_default_engine(casters=custom_casters)

# Now you can use them in both ways:
spec = [
    {"/age": "${int:/raw_age}"},              # Template syntax
    {"/data": {"$cast": {"value": "{}", "type": "json"}}},  # $cast construct
    {"/name": {"$cast": {"value": "alice", "type": "upper"}}},
]

Or use the default built-in casters: int, float, bool, str.


Using Construct Groups

J-Perm provides pre-organized groups of construct handlers for convenient registration:

from j_perm import build_default_engine
from j_perm import (
    CORE_HANDLERS,                # $ref, $eval
    LOGICAL_HANDLERS,             # $and, $or, $not
    COMPARISON_HANDLERS,          # $gt, $gte, $lt, $lte, $eq, $ne, $in, $exists
    MATH_HANDLERS,                # $add, $sub, $mul, $div, $pow, $mod (with default limits)
    STRING_HANDLERS,              # All string operations (11 constructs, with default limits)
    REGEX_HANDLERS,               # All regex operations (5 constructs, with default limits)
    get_all_handlers,             # Function to get all handlers with casters
    get_all_handlers_with_limits, # Function to get all handlers with custom limits
)

# Build engine with specific groups only
engine = build_default_engine(specials={
    **CORE_HANDLERS,
    **STRING_HANDLERS,
    **REGEX_HANDLERS,
})

# Or extend default engine with additional handlers
from j_perm.casters import BUILTIN_CASTERS

all_handlers = get_all_handlers(casters=BUILTIN_CASTERS)
engine = build_default_engine(specials={
    **all_handlers,
    "$custom": my_custom_handler,
})

# Import individual handlers
from j_perm import (
    str_split_handler,
    str_join_handler,
    regex_match_handler,
    add_handler,
    exists_handler,
    # ... etc
)

Available groups:

  • CORE_HANDLERS — Core constructs ($ref, $eval)
  • LOGICAL_HANDLERS — Logical operators ($and, $or, $not)
  • COMPARISON_HANDLERS — Comparison operators ($gt, $gte, $lt, $lte, $eq, $ne, $in, $exists)
  • MATH_HANDLERS — Mathematical operators with default limits ($add, $sub, $mul, $div, $pow, $mod)
  • STRING_HANDLERS — String operations with default limits (11 constructs)
  • REGEX_HANDLERS — Regular expression operations with default limits (5 constructs)
  • ALL_HANDLERS_NO_CAST — All handlers except $cast
  • get_all_handlers(casters) — Function returning all handlers including $cast (with default limits)
  • get_all_handlers_with_limits(casters, **limits) — Function returning all handlers with custom limits

Example with custom limits:

from j_perm import get_all_handlers_with_limits
from j_perm.casters import BUILTIN_CASTERS

# Build handlers with conservative limits
secure_handlers = get_all_handlers_with_limits(
    casters=BUILTIN_CASTERS,
    regex_timeout=1.0,
    pow_max_exponent=100,
    str_max_join_result=100_000,
    mul_max_string_result=100_000,
    add_max_string_result=1_000_000,
    sub_max_number_result=1e10,
)

engine = build_default_engine(specials=secure_handlers)

Custom Matchers

Implement ActionMatcher or StageMatcher:

from j_perm import ActionMatcher


class PrefixMatcher(ActionMatcher):
    def __init__(self, prefix):
        self.prefix = prefix

    def matches(self, step):
        return isinstance(step, dict) and
            step.get("op", "").startswith(self.prefix)

Async Support

J-Perm provides full support for asynchronous operations through parallel async infrastructure.

Overview

All core components have async counterparts that work seamlessly with Python's async/await:

  • Sync pipeline (engine.apply()) - for synchronous handlers
  • Async pipeline (engine.apply_async()) - for async handlers and I/O operations
  • Mixed mode - sync and async handlers can coexist in the same pipeline

Async Base Classes

from j_perm import AsyncActionHandler, AsyncStageProcessor, AsyncMiddleware

class AsyncHttpHandler(AsyncActionHandler):
    """Async handler for HTTP requests."""

    async def execute(self, step, ctx):
        url = await ctx.engine.process_value_async(step["url"], ctx)

        async with aiohttp.ClientSession() as session:
            async with session.get(url) as response:
                data = await response.json()
                ctx.dest["response"] = data

        return ctx.dest

Using Async Engine

import asyncio
from j_perm import build_default_engine, ActionNode, OpMatcher

# Build engine as usual
engine = build_default_engine()

# Register async handler
engine.main_pipeline.registry.register(ActionNode(
    name="http",
    priority=10,
    matcher=OpMatcher("http"),
    handler=AsyncHttpHandler(),
))

# Use async apply
async def main():
    spec = [
        {"op": "http", "url": "https://api.example.com/data"},
        {"/result": "${@:/response/value}"}
    ]

    result = await engine.apply_async(spec, source={}, dest={})
    print(result)

asyncio.run(main())

Async Methods

Method Description
engine.apply_async() Async version of apply()
engine.apply_to_context_async() Async version of apply_to_context()
engine.process_value_async() Async value stabilization
engine.run_pipeline_async() Run named pipeline asynchronously
pipeline.run_async() Async pipeline execution
registry.run_all_async() Async stage execution (for StageRegistry)

Mixing Sync and Async

The async pipeline automatically handles both sync and async components:

# Sync handler
class SyncSetHandler(ActionHandler):
    def execute(self, step, ctx):
        ctx.dest["sync"] = True
        return ctx.dest

# Async handler
class AsyncFetchHandler(AsyncActionHandler):
    async def execute(self, step, ctx):
        data = await fetch_data()  # async I/O
        ctx.dest["async"] = data
        return ctx.dest

# Both work in apply_async()
result = await engine.apply_async([
    {"op": "set", ...},      # sync handler
    {"op": "fetch", ...},    # async handler
], source={}, dest={})

When to Use Async

Use async handlers for:

  • Network I/O - HTTP requests, API calls, webhooks
  • Database operations - async DB queries
  • File I/O - async file reads/writes
  • External services - Cloud APIs, microservices
  • Concurrent operations - when you need to parallelize work

Sync handlers are fine for:

  • Pure transformations - data mapping, filtering
  • Simple operations - set, copy, delete
  • CPU-bound work - computations without I/O

Example: Async HTTP Handler

import aiohttp
from j_perm import AsyncActionHandler, ActionNode, OpMatcher

class HttpGetHandler(AsyncActionHandler):
    """Fetch data from HTTP endpoint."""

    async def execute(self, step, ctx):
        # Process URL with template support
        url = await ctx.engine.process_value_async(step["url"], ctx)
        headers = await ctx.engine.process_value_async(
            step.get("headers", {}), ctx
        )

        # Make async HTTP request
        async with aiohttp.ClientSession() as session:
            async with session.get(url, headers=headers) as response:
                response.raise_for_status()
                data = await response.json()

        # Write result to destination
        path = await ctx.engine.process_value_async(step["path"], ctx)
        ctx.engine.resolver.set(path, ctx.dest, data)

        return ctx.dest

# Register and use
engine.main_pipeline.registry.register(ActionNode(
    name="http_get",
    priority=10,
    matcher=OpMatcher("http_get"),
    handler=HttpGetHandler(),
))

# Usage
spec = {
    "op": "http_get",
    "url": "https://api.github.com/users/${/username}",
    "headers": {"Accept": "application/json"},
    "path": "/user_data"
}

result = await engine.apply_async(spec, source={"username": "octocat"}, dest={})

Async Stages and Middlewares

You can also create async stages and middlewares:

from j_perm import AsyncStageProcessor, AsyncMiddleware

class AsyncValidationStage(AsyncStageProcessor):
    """Async validation of steps."""

    async def apply(self, steps, ctx):
        # Async validation logic
        await validate_steps(steps)
        return steps

class AsyncLoggingMiddleware(AsyncMiddleware):
    """Log each step asynchronously."""

    name = "async_logger"
    priority = 10

    async def process(self, step, ctx):
        await log_step(step)  # async logging
        return step

Note: Stages and middlewares remain sync by default. Only use async versions when you have actual async I/O in preprocessing/middleware logic.


Advanced Topics

Value Stabilization Loop

When handlers call ctx.engine.process_value(value, ctx), the value pipeline runs repeatedly until:

  1. Output equals input (stable)
  2. value_max_depth iterations reached (default: 50)

This resolves nested templates and special constructs:

# Input: {"$ref": "/path_to_template"}
# Pass 1: {"$ref": ...} → "${/nested}"
# Pass 2: "${/nested}" → "final"
# Pass 3: "final" → "final" (stable ✓)

Hierarchical Registries

Both StageRegistry and ActionTypeRegistry support tree structures:

# Group related operations
math_registry = ActionTypeRegistry()
math_registry.register(ActionNode("add", 10, AddMatcher(), AddHandler()))
math_registry.register(ActionNode("sub", 10, SubMatcher(), SubHandler()))

# Mount as sub-tree
main_registry.register_group(
    "math",
    math_registry,
    matcher=OpMatcher("math"),
    priority=50,
)

Priority and Execution Order

Stages: All matching stages run in priority order (high → low).

Actions: First matching handler executes (unless exclusive=False).

Shorthands:

  1. AssertShorthandProcessor (100) — extracts ~assert
  2. DeleteShorthandProcessor (50) — extracts ~delete
  3. AssignShorthandProcessor (0) — fallback for all remaining keys

Unescape Rules

After value stabilization, registered UnescapeRule callables strip escape sequences:

from j_perm import UnescapeRule

# Built-in: template_unescape (strips $${ → ${, $$ → $)
# Registered at priority 0

# Add custom unescape
engine.unescape_rules.append(
    UnescapeRule(name="custom", priority=10, unescape=my_unescape_fn)
)

API Reference

Core Classes

from j_perm import (
    # Core infrastructure
    ExecutionContext,
    ValueResolver,
    ValueProcessor,
    Engine,
    Pipeline,

    # Stage system
    StageProcessor,
    AsyncStageProcessor,  # Async version
    StageMatcher,
    StageNode,
    StageRegistry,

    # Action system
    ActionHandler,
    AsyncActionHandler,  # Async version
    ActionMatcher,
    ActionNode,
    ActionTypeRegistry,

    # Middleware
    Middleware,
    AsyncMiddleware,  # Async version

    # Unescape
    UnescapeRule,
)

Handlers

from j_perm import (
    # Value handlers
    TemplMatcher,
    TemplSubstHandler,
    SpecialMatcher,
    SpecialResolveHandler,
    ContainerMatcher,
    RecursiveDescentHandler,
    IdentityHandler,

    # Special construct functions
    ref_handler,
    eval_handler,
    make_cast_handler,  # Factory for $cast handler
    and_handler,
    or_handler,
    not_handler,

    # Comparison operators
    gt_handler,
    gte_handler,
    lt_handler,
    lte_handler,
    eq_handler,
    ne_handler,
    in_handler,
    exists_handler,

    # Mathematical operators
    add_handler,
    make_add_handler,     # Factory with configurable limits
    sub_handler,
    make_sub_handler,     # Factory with configurable limits
    mul_handler,
    make_mul_handler,     # Factory with configurable limits
    div_handler,
    pow_handler,
    make_pow_handler,     # Factory with configurable limits
    mod_handler,

    # String operations
    str_split_handler,
    make_str_split_handler,    # Factory with configurable limits
    str_join_handler,
    make_str_join_handler,     # Factory with configurable limits
    str_slice_handler,
    str_upper_handler,
    str_lower_handler,
    str_strip_handler,
    str_lstrip_handler,
    str_rstrip_handler,
    str_replace_handler,
    make_str_replace_handler,  # Factory with configurable limits
    str_contains_handler,
    str_startswith_handler,
    str_endswith_handler,

    # Regex operations
    regex_match_handler,
    make_regex_match_handler,   # Factory with configurable limits
    regex_search_handler,
    make_regex_search_handler,  # Factory with configurable limits
    regex_findall_handler,
    make_regex_findall_handler, # Factory with configurable limits
    regex_replace_handler,
    make_regex_replace_handler, # Factory with configurable limits
    regex_groups_handler,
    make_regex_groups_handler,  # Factory with configurable limits

    # Function handlers
    DefMatcher,
    CallMatcher,
    DefHandler,
    CallHandler,
    RaiseMatcher,
    RaiseHandler,
    JPermError,
    ReturnMatcher,
    ReturnHandler,

    # Loop control flow handlers
    BreakMatcher,
    BreakHandler,
    ContinueMatcher,
    ContinueHandler,

    # Control flow signals (exceptions)
    BreakSignal,    # raised by $break
    ContinueSignal, # raised by $continue
    ReturnSignal,   # raised by $return (.value holds the return value)

    # Operation handlers
    SetHandler,
    CopyHandler,
    DeleteHandler,
    ForeachHandler,
    WhileHandler,
    IfHandler,
    ExecHandler,
    UpdateHandler,
    DistinctHandler,
    AssertHandler,
    TryHandler,
)

Utilities

from j_perm import (
    # Matchers
    OpMatcher,
    AlwaysMatcher,

    # Resolver
    PointerResolver,

    # Processor
    PointerProcessor,

    # Casters
    BUILTIN_CASTERS,  # Built-in type casters (int, float, bool, str)

    # Shorthand stages
    AssertShorthandProcessor,
    DeleteShorthandProcessor,
    AssignShorthandProcessor,

    # Factory
    build_default_engine,
    build_default_shorthand_stages,
)

Examples

Example 1: Data Filtering with foreach

spec = {
    "op": "foreach",
    "in": "/products",
    "as": "item",
    "do": {
        "op": "if",
        "cond": "${?args.item.price < `100`}",
        "then": {"/affordable[]": "&:/item"}
    }
}

Example 2: Conditional Copy with Default

spec = {
    "/result": {
        "$ref": "/maybe_missing",
        "$default": "not found"
    }
}

Example 3: Nested Evaluation

spec = {
    "/computed": {
        "$eval": [
            {"op": "set", "path": "/x", "value": "${int:/a}"},
            {"op": "set", "path": "/y", "value": "${int:/b}"}
        ],
        "$select": "${?add(dest.x, dest.y)}"
    }
}

Example 4: Mixed Shorthands

spec = [
    {"~assert": {"/user/id": 123}},
    {"~delete": "/temp"},
    {"/output": "/user/name"}
]

Example 5: Functions with Parameters and Error Handling

spec = [
    {
        "$def": "validateAge",
        "params": ["age"],
        "body": [
            {
                "op": "if",
                "cond": {"$lt": [{"$ref": "&:/age"}, 0]},
                "then": [{"$raise": "Age cannot be negative: ${&:/age}"}]
            },
            {"/valid": True}
        ],
        "return": "/valid",
        "on_failure": [
            {"/validation_failed": True},
            {"/last_error": "Validation error occurred"}
        ]
    },
    {"/user_age_valid": {"$func": "validateAge", "args": [25]}},
]

Example 6: try-except with Error Info

spec = {
    "op": "try",
    "do": [
        {"/age": {"$cast": {"value": "${/user_input}", "type": "int"}}},
        {
            "op": "if",
            "cond": {"$lt": [{"$ref": "@:/age"}, 0]},
            "then": [{"$raise": "Age cannot be negative"}]
        },
        {"/valid": True}
    ],
    "except": [
        {"/valid": False},
        {"/error": "${&:/_error_message}"}
    ]
}

result = engine.apply(spec, source={"user_input": "-5"}, dest={})
# → {"age": -5, "valid": False, "error": "Age cannot be negative"}

Example 7: Loop Control with $break and $continue

# Collect items until sentinel, skipping nulls
spec = {
    "op": "foreach",
    "in": "/stream",
    "as": "item",
    "do": [
        {
            "op": "if",
            "cond": {"$eq": [{"$ref": "&:/item"}, None]},
            "then": [{"$continue": None}],  # skip null
        },
        {
            "op": "if",
            "cond": {"$eq": [{"$ref": "&:/item"}, "END"]},
            "then": [{"$break": None}],     # stop at sentinel
        },
        {"/result[]": "&:/item"},
    ],
}

result = engine.apply(
    spec,
    source={"stream": ["a", None, "b", "END", "c"]},
    dest={"result": []},
)
# → {"result": ["a", "b"]}

Example 8: Early Return from a Function

spec = [
    {
        "$def": "first_positive",
        "params": ["nums"],
        "body": [
            {
                "op": "foreach",
                "in": "&:/nums",
                "as": "n",
                "do": [
                    {
                        "op": "if",
                        "cond": {"$gt": [{"$ref": "&:/n"}, 0]},
                        "then": [{"$return": {"$ref": "&:/n"}}],
                    },
                ],
            },
            {"$return": None},
        ],
    },
    {"/result": {"$func": "first_positive", "args": [[-3, -1, 0, 5, 8]]}},
]

result = engine.apply(spec, source={}, dest={})
# → {"result": 5}

Example 9: $exists for Optional Fields

spec = [
    {
        "op": "if",
        "cond": {"$exists": "/user/middle_name"},
        "then": [{"/display": "${/user/first_name} ${/user/middle_name} ${/user/last_name}"}],
        "else": [{"/display": "${/user/first_name} ${/user/last_name}"}],
    }
]

License

MIT (or adapt to your project as needed)


Contributing

Issues and pull requests welcome!

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

j_perm-1.5.0.tar.gz (96.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

j_perm-1.5.0-py3-none-any.whl (70.1 kB view details)

Uploaded Python 3

File details

Details for the file j_perm-1.5.0.tar.gz.

File metadata

  • Download URL: j_perm-1.5.0.tar.gz
  • Upload date:
  • Size: 96.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.0

File hashes

Hashes for j_perm-1.5.0.tar.gz
Algorithm Hash digest
SHA256 9dc120d594efc34e488123f3d777630d7027ab68eb3921ac655b7e3af71f33cb
MD5 3abf1b06d1314d2ed05186fe6cb46daa
BLAKE2b-256 90669a12cdbdd77cd76a1fe187e3f6ac951d669850b6b25f840a9a6343332c09

See more details on using hashes here.

File details

Details for the file j_perm-1.5.0-py3-none-any.whl.

File metadata

  • Download URL: j_perm-1.5.0-py3-none-any.whl
  • Upload date:
  • Size: 70.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.0

File hashes

Hashes for j_perm-1.5.0-py3-none-any.whl
Algorithm Hash digest
SHA256 8d4099f2a1bc15b04b3ae58c3b53b5a6945dcc3606a72c3af9415a633f02ed00
MD5 7ba77d8ea525bbf3c4bf6c03efc9c239
BLAKE2b-256 d0f5a8217d0682420fee1c31ba39462c5bf4ba13281463a6c0c515f92010c266

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page