Skip to main content

A Python package named modict

Project description

modict - The Swiss Army Knife of Python Data Structures

Python 3.10+ License: MIT

modict is a sophisticated, hybrid data structure that combines the simplicity of Python dictionaries with the power of dataclasses and the robustness and runtime typechecking capabilities of Pydantic models. It's designed to be the versatile tool you'll want to use in every project for handling structured data.

๐ŸŽฏ Philosophy & Goals

modict bridges the gap between different Python data paradigms:

  • ๐Ÿ“š Dict-like: Native dictionary inheritance with full compatibility - modicts ARE dicts!
  • ๐Ÿ—๏ธ Dataclass-like: Type annotations and structured field definitions
  • ๐Ÿ›ก๏ธ Pydantic-like: Runtime validation, type coercion, custom validators, and computed properties
  • ๐Ÿ”ง Developer-friendly: Intuitive API that "just works" for common patterns
  • 100% standard library - No external dependencies, all is coded from scratch including the typechecker and coercion engine

Why modict?

# Traditional approaches require choosing between flexibility and structure
data = {"name": "Alice", "age": 30}           # Dict: flexible but unstructured

@dataclass
class User: name: str; age: int               # Dataclass: structured but rigid

class User(BaseModel): name: str; age: int    # Pydantic: powerful but heavy

# modict: Best of all worlds
class User(modict):
    name: str
    age: int = 25

user = User(name="Alice")                   # โœ… Structured
user.age                                    # 25 โœ… Default value
user.email = "alice@email.com"              # โœ… Flexible  
user['phone'] = "123-456-7890"              # โœ… Dict-compatible
isinstance(user,dict)                       # True (still a dict!)

๐Ÿš€ Key Features

Core Capabilities

  • Full dict inheritance - All native dict methods work seamlessly.
  • Attribute-style access - obj.key and obj['key'] both work
  • Type annotations - Optional runtime validation with a powerful type validation and coercion system
  • Recursive conversion
    • Explicit: modict.convert() / .to_modict() for full deep conversion
    • Automatic: auto_convert=True (default) converts nested dicts to modict on first access
  • JSON-first design - Built-in JSON serialization/deserialization
  • JSONPath support (RFC 9535) - Unambiguous nested access with JSONPath strings, tuple paths, or Path objects
    • Disambiguates array indices [0] from string keys ['0']
    • Path objects preserve container type metadata for round-trip conversion

Advanced Features

  • Computed properties - Dynamic values with dependency tracking
  • Custom validators - Field-level validation and transformation
  • Type coercion - Intelligent type conversion system
  • Deep operations - Merge, diff, walk through nested structures
  • Field extraction - Select/exclude keys with simple methods

๐Ÿ“ฆ Installation

pip install modict

๐Ÿƒโ€โ™‚๏ธ Quick Start

Basic Usage

from modict import modict

# Create from dict or keyword arguments
user = modict({"name": "Alice", "age": 30})
user = modict(name="Alice", age=30)

# Attribute and dict-style access
print(user.name)        # "Alice"
print(user['age'])      # 30

# Add new fields dynamically
user.email = "alice@email.com"
user['phone'] = "123-456-7890"

Structured Classes

from modict import modict
from typing import List, Optional

class User(modict):
    name: str
    age: int = 25
    email: Optional[str] = None
    tags: List[str] = modict.factory(list)  # Factory for mutable defaults

# Type-safe creation
user = User(name="Bob", age=35)
print(user.age)         # 35
print(user.tags)        # []

Nested Structures & JSONPath Access

modict supports JSONPath (RFC 9535) for unambiguous nested access:

# Automatic recursive conversion
data = modict({
    "users": [
        {"name": "Alice", "profile": {"city": "Paris"}},
        {"name": "Bob", "profile": {"city": "Lyon"}}
    ],
    "settings": {"theme": "dark"}
})

# JSONPath-based access (RFC 9535)
print(data.get_nested("$.users[0].name"))              # "Alice"
data.set_nested("$.users[0].profile.country", "France")
print(data.has_nested("$.settings.theme"))             # True

# JSONPath disambiguates array indices from string keys
data.set_nested("$.config['0'].value", "string key")   # String key '0'
data.set_nested("$.items[0].value", "array index")     # Array index 0

# Alternative: tuple paths (auto-converted to Path objects internally)
print(data.get_nested(("users", 0, "name")))           # "Alice"

# Chained attribute access also works
# (Only if auto_convert=True (default) - see below about config)
print(data.users[0].profile.city)                      # "Paris"

Path Objects: Methods like walk() return Path objects that preserve container type information (mapping vs sequence), enabling proper round-trip conversion and disambiguation between integer keys and array indices.

๐Ÿ’ซ Advanced Features

Computed Properties

class Calculator(modict):
    a: float = 0
    b: float = 0
    
    @modict.computed(cache=True, deps=['a', 'b'])
    def sum_ab(self):
        print("Computing sum...")
        return self.a + self.b
    
    @modict.computed(cache=True, deps=['sum_ab'])  # Cascading dependencies
    def doubled_sum(self):
        return self.sum_ab * 2

calc = Calculator(a=10, b=20)
print(calc.sum_ab)         # "Computing sum..." โ†’ 30
print(calc.sum_ab)         # 30 (cached)
calc.a = 15                # Invalidates cache automatically
print(calc.sum_ab)         # "Computing sum..." โ†’ 35
print(calc.doubled_sum)    # 70

Custom Validators

class Profile(modict):
    email: str
    age: int
    
    @modict.check('email')
    def validate_email(self, value):
        """Clean and validate email addresses"""
        email = value.lower().strip()
        if '@' not in email:
            raise ValueError("Invalid email format")
        return email
    
    @modict.check('age')  
    def validate_age(self, value):
        """Ensure age is reasonable"""
        age = int(value)
        if age < 0 or age > 150:
            raise ValueError("Invalid age range")
        return age

profile = Profile(email="  ALICE@EMAIL.COM  ", age="30")
print(profile.email)  # "alice@email.com" (cleaned)
print(profile.age)    # 30 (converted to int)

Deep Operations

# Deep merging
network_config = modict({"db": {"host": "localhost", "port": 5432}})
overrides = {"db": {"port": 3306, "ssl": True}}
network_config.merge(overrides)
# Result: {"db": {"host": "localhost", "port": 3306, "ssl": True}}

# Walking through nested structures (returns Path objects)
data = modict({"users": [{"name": "Alice"}, {"name": "Bob"}]})
for path, value in data.walk():
    print(f"{path}: {value}")
# Output (Path objects with JSONPath representation):
# $.users[0].name: Alice
# $.users[1].name: Bob

# Flattened view (Dict[Path, Any])
flat = data.walked()
# {Path($.users[0].name): "Alice", Path($.users[1].name): "Bob"}
# Path objects can be converted to strings: str(path) or path.to_jsonpath()

JSONPath & Path Objects

modict uses JSONPath (RFC 9535) to provide unambiguous access to nested structures:

from modict import modict

data = modict({
    "items": [{"id": 1}, {"id": 2}],
    "config": {"0": "string key", 1: "int key"}
})

# Array index (integer in bracket notation)
data.get_nested("$.items[0].id")        # 1 - accesses items[0]

# String key (quoted in bracket notation)
data.get_nested("$.config['0']")        # "string key" - accesses config['0']

# Note: Integer keys in dicts cannot be represented in JSONPath
# Use direct access: data.config[1] or data['config'][1]

Path Objects: All path-returning methods (walk(), walked(), diff()) now return Path objects instead of strings. Path objects:

  • Preserve container type information (mapping vs sequence)
  • Enable proper round-trip conversion
  • Support multiple representations:
from modict._collections_utils import Path

# Create Path objects
path1 = Path.from_jsonpath("$.users[0].name")
path2 = Path.from_tuple(('users', 0, 'name'))
path3 = Path.normalize("$.users[0].name")  # Accepts string, tuple, or Path

# Convert between representations
print(path1.to_jsonpath())     # "$.users[0].name"
print(path1.to_tuple())        # ('users', 0, 'name')
print(str(path1))              # "$.users[0].name"

# Path components preserve metadata
for component in path1.components:
    print(f"{component.value}: {component.container_type}")
# Output:
# users: mapping
# 0: sequence
# name: mapping

Breaking Change: In modict 0.2.0+, walk() and walked() return Path objects instead of strings. To convert back to strings:

# Old behavior (modict < 0.2.0)
for path_str, value in data.walk():
    print(f"{path_str}: {value}")  # path_str was a string

# New behavior (modict >= 0.2.0)
for path, value in data.walk():
    print(f"{path}: {value}")           # path is a Path object (str() is automatic)
    print(f"{path.to_jsonpath()}: {value}")  # Explicit JSONPath string

๐Ÿ› ๏ธ Configuration Options

The cassmethod modict.config allows you to customize the behavior of your modict subclass. It returns an modictConfig object (dataclass) that you may pass as the _config class variable or your modict.

class MyModict(modict):
    _config = modict.config(
        auto_convert=True,          # Auto-convert dicts to modicts in nested sub-containers (upon access)
        strict=False,               # Strict runtime type checking
        coerce=False,               # Enable automatic type coercion
        allow_extra=True,           # Disallow extra attributes
        enforce_json=False,         # Enforce JSON serializability of values
    )

auto_convert controls whether dicts found in nested mutable containers (MutableMappings, MutableSequence) are automatically converted to modict (if they aren't already) on first access. Note that MutableMappings that are NOT dicts won't be converted, but their content may if they are dicts.

Subclass configs are properly merged with parent class configs, also supporting multiple inheritance patterns (following MRO order).

class Parent(modict):
    _config = modict.config(strict=True, coerce=False)

class Child(Parent):
    _config = modict.config(coerce=True)  # strict=True, coerce=True (overrides Parent)

class A(modict):
    _config = modict.config(strict=True)
    a: int=1
    value: str="A"

class B(modict):
    _config = modict.config(strict=False, coerce=True)
    b: int=2
    value: str="B"

class C(A,B):
    _config = modict.config(allow_extra=False) 
    # strict=True from A (A overrides B, since A follows B in MRO), 
    # coerce=True from B
    # allow_extra=False from C

c = C()
print(c.a) # 1
print(c.b) # 2
print(c.value) # "A" (A overrides B)
c.a = "3"
print(c.a) # 3 (coercion enabled)

try:
    c.a = "invalid" 
except Exception as e:
    print(e) # โŒ TypeError (strict mode enabled)

try:
    c.undefined = "value" 
except Exception as e:
    print(e) # โŒ KeyError (extra fields not allowed)

Example

class StrictConfig(modict):

    _config=modict.config(
        strict = True          # Enable runtime type checking
        allow_extra = False    # Disallow undefined fields
        coerce = True          # Enable type coercion
    )

    name: str
    count: int

config = StrictConfig(name="test", count=42)
# config.undefined = "value"    # โŒ KeyError (extra fields not allowed)
# config.count = "32"           # coerced to int (coercion enabled)
# config.count = "invalid"      # โŒ TypeError (can't be coerced, type checking raises an error)

๐Ÿ“„ JSON Integration

# JSON-enforced mode
class JSONConfig(modict):

    _config=modict.config(
        enforce_json=True
    )

# Built-in JSON support
config = JSONConfig.load("config.json")        # Load from file
config = JSONConfig.loads(json_string)         # Load from string

config.dump("output.json", indent=2)          # Save to file
json_str = config.dumps(indent=2)             # Convert to string

config.data = {1, 2, 3}   # โŒ ValueError (sets are not JSON-serializable)

๐ŸŽจ Field Utilities

user = modict(name="Alice", age=30, email="alice@email.com", phone="123-456")

# Extract specific fields
basic_info = user.extract('name', 'age')         # {"name": "Alice", "age": 30}

# Exclude sensitive fields  
public_info = user.exclude('email', 'phone')     # {"name": "Alice", "age": 30}

# Rename fields
user.rename(email='email_address')               # Changes key name

# Deep copy
backup = user.deepcopy()

๐Ÿ”„ Conversion & Compatibility

# let's turn auto-conversion off globally (affects all modict instances created after this change)
modict._config.auto_convert = False

data = {"user": {"name": "Alice"}, "count": 42}

safe_modict = modict(data)            # No auto-conversion
safe_modict.user.name                # โŒ AttributeError (user is still a dict)
safe_modict.user["name"]             # "Alice" (works with dict access)
isinstance(safe_modict.user, modict)  # False (it's a plain dict)
data["user"] is safe_modict.user     # True (same object)

modict_data = safe_modict.to_modict()  # Deep conversion (in-place on the structure)
isinstance(modict_data.user, modict)  # True (now it's a modict)
data["user"] is modict_data.user     # False: user has been converted to a new modict
modict_data.user.name                # โœ… "Alice" (user is now a modict)
dict_data = modict_data.to_dict()    # Back to plain dicts

# Factory method for clean conversion
converted = modict.convert(complex_nested_dict)
unconverted = modict.unconvert(converted)  # Back to plain dicts

โš ๏ธ Important Behaviors & Limitations

Descriptor Handling

modict distinguishes between definitions and assignments in class namespaces:

class MyModict(modict):
    # โœ… DEFINITIONS (stay as class methods)
    @classmethod
    def my_classmethod(cls):
        return "method behavior"
    
    @property  
    def my_property(self):
        return "property behavior"
    
    # โœ… ASSIGNMENTS (become dict fields)
    external_func = some_external_function        # Stored in dict
    external_cm = classmethod(external_function)  # Stored in dict (may be non-callable)

obj = MyModict()
obj.my_classmethod()     # โœ… Works (bound method)
obj.external_func("x")   # โœ… Works (raw function, no binding)
obj.external_cm("x")     # โŒ May fail ('classmethod' object not callable)

Principle: Syntax determines behavior

  • def/@decorator syntax โ†’ Class behavior (Python semantics)
  • = assignment syntax โ†’ Data storage (user responsibility)

Import Limitations

Imports inside class namespaces are treated as field assignments:

# โŒ PROBLEMATIC
class MyModict(modict):
    import json        # Becomes a 'json' field in the modict

    def method(self):
        return json.dumps(self)  # โŒ NameError: 'json' not defined

# โœ… RECOMMENDED  
import json
class MyModict(modict):
    # json accessible via module scope
    pass

This limitation rarely affects normal usage of modict as a data structure.

Memory Considerations

  • Validation overhead: Type checking and coercion add runtime cost
  • Computed properties: Cached values consume additional memory
  • Recursive conversion: Deep nesting may impact performance

๐Ÿ†š Comparison with Alternatives

Feature modict dict dataclass Pydantic
Dict compatibility โœ… Full โœ… Native โŒ No โŒ Limited
Attribute access โœ… Yes โŒ No โœ… Yes โœ… Yes
Type validation โœ… Optional โŒ No โŒ No โœ… Yes
Runtime flexibility โœ… High โœ… High โŒ Low โŒ Medium
Nested structures โœ… Auto โŒ Manual โŒ Manual โœ… Auto
JSON integration โœ… Built-in โŒ Manual โŒ Manual โœ… Built-in
Learning curve ๐ŸŸก Medium ๐ŸŸข Low ๐ŸŸข Low ๐Ÿ”ด High
Performance ๐ŸŸก Good ๐ŸŸข Excellent ๐ŸŸข Excellent ๐ŸŸก Good

๐Ÿ“– Public API Reference

Core class

  • modict(**kwargs | mapping): dict subclass with attribute access, defaults, validation
  • modict.config(auto_convert=True, allow_extra=True, strict=False, enforce_json=False, coerce=False): build a modictConfig for subclasses
  • modict.factory(callable): declare per-instance default factories for fields
  • modict.check(field_name): decorator to attach field validators
  • modict.computed(func=None, *, cache=False, deps=None): decorator/constructor for cached computed fields

Instance helpers

  • .to_modict(): deep-convert nested dicts into modict instances in place
  • .to_dict(): deep-convert modicts back to plain dicts (preserving sharing)
  • .get_nested(path, default=MISSING): fetch nested value via JSONPath string ("$.a[0].b"), tuple ('a', 0, 'b'), or Path object
  • .set_nested(path, value): assign nested value, creating missing levels (supports JSONPath, tuple, or Path)
  • .del_nested(path): delete a nested key/path (supports JSONPath, tuple, or Path)
  • .pop_nested(path, default=MISSING): pop a nested key/path (supports JSONPath, tuple, or Path)
  • .has_nested(path): check existence of a nested path (supports JSONPath, tuple, or Path)
  • .rename(mapping_or_kwargs): rename keys without touching values
  • .exclude(*keys): return a new modict excluding given keys
  • .extract(*keys): return a new modict with only given keys
  • .walk(callback=None, filter=None, excluded=None): iterate leaf paths/values as (Path, value) tuples with optional transform/filter
  • .walked(callback=None, filter=None): return Dict[Path, Any] of walked paths/values (Path objects as keys)
  • .merge(mapping): deep-merge another mapping into self
  • .diff(mapping): structural diff vs another mapping
  • .deep_equals(mapping): deep equality check vs another mapping
  • .deepcopy(): deep-copy preserving type
  • modict.unwalk(walked): reconstruct a nested structure from walked output
  • .dumps(**json_kwargs): serialize to JSON string using json.dumps
  • .dump(fp, **json_kwargs): serialize to JSON file-like/path using json.dump
  • modict.loads(json_str, **json_kwargs): classmethod JSON string loader returning modict
  • modict.load(fp, **json_kwargs): classmethod JSON file loader returning modict
  • Dict compatibility: keys()/values()/items() validated views; |/|= merge with validation; copy(), popitem(), setdefault() validated dict semantics

Class/staticmethods

  • modict.convert(obj): recursively upgrade nested dicts to modict, preserving shared references when possible
  • modict.unconvert(obj, seen=None): recursively downgrade modicts to plain dicts
  • modict.fromkeys(iterable, value=None): standard dict API

Type utilities (exported from modict)

  • check_type(hint, value): validate a value against a type hint (strict)
  • coerce(value, hint): coerce a value to the hinted type when possible
  • @typechecked: enforce annotated args/return types at call time on a callable
  • Exceptions: TypeCheckError, TypeCheckException, TypeCheckFailureError, TypeMismatchError, CoercionError
  • Helpers/metadata: Coercer, TypeChecker, __version__, __title__, __description__, __url__, __author__, __email__, __license__

๐Ÿง  Typechecker & Coercion

The bundled typechecker/coercion engine supports a broad slice of Python typing:

  • PEP 604 unions: int | str, list[int] | None
  • Typing constructs: List[T], Dict[K, V], Tuple[...], Set[...], Optional[...], Union[...], Literal[...], Callable[[...], R]
  • Protocols: runtime-checkable Protocol support (attribute presence and callability)
  • TypedDict: key/value checking against TypedDict definitions
  • NewType: treated as underlying type
  • Callables: signature arity/type checks, including Callable[..., R] and Callable[[A, B], R]
  • Generics / TypeVars: parameterized generics with type arguments (e.g., list[int], dict[str, float]) and robust TypeVar handling (propagated through nested containers/unions)
  • Custom classes: regular isinstance semantics, including subclass checks
  • Nested structures: deep validation/coercion of containers, preserving shared references
  • Coercion rules: best-effort conversions for numbers ("1"โ†’int), sequences/iterables to typed containers, tuple/list interchange when possible, dict-like sources to typed mappings, and unions (first matching branch wins)
  • Decorators: @typechecked enforces annotations on args/return at runtime
  • Config toggles: per-modict _config can enable strict (type errors), coerce (auto-coercion), and enforce_json (JSON-serializable only)

Limitations: structural protocols without @runtime_checkable, detailed variance, and advanced typing constructs (e.g., ParamSpec, TypeVar constraints) arenโ€™t enforced; coercion is best-effort and may leave values unchanged if no safe conversion is found.

๐Ÿค Contributing

We welcome contributions! Please see our Contributing Guide for details.

Development Setup

git clone https://github.com/your-username/modict.git
cd modict
pip install -e .[dev]
pytest

๐Ÿ“œ License

This project is licensed under the MIT License - see the LICENSE file for details.

๐Ÿ™ Acknowledgments

  • Inspired by the flexibility of Python dicts, the structure of dataclasses, and the power of Pydantic
  • Built with modern Python typing and metaclass techniques
  • Community feedback and real-world usage patterns

modict: Because data structures should be both powerful and pleasant to use ๐Ÿš€

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

modict-0.2.0.tar.gz (102.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

modict-0.2.0-py3-none-any.whl (107.5 kB view details)

Uploaded Python 3

File details

Details for the file modict-0.2.0.tar.gz.

File metadata

  • Download URL: modict-0.2.0.tar.gz
  • Upload date:
  • Size: 102.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.12

File hashes

Hashes for modict-0.2.0.tar.gz
Algorithm Hash digest
SHA256 4284ee4687e6a551b5b154461452ebb1e52632dd24b578d4a382c22b721d2f10
MD5 036e4daf898bdf6b869e9d1e914a9ac8
BLAKE2b-256 96fa0dd2caa1277c226f5b251e7d778826e2ece2654d8daee5e6fa6821bf6946

See more details on using hashes here.

File details

Details for the file modict-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: modict-0.2.0-py3-none-any.whl
  • Upload date:
  • Size: 107.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.12

File hashes

Hashes for modict-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 49aa09cddc0533c145f753ee363bbe1f041ced9d9f153fe28585c2a338d626e7
MD5 f50d63c7cce5923bdd4692f3102fdc6c
BLAKE2b-256 abb84d26d446dae88d9c34de6a89b06b79585886efe9c02084c8e2ca0d12e001

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page