Skip to main content

A lightweight key-value database written in Python, intended for use on the Internet Computer (IC)

Project description

IC Python DB

A lightweight key-value database with entity relationships and audit logging capabilities, intended for small to medium-sized applications running on the Internet Computer. Forked from kybra-simple-db.

Test on IC Test PyPI version Python 3.10 License

Features

  • Persistent Storage: Works with StableBTreeMap stable structure for persistent storage on your canister's stable memory so your data persists automatically across canister upgrades.
  • Entity-Relational Database: Create, read and write entities with OneToOne, OneToMany, ManyToOne, and ManyToMany relationships.
  • Entity Hooks: Intercept and control entity lifecycle events (create, modify, delete) with on_event hooks.
  • Access Control: Thread-safe context management for user identity tracking and ownership-based permissions.
  • Namespaces: Organize entities into namespaces to avoid type conflicts when you have multiple entities with the same class name.
  • Audit Logging: Track all changes to your data with created/updated timestamps and who created and updated each entity.

Installation

pip install ic-python-db

Quick Start

The database storage must be initialized before using IC Python DB. Here's an example of how to do it:

from basilisk import StableBTreeMap
from ic_python_db import Database

# Initialize storage and database
storage = StableBTreeMap[str, str](memory_id=1, max_key_size=100, max_value_size=1000)  # Use a unique memory ID for each storage instance
Database.init(db_storage=storage)

Read Basilisk's documentation for more information regarding StableBTreeMap and memory IDs.

Next, define your entities:

from ic_python_db import (
    Database, Entity, String, Integer,
    OneToOne, OneToMany, ManyToOne, ManyToMany, TimestampedMixin
)

class Person(Entity, TimestampedMixin):
    __alias__ = "name"  # Use `name` as the alias field for lookup
    name = String(min_length=2, max_length=50)
    age = Integer(min_value=0, max_value=120)
    friends = ManyToMany("Person", "friends")
    mother = ManyToOne("Person", "children")
    children = OneToMany("Person", "mother")
    spouse = OneToOne("Person", "spouse")

Entity Lookup

Entities can be retrieved using Entity[key] syntax with three different lookup modes:

# Create an entity
john = Person(name="John", age=30)

# Lookup by ID
Person[1]                  # Returns john (by _id)
Person["1"]                # Also works with string ID

# Lookup by alias (defined via __alias__)
Person["John"]             # Tries ID first, then alias field "name"

# Lookup by specific field (tuple syntax)
Person["name", "John"]     # Lookup by field "name" only
Syntax Behavior
Person[1] Lookup by _id
Person["John"] Try _id first, then __alias__ field
Person["name", "John"] Lookup by specific field name only

Then use the defined entities to store objects:

    # Create and save an object
    john = Person(name="John", age=30)

    # Update an object's property
    john.age = 33  # Type checking and validation happens automatically

    # use the `_id` property to load an entity with the [] operator
    Person(name="Peter")
    peter = Person["Peter"]

    # Delete an object
    peter.delete()

    # Create relationships
    alice = Person(name="Alice")
    eva = Person(name="Eva")
    john.mother = alice
    assert john in alice.children
    eva.friends = [alice]
    assert alice in eva.friends
    assert eva in alice.friends

    print(alice.serialize())  # Prints the dictionary representation of an object
    # Prints: {'timestamp_created': '2025-09-12 22:15:35.882', 'timestamp_updated': '2025-09-12 22:15:35.883', 'creator': 'system', 'updater': 'system', 'owner': 'system', '_type': 'Person', '_id': '3', 'name': 'Alice', 'age': None, 'children': '1', 'friends': '4'}

    assert Person.count() == 3
    assert Person.max_id() == 4
    assert Person.instances() == [john, alice, eva]

    # Cursor-based pagination
    assert Person.load_some(0, 2) == [john, alice]
    assert Person.load_some(2, 2) == [eva]

    # Retrieve database contents in JSON format
    print(Database.get_instance().dump_json(pretty=True))

    # Audit log
    audit_records = Database.get_instance().get_audit(id_from=0, id_to=5)
    pprint(audit_records['0'])
    ''' Prints:

    ['save',
    1744138342934,
    'Person@1',
    {'_id': '1',
    '_type': 'Person',
    'age': 30,
    'creator': 'system',
    'name': 'John',
    'owner': 'system',
    'timestamp_created': '2025-04-08 20:52:22.934',
    'timestamp_updated': '2025-04-08 20:52:22.934',
    'updater': 'system'}]

    '''

For more usage examples, see the tests.

Namespaces

Organize entities with the __namespace__ attribute to avoid type conflicts when you have the same class name in different modules:

# In app/models.py
class User(Entity):
    __namespace__ = "app"
    name = String()
    role = String()
# In admin/models.py  
class User(Entity):
    __namespace__ = "admin"
    name = String()
    permissions = String()
from app.models import User as AppUser
from admin.models import User as AdminUser

app_user = AppUser(name="Alice", role="developer")      # Stored as "app::User"
admin_user = AdminUser(name="Bob", permissions="all")     # Stored as "admin::User"

# Each namespace has isolated ID sequences and storage
assert app_user._id == "1"
assert admin_user._id == "1"

Entity Hooks

Intercept and control entity changes with the on_event hook:

from ic_python_db import Entity, String, ACTION_MODIFY

class User(Entity):
    name = String()
    email = String()
    
    @staticmethod
    def on_event(entity, field_name, old_value, new_value, action):
        # Validate email format
        if field_name == "email" and "@" not in new_value:
            return False, None  # Reject invalid email
        
        # Auto-capitalize names
        if field_name == "name":
            return True, new_value.upper()
        
        return True, new_value

user = User(name="alice", email="alice@example.com")
assert user.name == "ALICE"  # Auto-capitalized

See docs/HOOKS.md for more patterns.

Access Control

Thread-safe user context management with as_user():

from ic_python_db import Database, Entity, String, ACTION_MODIFY, ACTION_DELETE
from ic_python_db.mixins import TimestampedMixin
from ic_python_db.context import get_caller_id

class Document(Entity, TimestampedMixin):
    title = String()
    
    @staticmethod
    def on_event(entity, field_name, old_value, new_value, action):
        caller = get_caller_id()
        
        # Only owner can modify or delete
        if action in (ACTION_MODIFY, ACTION_DELETE):
            if entity._owner != caller:
                return False, None
        
        return True, new_value

db = Database.get_instance()

# Alice creates a document
with db.as_user("alice"):
    doc = Document(title="My Doc")  # Owner: alice

# Bob cannot modify Alice's document
with db.as_user("bob"):
    doc.title = "Hacked"  # Raises ValueError

See docs/ACCESS_CONTROL.md and examples/simple_access_control.py.

Type Hints

The library is fully typed (PEP 561 compliant). Type checkers and IDEs automatically infer property types:

class User(Entity):
    name = String()      # Inferred as str
    age = Integer()      # Inferred as int
    active = Boolean()   # Inferred as bool

user = User(name="Alice", age=30, active=True)
user.name   # IDE knows this is str
user.age    # IDE knows this is int

For stricter typing, you can add explicit annotations:

from typing import Optional

class User(Entity):
    name: str = String()
    age: int = Integer()
    profile: Optional["Profile"] = OneToOne("Profile", "user")

API Reference

  • Core: Database, Entity
  • Properties: String, Integer, Float, Boolean
  • Relationships: OneToOne, OneToMany, ManyToOne, ManyToMany
  • Mixins: TimestampedMixin (timestamps and ownership tracking)
  • Hooks: ACTION_CREATE, ACTION_MODIFY, ACTION_DELETE
  • Context: get_caller_id(), set_caller_id(), Database.as_user()

Development

Setup Development Environment

# Clone the repository
git clone https://github.com/smart-social-contracts/ic-python-db.git
cd ic-python-db

# Recommended setup
pyenv install 3.10.7
pyenv local 3.10.7
python -m venv venv
source venv/bin/activate

# Install development dependencies
pip install -r requirements-dev.txt

# Running tests
./run_linters.sh && (cd tests && ./run_test.sh && ./run_test_ic.sh)

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

License

MIT.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ic_python_db-0.7.4.tar.gz (29.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

ic_python_db-0.7.4-py3-none-any.whl (28.4 kB view details)

Uploaded Python 3

File details

Details for the file ic_python_db-0.7.4.tar.gz.

File metadata

  • Download URL: ic_python_db-0.7.4.tar.gz
  • Upload date:
  • Size: 29.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.10.19

File hashes

Hashes for ic_python_db-0.7.4.tar.gz
Algorithm Hash digest
SHA256 32ca823d486139a64cb43489408dbf95a7cf4513c0508d4a082abdd950e1b83b
MD5 d53224cfb3d021001417f552b255b966
BLAKE2b-256 f1b0ded24edec3832baa9f9f5a7dffd51ca6a106d0e82990d18c808b774ce62e

See more details on using hashes here.

File details

Details for the file ic_python_db-0.7.4-py3-none-any.whl.

File metadata

  • Download URL: ic_python_db-0.7.4-py3-none-any.whl
  • Upload date:
  • Size: 28.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.10.19

File hashes

Hashes for ic_python_db-0.7.4-py3-none-any.whl
Algorithm Hash digest
SHA256 ac721672311ac6fe1c4273624b6f9ae81ab346cd3a34c915f86a9a59738d9cd0
MD5 20f006ea54022807c01c7c5eeb731e4a
BLAKE2b-256 bb8210912c041b576d4e819da15563168c59708045a8e0f482d59269bcc5d702

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page