
A lightweight Python package providing shelve-like persistent storage with zstd compression, SQLite transactions, multithreading support, and flexible serialization options.

Project description

Introduction


This package functions similarly to Python’s built-in shelve but offers additional features such as:

  • Zstandard (zstd) compression for efficient storage ✅ DONE
  • SQLite-backed transactions to ensure data integrity ⚠️ TODO
  • Support for multiple serialization formats: JSON, Pickle, and Pydantic models ✅ DONE

📊 Benchmark

Benchmark functions are defined in tests/test_shelve.py.
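
The real benchmarks use pytest-benchmark; as a rough illustration of what they measure, here is a minimal, self-contained timing sketch (the `bench_writes` helper and the use of a plain dict as a stand-in store are assumptions for illustration, not code from tests/test_shelve.py):

```python
import time

def bench_writes(store, n=1000):
    """Time n key-value writes against any dict-like store
    (an opened shelve or shelvez database, or a plain dict)."""
    start = time.perf_counter()
    for i in range(n):
        store[f"key-{i}"] = f"value-{i}"
    return (time.perf_counter() - start) * 1000  # elapsed milliseconds

# A plain dict stands in for an opened database here.
elapsed_ms = bench_writes({}, n=1000)
```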

🔥 Performance Comparison

| Test Case | Min (ms) | Max (ms) | Mean (ms) | StdDev (ms) | Median (ms) | OPS (ops/sec) | Rounds |
|---|---:|---:|---:|---:|---:|---:|---:|
| shelve_speed | 443.62 | 459.37 | 450.19 | 5.96 | 450.10 | 2.22 | 5 |
| shelvez_pickle_speed | 237.18 | 243.08 | 240.01 | 2.54 | 239.53 | 4.17 | 5 |
| shelvez_pydantic_speed | 245.33 | 318.90 | 263.59 | 31.14 | 252.38 | 3.79 | 5 |
| shelvez_json_speed | 246.83 | 250.72 | 249.37 | 1.57 | 249.44 | 4.01 | 5 |

OPS = Operations Per Second (calculated as 1 / mean time in seconds)


🗂️ Database Size Comparison

File sizes are measured after writing the same number of key-value pairs using each backend.

| Backend | File Size |
|---|---:|
| shelve | 380.00 kB |
| shelvez (Pickle) | 312.00 kB |
| shelvez (JSON) | 312.00 kB |
| shelvez (Pydantic) | 308.00 kB |

Smaller database files and faster write performance make shelvez a more efficient alternative to the standard shelve module.


Installation

pip install shelvez

Base Usage

import shelve_sqlite_zstd as shelve

db = shelve.open("any_db_path/your_db.db")
db["key"] = "value"
print(db["key"])
db.close()

Serialization (default is Pickle)

The default serialization method is Pickle, with the pickled data further compressed by zstd. For specific data types, you can also choose other serialization methods to achieve better version compatibility and a smaller storage footprint.
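
Conceptually, the default pipeline serializes the object with Pickle and then compresses the resulting bytes. A rough stdlib-only sketch of that idea (using zlib as a stand-in for zstd, which is not in the standard library; the function names here are illustrative, not shelvez's API):

```python
import pickle
import zlib  # stand-in for zstd, purely for illustration

def default_serialize(obj) -> bytes:
    # Pickle first, then compress the pickled bytes.
    return zlib.compress(pickle.dumps(obj))

def default_unserialize(data: bytes):
    # Decompress, then unpickle.
    return pickle.loads(zlib.decompress(data))

blob = default_serialize({"nested": [1, 2, 3]})
restored = default_unserialize(blob)  # round-trips the original object
```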


with JSON-serializable dicts

import shelve_sqlite_zstd as shelve

# Use the JSON serializer
serializer = shelve.serializer.JsonSerializer()
db = shelve.open("any_db_path/your_db.db", serializer=serializer)

db["key"] = {"key": "value"}

with a Pydantic model

from pydantic import BaseModel
import shelve_sqlite_zstd as shelve

class MyModel(BaseModel):
    value: str

# Use the Pydantic serializer
serializer = shelve.serializer.PydanticSerializer(MyModel)
db = shelve.open("any_db_path/your_db.db", serializer=serializer)

db["key"] = MyModel(value="value")

with a Custom Serializer

To implement your own serialization method, you need to create a subclass of serializer.BaseSerializer and override the following two methods:

  1. serialize(self, obj) -> bytes: This method takes a Python object (obj) and returns its serialized form as bytes. Implement this method with your custom serialization logic.
  2. unserialize(self, data: bytes): This method takes the serialized bytes (data) and returns the original Python object by deserializing it.

Here is a template example:

from shelve_sqlite_zstd import serializer

class CustomSerializer(serializer.BaseSerializer):
    def serialize(self, obj) -> bytes:
        # Implement custom serialization logic here
        # Convert `obj` to bytes
        pass

    def unserialize(self, data: bytes):
        # Implement custom deserialization logic here
        # Convert bytes back to original object
        pass
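
As a concrete illustration, here is a minimal serializer for plain strings. The `BaseSerializer` stub below mirrors the two-method interface described above so the sketch runs standalone; in real use you would subclass `shelve_sqlite_zstd.serializer.BaseSerializer` instead, and the `Utf8StringSerializer` name is hypothetical:

```python
class BaseSerializer:
    # Stand-in for serializer.BaseSerializer, for illustration only.
    def serialize(self, obj) -> bytes:
        raise NotImplementedError

    def unserialize(self, data: bytes):
        raise NotImplementedError

class Utf8StringSerializer(BaseSerializer):
    def serialize(self, obj) -> bytes:
        # Encode the string as UTF-8 bytes.
        return obj.encode("utf-8")

    def unserialize(self, data: bytes):
        # Decode the bytes back into a string.
        return data.decode("utf-8")

s = Utf8StringSerializer()
restored = s.unserialize(s.serialize("héllo"))  # round-trips the string
```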

Using zstd Compression Dictionary

After accumulating a sufficient amount of data, you can generate a custom zstd compression dictionary for your database by calling db.dict.optimize_database(). This function will also recompress the existing data using the newly created dictionary. When stored data shares similar structures or formats, a personalized zstd dictionary can greatly enhance compression efficiency, particularly for relatively small datasets. Typically, generating the dictionary after storing a few thousand samples yields good results.

⚠️ Warning: During the optimization process, do not perform any other read or write operations on the database to prevent data corruption or inconsistent states.

import shelve_sqlite_zstd as shelve

db = shelve.open("any_db_path/your_db.db")
db["key"] = "value"
# ... store more data as needed

# Generate and apply a custom zstd compression dictionary
db.dict.optimize_database()

db.close()
