lindh-jsondb

JSON based document database

These details have not been verified by PyPI

Project links

Development Status
- 4 - Beta
Intended Audience
- Developers
License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Project description

https://travis-ci.org/eblade/jsonobject.svg?branch=master

jsondb

JSON Key-Value store in pure Python 3

Introduction

JSONDB is a library for Python 3 that provides the ability to run a very simplified CouchDB-like document database, a.k.a. a Key-Value store. The features include:

Hard disk storage of documents
In-memory storage of indexes
Map and reduce functions specified in Python directly
Any number of views per database
Views can be accessed with or without reducing them
Thread-safe (with locks per database)

Installation

You can pip (python 3) install this Github repository or a tag, like this:

$pip install https://github.com/eblade/jsondb/archive/0.2.tar.gz

This will also install blist which is used to get the views faster.

Examples

To create a new database (a table if you think in relation database terms):

>>> from lindh.jsondb import Database
>>> db = Database('/tmp/cars')
>>> db.clear() # for doctest purposes

This will create a folder /tmp/cars which will be used to store the documents (json files) and an ID counter.

To populated the database with some content you can use db.save(...). These documents will be given a unique id automatically. If you just want to retrieve them using indices, this is not a problem, but if you want control over the identifiers, you can do like this instead:

>>> db[0] = {'brand': 'Volvo', 'model': 'S40', 'wheels': 6}
>>> db[1] = {'brand': 'Mercedes', 'model': 'C', 'wheels': 8}
>>> db[2] = {'brand': 'Volvo', 'model': 'V70', 'wheels': 4}
>>> db[3] = {'brand': 'Honda', 'model': 'CB500F', 'wheels': 2}

This enables you to retrieve them back in the expected pythonic way.

The documents are stored synchronously, so your app may be restarted without data loss.

Let’s look at an interactive session to find out what the document looks like when it comes back:

>>> db[0] == {'wheels': 6, '_id': 0, '_rev': 0, 'brand': 'Volvo', 'model': 'S40'}
True

As you can see, the structure closely mimic that of CouchDB, with the _id and _rev fields. The _rev field is important to keep intact as updated requires it to be the latest (otherwise a lindh.jsondb.Conflict is raised). To update, it’s quite easy to use save (but index-based setting also works):

>>> db.save({'wheels': 6, '_id': 0, '_rev': 0, 'brand': 'Volvo', 'model': 'S40', 'color': 'white'}) == \
... {'wheels': 6, '_id': 0, '_rev': 1, 'brand': 'Volvo', 'model': 'S40', 'color': 'white'}
True

The _rev should change here, usually pop one number up (whereas CouchDB would return random hashes for each revision).

To delete a document you can simple use del db[key] or db.delete(key).

Views

What fun is a Key-Value store with no indexing? Not much!

>>> db.define('by_wheels', lambda o: (o['wheels'], ' '.join([o['brand'], o['model']])))
>>> list(db.view('by_wheels'))[0] == \
... {'id': 3, 'key': 2, 'value': 'Honda CB500F'}
True

So we defined a view called by_wheels where the number of wheels is used as key and a concatenation of brand and model is used as value. The view is always sorted so I know that the motorcycle will come out first. The rest of the order is somewhat arbitrary since a binary search tree is used to hold the index in memory.

Note that the index is available as soon as it is created. This is because the operation of defining an index is asynchronous. It does not matter if the view is defined before or after the documents are created, as the documents will be placed in the index ad hoc. They will also be deleted that way. This means, for performance:

Adding a document is O(log n)
Finding a document is O(log n)
Deleting a document is O(log n)

So this scales quite well as long as the index fits in memory (the actual documents do not need to fit in memory, however). By the nature of being a binary search tree, it is constantly sorted by key.

Now, this takes us to the sorting. To further mimic CouchDB, keys need to be sortable beyond the core functionality of python. Anything needs to be comparable with anything basically. Also, we need something to be smaller and bigger than everything else, respectively. These are None and any.

Lets revisit the by_wheels view, and take everything with equal to or more than 6 wheels (I know this is not accurate data).

>>> list(db.view('by_wheels', startkey=6, endkey=any)) == \
... [{'id': 0, 'key': 6, 'value': 'Volvo S40'},{'id': 1, 'key': 8, 'value': 'Mercedes C'}]
True

The reason to use list() here is because I’m always given a generator back.

Author

lindh.jsondb is written and maintained by Johan Egneblad <johan@egneblad.se>.

Project details

These details have not been verified by PyPI

Project links

Development Status
- 4 - Beta
Intended Audience
- Developers
License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Release history Release notifications | RSS feed

This version

0.3.0

Jul 29, 2019

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

lindh_jsondb-0.3.0-py3-none-any.whl (8.7 kB view details)

Uploaded Jul 29, 2019 Python 3

File details

Details for the file lindh_jsondb-0.3.0-py3-none-any.whl.

File metadata

Download URL: lindh_jsondb-0.3.0-py3-none-any.whl
Upload date: Jul 29, 2019
Size: 8.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.18.4 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.32.2 CPython/3.6.2

File hashes

Hashes for lindh_jsondb-0.3.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`f18a5d8e4e5700236fa5dd0e963065376f394cf13feb200f6db62136ac23597d`
MD5	`83fb3f0beeba3f922b025504082713a0`
BLAKE2b-256	`e6822872935c1890017b231fdd9b51995caf77ce7d4ba85830567adfb3dca377`

See more details on using hashes here.

lindh-jsondb 0.3.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

jsondb

Introduction

Installation

Examples

Views

More on Views

Further Reading

Author

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distributions

Built Distribution

File details

File metadata

File hashes