Skip to main content

More than a collection, less than a database

Project description

nanotable

PyPI - Version PyPI - Python Version codecov

Nanotable is meant to bridge the gap between simple collections, such as list and dict, and full-on database tables. It lets you store a set of objects, index it by several keys, and more! It's fast, memory-efficient, and well-tested. The project draws inspiration from littletable, but is a completely original implementation. Its goal is to avoid feature bloat and maintain performance on par with built-in collections.

There are several situations where you might want to use Nanotable:

  • When you'd otherwise use a dict from an object's field to the object itself. Nanotable does that for you, and also provides additional features, such as checking for the existence of an element with a simple in check; hanging the value of the key field with Table.rekey, or catching accidental changes to the key field automatically.

  • When you'd otherwise use a bidict. That's a great library in its own right, but Nanotable provides some additional functionality such as storing extra non-hashable metadata along your objects (also see the previous point).

  • When you'd otherwise use a database. Nanotable spares you the computational and mental overhead. You probably have already written your own domain-specific version of Nanotable at some point in your life -- now you can use a well-tested library instead.

Installation

pip install nanotable

To install with all extras features, instead use:

pip install nanotable[all]

Usage

A basic usage example is given below:

from nanotable import Table

table = Table(of_dicts=True)\
    .primary_index_on("name")\
    .index_on("phone")

table.add({"name": "John Doe", "phone": "123-456-7890", "age": 25})
table.add({"name": "Jane Doe", "phone": "987-654-3210", "age": 26})
table.add({"name": "Barrack Obama", "age": "idk"})

table.at["Jane Doe"]  # {"name": "Jane Doe", "phone": "987-654-3210", "age": 26}
table.by.name["John Doe"]  # Same as above
table.by.phone["987-654-3210"]  # {"name": "Jane Doe", "phone": "987-654-3210", "age": 26}

table.remove(table.by.name["Barrack Obama"])

You can store any kind of object in the table. Specify of_dicts=True or getfield_factory=getfield_item to use mappings (dict or anything with obj[key] item access); of_objects=True or getfield_factory=getfield_attr to use objects with attributes (obj.key access); or any function with the signature (obj: Any, key: str) -> Any | MISSING as getfield_factory. You can also specify of=MyType to have the table infer either of_objects or of_dicts based on the anticipated element type.

Check out the documentation for nanotable.Table to see all the methods supported by tables.

Typing

The library is fully type-annotated. To make use of this, at the bare minimum you can specify the type of the objects you want to store in the table:

table = Table[Person](of_objects=True)
# or
table = Table[dict[str, Any]](of_dicts=True)

To add static typing to your indexes, you need to define a type with all of them:

class MyIndexes(Protocol):
    name: UniqueIndex[Person, str]
    phone: UniqueIndex[Person, str]

# The first template parameter is the object type;
# The second template parameter is protocol for `by`;
# The third template parameter is the primary index type.
table = Table[Person, MyIndexes, UniqueIndex[Person, str]](of_objects=True)
table.primary_index_on("name", required=True)
table.index_on("phone")

Indexes

Indexes are used to provide fast and efficient lookup of elements by the value of one of the fields. All indexes must inherit from nanotable.Index. If you wish to implement your own, consult the documentation of that class to see which methods you need to implement.

Broadly, the interface of an index consists of:

  • register, which adds an element to the index
  • unregister, which removes an element from the index
  • get, which returns the element or elements corresponding to the key (the type of the result depends on the specific index, but it is always semantically equivalent to a collection of stored objects, and can be transformed to a list with result_items).
  • [] item lookup, which is a shortcut for get and the most frequent operation you will likely perform while using an index.
  • More utility methods, which you can find by exploring the index documentation.

Any index can be required or not, which is controlled by the required boolean parameter. A required index will raise an error when encounering an object with no value for its field. If an index is not required, it will simply ignore such objects. By default, None will be considered missing, but you may override this by setting none_means_empty=False, in which case None will be treated as a regular value.

An index has a name, which should correspond to the field it indexes. However, indexes rely on a customizable getfield function to extract the field value, which allows indexes on properties that would not be considered fields in a conventional way: for example, a tuple of several fields, or a nested field. In this case the name should convey the same information to a human, but it is important not to treat it as a source of truth. getfield is a function with the signature (obj: Any) -> Any | MISSING. When defining your own getfield, remember to return nanotable.MISSING instead of None or raising exceptions when the provided object does not have the required fields (for example when it's of the wrong type).

Any required=True index can be used as a primary index for a table, though a UniqueIndex (or one of its subclasses) is recommended.

Nanotable provides the following types of indexes out of the box:

  • nanotable.UniqueIndex. Requires that no two elements in the table share the same value for the index field. Lookups return the only element with the specified value, or raise a KeyError if there is none. The values of the indexed field must be hashable.

  • nanotable.MultiIndex. Supports duplicate values for the index field. Lookups return a list of all elements with a given value for the index field, including potentially an empty list. The values of the indexed field must be hashable.

With the sorted extra installed (pip install nanotable[sorted]), you will also have access to the following indexes:

  • nanotable.SortedUniqueIndex. Has the same requirements as UniqueIndex, but maintains elements in sorted order of their indexed field. Beside single-item lookups, provides efficient range lookups with get_range and [low:high]. The values of the indexed field must be hashable and comparable.

  • nanotable.SortedMultiIndex. Has the same requirements as MultiIndex, but maintains elements in sorted order of their indexed field. Beside list lookups, provides efficient range lookups with get_range and [low:high]. The values of the indexed field must be hashable and comparable.

Storage

Storage is what holds the elements of the table. In a sense, it simply abstracts a collection with a consistent interface. All storage implementations must inherit from nanotable.Storage. If you wish to implement your own, consult the documentation of that class and nanotable.WrapperStorage to see which methods you need to implement.

If your table uses a primary index, it does not need a storage and will use the index for that purpose. (Unlike a conventional database, since Python already stores objects by-reference, our indexes have access to the objects themselves rather than indexes to the storage). Note that this is the only difference between a primary and a regular index. If you want to use a custom storage, you do not need to (and cannot) specify a primary index.

Nanotable provides the following types of storage out of the box:

  • nanotable.ListStorage. Stores items in a list, prohibiting duplicates. Preserves insertion order. Linear time for mutation and presence checks.

  • nanotable.MultiListStorage. Stores items in a list but allows duplicates. Preserves insertion order. Linear time for mutation and presence checks.

  • nanotable.SetStorage. Stores items in a set, prohibiting duplicates. Does not preserve insertion order. O(1) time for mutation and presence checks. Requires objects to be hashable.

  • nanotable.OrderedSetStorage. Stores items in a dict with None-values, essentially emulating a set but making use of Python 3.6+ dict's ordered nature. Preserves insertion order. O(1) time for mutation and presence checks. Requires objects to be hashable.

  • nanotable.IndexViewStorage. Relies on some kind of index to store items. Semantics depend on index semantics, but for all built-in indexes, the performance is O(1) time for mutation and presence checks. This is used automatically when you define a primary index for a table.

Caveats

Indexed fields must be hashable, like with the built-in dict. This already imposes the restriction that they must be immutable (which is why you can't use, for example, a list as a dict key -- see here to learn why). With Nanotable, however, comes the additional restriction that the value of the indexed field itself mustn't be changed. For a dict this obviously isn't a concern since it stores keys and values separately, inaccessible to the user. Nanotable will try to detect this happening and warn you, but this unfortunately cannot be done reliably. If you wish to change an indexed field, the correct way to do that is to remove it from the table, change the field and re-add it. Nanotable provides a helper that does this for you:

with table.rekey(obj):
    obj.field = new_value

If a field is not indexed, this is unnecessary.

If you are certain that your code never modifies an indexed field of an object in a Table, you can disable the checks that issue the warning by setting nanotable.safety.disable_safety_checks to False. This provides a small performance improvement, with the downside that any potential bugs will be almost impossible to catch and will show up as subtly wrong results. It is recommended that you keep the safety checks on unless you know what you're doing.

Nanotable is also not thread-safe. When using a Table from multiple threads at once, use a synchronization primitive such as a threading.Lock to ensure that only one thread can interact with the table at a time. Multithreaded read-only access should theoretically be fine.

License

Nanotable is distributed under the terms of the MIT license. See the LICENSE.txt for details.

© 2026 abel1502

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nanotable-0.1.1.tar.gz (47.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

nanotable-0.1.1-py3-none-any.whl (24.1 kB view details)

Uploaded Python 3

File details

Details for the file nanotable-0.1.1.tar.gz.

File metadata

  • Download URL: nanotable-0.1.1.tar.gz
  • Upload date:
  • Size: 47.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.11.19 {"installer":{"name":"uv","version":"0.11.19","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for nanotable-0.1.1.tar.gz
Algorithm Hash digest
SHA256 0dd4158ed853a6d6bcdb77237bd30379a367721e8c45f8a0ea9e4604238625a7
MD5 e87c749ed35e3af57ad2f308c32851b2
BLAKE2b-256 cc484d557a8e8ada3bac52353e070661f13a9f41da4f88469e7d4fc442b43445

See more details on using hashes here.

File details

Details for the file nanotable-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: nanotable-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 24.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.11.19 {"installer":{"name":"uv","version":"0.11.19","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for nanotable-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 adcc57b24a714e9a5f1c4d03e0997db7619e9482c4450bfb04c82d52e2be3abe
MD5 1e278d660f5ca67a3a0329a4b4a1c38d
BLAKE2b-256 ea281d838acb85c27cc4b64ecb77b38adae51b5f000d15b80143f96da926a1c2

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page