Cache, advanced slicing and lazy loading for __getitem__
Project description
ZnSlice
A lightweight library (without external dependencies) for:
- advanced slicing.
- cache
__getitem__(self, item) - lazy load
__getitem__(self, item)
Installation
pip install znslice
Usage
Advanced Slicing and Cache
Convert List to znslice.LazySequence to allow advanced slicing.
import znslice
lst = znslice.LazySequence.from_obj([1, 2, 3], indices=[0, 2])
print(lst[[0, 1]].tolist()) # [1, 3]
import znslice
import collections.abc
class MapList(collections.abc.Sequence):
def __init__(self, data, func):
self.data = data
self.func = func
@znslice.znslice
def __getitem__(self, item: int):
print(f"Loading item = {item}")
return self.func(self.data[item])
def __len__(self):
return len(self.data)
data = MapList([0, 1, 2, 3, 4], lambda x: x ** 2)
assert data[0] == 0
assert data[[1, 2, 3]] == [1, 4, 9]
# calling data[:] will now only compute data[4] and load the remaining data from cache
assert data[:] == [0, 1, 4, 9, 16]
Lazy Database Loading
You can use znslice to lazy load data from a database. This is useful if you have a large database and only want to load a small subset of the data.
In the following we will use the ase package to generate Atoms objects stored in a database and load them lazily.
import ase.io
import ase.db
import znslice
import tqdm
import random
# create a database
with ase.db.connect("data.db", append=False) as db:
for _ in range(10):
atoms = ase.Atoms('CO', positions=[(0, 0, 0), (0, 0, random.random())])
db.write(atoms, group="data")
# load the database lazily
class ReadASEDB:
def __init__(self, file):
self.file = file
@znslice.znslice(
advanced_slicing=True, # this getitem supports advanced slicingn
lazy=True # we want to lazy load the data
)
def __getitem__(self, item):
data = []
with ase.db.connect(self.file) as database:
if isinstance(item, int):
print(f"get {item = }")
return database[item + 1].toatoms()
for idx in tqdm.tqdm(item):
data.append(database[idx + 1].toatoms())
return data
def __len__(self):
with ase.db.connect(self.file) as db:
return len(db)
db = ReadASEDB("data.db")
data = db[::2] # LazySequence([<__main__.ReadASEDB>], [[0, 2, 4, 6, 8]])
data.tolist() # list[ase.Atoms]
# supports addition, advanced slicing, etc.
data = db[::2] + db[1::2]
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file znslice-0.1.3.tar.gz.
File metadata
- Download URL: znslice-0.1.3.tar.gz
- Upload date:
- Size: 9.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.3.2 CPython/3.10.8 Linux/5.19.0-32-generic
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
780033efa0a73096a96b06972265c6a8b9bd0fa3178c98888a0aa03f49fca634
|
|
| MD5 |
559fa548a4d8dbe2c7c5e3a99cae9af8
|
|
| BLAKE2b-256 |
a32f8fd844656c540f6965a225acb304c8c0d82d3a8cd129dead7a719f0cd5e4
|
File details
Details for the file znslice-0.1.3-py3-none-any.whl.
File metadata
- Download URL: znslice-0.1.3-py3-none-any.whl
- Upload date:
- Size: 9.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.3.2 CPython/3.10.8 Linux/5.19.0-32-generic
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d0e9b5421f2b3a4fe7bd390ad552da7131e610167afae6487ac1488868234f21
|
|
| MD5 |
9da1da04571f34193143ff18eea0575c
|
|
| BLAKE2b-256 |
1efed090f6d702db1a57623724f319f32538f001ecccad659f1d82d014433ae6
|