Skip to main content

Pure-Python safetensors

Project description

pure_safetensors

Safetensors library but in pure clean Python. Run it on PyPy or IronPython or wherever.

Dependencies

We try to keep dependencies light:

  • attrs dataclass library (2881 LoC)
  • marshmallow serialization and validation library (2647 LoC)
  • sortedcollections tiny sorted collections library (339 LoC) built on top of sortedcontainers (1493 LoC)
  • (optional) sparsefile sparse file library (191 LoC)
  • (optional) fickle whitelist-based firewall for safe pickle loading, used for PyTorch model conversion (926 LoC)

Optionally, this library integrates with NumPy (if available). PyTorch integration is planned, someday.

To run the tests, you'll need pytest, numpy, and optionally hypothesis.

Examples

from pure_safetensors import SafeTensors

with SafeTensors("/path/to/example.safetensors", "r+") as sf:
    arrays = sf.as_numpy()
    arrays["hello"][3, :] += 420.69
    arrays["world"] = arrays["hello"][0:2] * 10

    # assign multiple arrays! much faster!
    arrays.update(
        {
            "q": my_array_1,
            "k": my_array_2,
            "v": my_array_3,
        }
    )

    # delete arrays! such wonders!
    del arrays["v"]

Conversion

Do you have an existing PyTorch checkpoint model that you would like to convert to safetensors? Then try running:

python3 -m pure_safetensors import-pytorch /path/to/model.ckpt /path/to/model.safetensors

You may also be able to convert the model in-place without making an additional copy! Use this facility at your own risk (or make a zero-cost copy-on-write backup copy of your checkpoint file using cp --reflink=always ... if your filesystem supports it).

# convert it
python3 -m pure_safetensors import-pytorch-inplace /data/model.ckpt

# rename
mv /data/model.ckpt /data/model.safetensors

Bugs

The space allocator is a greedy algorithm based on first-fit-decreasing bin packing. So if you add/remove tensors to an existing file, it may leave too much empty space behind.

PyTorch support isn't implemented yet.

Alternatives

pure_safetensors safetensors pure_torch.py safetensors.cpp
Written in pure Python?
Supports NumPy (without PyTorch)?
Supports PyTorch?
Can work without numpy or pytorch?
Can write safetensors files?
Can modify file in-place to add/remove tensors?
Has test suite?
Stable API? 🤷
Automatically makes files sparse to save space?
Works on platforms without mmap?

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pure_safetensors-0.3.2.tar.gz (19.4 kB view details)

Uploaded Source

Built Distribution

pure_safetensors-0.3.2-py3-none-any.whl (18.0 kB view details)

Uploaded Python 3

File details

Details for the file pure_safetensors-0.3.2.tar.gz.

File metadata

  • Download URL: pure_safetensors-0.3.2.tar.gz
  • Upload date:
  • Size: 19.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.3.0 pkginfo/1.4.2 requests/2.25.1 setuptools/52.0.0 requests-toolbelt/0.9.1 tqdm/4.57.0 CPython/3.9.2

File hashes

Hashes for pure_safetensors-0.3.2.tar.gz
Algorithm Hash digest
SHA256 e8b748158428643f9f7c6dddff5784d3051b429e40a0075b63c6662e2c102b13
MD5 147905527ae7f0c453c3fd1c09f6193d
BLAKE2b-256 c093bd80899faff14079c56c09ebd882fb46f484a68522592cf79ab12b404ccf

See more details on using hashes here.

File details

Details for the file pure_safetensors-0.3.2-py3-none-any.whl.

File metadata

  • Download URL: pure_safetensors-0.3.2-py3-none-any.whl
  • Upload date:
  • Size: 18.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.3.0 pkginfo/1.4.2 requests/2.25.1 setuptools/52.0.0 requests-toolbelt/0.9.1 tqdm/4.57.0 CPython/3.9.2

File hashes

Hashes for pure_safetensors-0.3.2-py3-none-any.whl
Algorithm Hash digest
SHA256 5c1af9d49da1addf3716e91634da33c3ee9d3c37a5a498e9af33fcf9a956e661
MD5 bab171c24c902359eed481bae0a4406e
BLAKE2b-256 1945675273f16792d7b139bbd970525713b70fae65962de9d61b96b39e1726fb

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page