Pure Python module providing a variable-length, content-based blocking algorithm

Project description

Chop a file into variable-length, content-based chunks.

Example use:

.. code-block:: python

    >>> import rolling_checksum_mod
    >>> with open('/tmp/big-file.bin', 'rb') as file_:
    ...     for chunk in rolling_checksum_mod.min_max_chunker(file_):
    ...         # chunk is a piece of the data from file_; chunk lengths vary.
    ...         # If you insert a byte at the beginning of /tmp/big-file.bin,
    ...         # most of the chunks of the file will remain the same.
    ...         # This can be nice for a deduplicating backup program.
    ...         print(len(chunk))
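
To see why content-based boundaries survive an insertion, here is a minimal, self-contained sketch of the general idea. It is not the algorithm used by ``rolling_checksum_mod``: the function ``toy_chunker`` and its parameters (``window``, ``divisor``, ``min_size``, ``max_size``) are hypothetical names chosen for illustration, and the checksum is a plain sum over the last few bytes.

```python
import io
import random
from collections import deque


def toy_chunker(file_, window=16, divisor=256, min_size=64, max_size=4096):
    """Yield variable-length chunks from a binary file-like object.

    Hypothetical illustration only; not the algorithm used by
    rolling_checksum_mod.
    """
    recent = deque(maxlen=window)  # the last `window` bytes seen
    rolling_sum = 0                # sum of the bytes currently in `recent`
    chunk = bytearray()
    while True:
        block = file_.read(65536)
        if not block:
            break
        for byte in block:
            if len(recent) == window:
                rolling_sum -= recent[0]  # this byte is about to leave the window
            recent.append(byte)
            rolling_sum += byte
            chunk.append(byte)
            # The boundary test depends only on the last `window` bytes,
            # not on the absolute offset within the file.
            at_boundary = rolling_sum % divisor == divisor - 1
            if (len(chunk) >= min_size and at_boundary) or len(chunk) >= max_size:
                yield bytes(chunk)
                chunk.clear()
    if chunk:
        yield bytes(chunk)  # trailing partial chunk


# Chunk some pseudo-random data, then chunk it again with one byte prepended.
rng = random.Random(0)
data = bytes(rng.randrange(256) for _ in range(100_000))
original = list(toy_chunker(io.BytesIO(data)))
shifted = list(toy_chunker(io.BytesIO(b"X" + data)))
unchanged = set(original) & set(shifted)
print(len(original), len(shifted), len(unchanged))
```

Because a boundary is declared from the content of the trailing window rather than from a fixed offset, the two runs should fall back into step shortly after the inserted byte, so most chunks are shared. A fixed-size chunker would instead shift every chunk by one byte and share almost none.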

Download files

Source Distribution

rolling_checksum_py_mod-1.0.1.tar.gz (3.8 kB)

Uploaded Source

File details

Details for the file rolling_checksum_py_mod-1.0.1.tar.gz.

File metadata

  • Download URL: rolling_checksum_py_mod-1.0.1.tar.gz
  • Upload date:
  • Size: 3.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.0 CPython/3.9.2

File hashes

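Hashes for rolling_checksum_py_mod-1.0.1.tar.gz:

=========== ================================================================
Algorithm   Hash digest
=========== ================================================================
SHA256      575065e5b09c5395b2494b39ea22bbebf4b23a500ae3c38ba7338c6d86948f98
MD5         5909182638518b08ee0e02b98fb45dc4
BLAKE2b-256 bd33126ea129205390a22b99d7b515388c2c911c69c8b31c272b794b50d784d7
=========== ================================================================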
