Cython-augmented Python module providing a variable-length, content-based blocking algorithm

Chop a file into variable-length, content-based chunks.

Example use:

.. code-block:: python

    import rolling_checksum_mod

    # If both rolling_checksum_pyx_mod and rolling_checksum_py_mod are installed,
    # the pyx version is preferred automatically. Both the py and pyx versions
    # require rolling_checksum_py_mod, but only the pyx version requires
    # rolling_checksum_pyx_mod.
    with open('/tmp/big-file.bin', 'rb') as file_:
        for chunk in rolling_checksum_mod.min_max_chunker(file_):
            # chunk is a piece of the data from file_; chunks do not all have
            # the same length. Instead, if you insert a byte at the beginning
            # of /tmp/big-file.bin, most of the chunks of the file remain the
            # same. This is useful for a deduplicating backup program.
            print(len(chunk))
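
To illustrate why content-based chunk boundaries survive an insertion at the start of a file, here is a minimal sketch of the general idea. It is not the algorithm used by this package: ``toy_chunker``, its parameters, the window size, and the simple rolling sum are all hypothetical choices made for illustration only.

```python
import random

WINDOW = 16  # bytes covered by the rolling sum (illustrative value)


def toy_chunker(data, min_size=64, max_size=1024, mask=0xFF):
    """Yield variable-length chunks of a bytes object.

    Hypothetical illustration, not this package's algorithm: a chunk ends
    where the sum of the last WINDOW bytes has all `mask` bits clear, as
    long as the chunk has at least min_size bytes, or when it reaches
    max_size bytes.
    """
    start = 0
    rolling = 0
    for i, byte in enumerate(data):
        rolling += byte
        if i >= WINDOW:
            rolling -= data[i - WINDOW]  # drop the byte leaving the window
        length = i - start + 1
        at_content_boundary = length >= min_size and (rolling & mask) == 0
        if at_content_boundary or length >= max_size:
            yield data[start:i + 1]
            start = i + 1
    if start < len(data):
        yield data[start:]  # trailing partial chunk


# Compare the chunks of some pseudo-random data with the chunks of the same
# data after one byte is inserted at the front.
rng = random.Random(0)
original = bytes(rng.getrandbits(8) for _ in range(20000))
chunks_a = list(toy_chunker(original))
chunks_b = list(toy_chunker(b"X" + original))

seen = set(chunks_a)
shared = sum(1 for chunk in chunks_b if chunk in seen)
print(len(chunks_a), len(chunks_b), shared)
```

Because the boundary test looks only at the bytes inside the window, the two chunkings fall back into step once they reach a common boundary, and every chunk after that point is identical. A fixed-size chunker would instead shift every chunk by one byte.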


Source distribution: ``rolling_checksum_pyx_mod-1.1.2.tar.gz`` (60.9 kB)

Hashes for ``rolling_checksum_pyx_mod-1.1.2.tar.gz``:

=========== ================================================================
Algorithm   Hash digest
=========== ================================================================
SHA256      395863e9adfc3c3d9f1661c582e205d67c1ba9906a7282e9a0adbc2e0130d94f
MD5         f4c07a55083fdceaa5cdd4f72fa32386
BLAKE2b-256 5b912c4940944cebfc0821fc1306fbd28420f3e381321723e2e65c35c18f7daf
=========== ================================================================
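
A downloaded archive can be checked against the SHA256 digest listed above. The following is a minimal sketch using only the standard library. The helper name ``sha256_of_file`` and the local file path in the usage note are assumptions for illustration.

```python
import hashlib

# SHA256 digest listed above for rolling_checksum_pyx_mod-1.1.2.tar.gz.
EXPECTED_SHA256 = "395863e9adfc3c3d9f1661c582e205d67c1ba9906a7282e9a0adbc2e0130d94f"


def sha256_of_file(path, block_size=1 << 20):
    """Return the hex SHA256 digest of the file at `path`, read in blocks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for block in iter(lambda: handle.read(block_size), b""):
            digest.update(block)
    return digest.hexdigest()
```

Assuming the archive was saved in the current directory, ``sha256_of_file("rolling_checksum_pyx_mod-1.1.2.tar.gz") == EXPECTED_SHA256`` should evaluate to ``True`` for an intact download.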
