Pure Python module providing a variable-length, content-based blocking algorithm
Project description
Chop a file into variable-length, content-based chunks.
Example use: .. code-block:: python
>>> import rolling_checksum_mod >>> with open('/tmp/big-file.bin', 'rb') as file_: >>> for chunk in rolling_checksum_mod.min_max_chunker(file_): >>> # chunk is now a piece of the data from file_, and it will not always have the same length. >>> # Instead, it has the property that if you insert a byte at the beginning of /tmp/big-file.bin, >>> # most of the chunks of the file will remain the same. This can be nice for a deduplicating >>> # backup program. >>> print(len(chunk))
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Close
Hashes for rolling_checksum_py_mod-1.0.1.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | 575065e5b09c5395b2494b39ea22bbebf4b23a500ae3c38ba7338c6d86948f98 |
|
MD5 | 5909182638518b08ee0e02b98fb45dc4 |
|
BLAKE2b-256 | bd33126ea129205390a22b99d7b515388c2c911c69c8b31c272b794b50d784d7 |