crctools - A simple tool to check file integrity using CRC32 hash in filename
Having the hash of files in their name makes it so much easier to:
- keep track of their integrity
- deduplicate files
- probably more
This is intended to be a simple-to-use command-line utility to:
- add the CRC32 hash to the filename in a widely recognised format (a suffix of 8 uppercase hex characters in square brackets)
- verify the integrity of files based on the hash in the file name
- update the hash in the name
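The operations listed above can be illustrated with a minimal sketch. This is not the code crctools uses: the function names are hypothetical, and the exact placement of the tag (a space, then the bracketed hash, just before the extension) is an assumption based on the format described.

```python
import re
import zlib
from pathlib import Path
from typing import Optional

# Assumption: the tag is 8 uppercase hex characters in square brackets
# at the very end of the file name, just before the extension.
CRC_TAG = re.compile(r"\[([0-9A-F]{8})\]$")


def crc32_of_file(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the CRC32 of a file as 8 uppercase hex characters."""
    crc = 0
    with path.open("rb") as handle:
        # Read in chunks so large files do not need to fit in memory.
        while chunk := handle.read(chunk_size):
            crc = zlib.crc32(chunk, crc)
    return f"{crc:08X}"


def tagged_name(path: Path, crc: str) -> str:
    """Build a file name that carries the hash (hypothetical layout)."""
    return f"{path.stem} [{crc}]{path.suffix}"


def hash_in_name(path: Path) -> Optional[str]:
    """Extract the hash from a tagged file name, or None if absent."""
    match = CRC_TAG.search(path.stem)
    return match.group(1) if match else None


def verify(path: Path) -> Optional[bool]:
    """True/False if the name carries a hash, None if there is nothing to check."""
    expected = hash_in_name(path)
    if expected is None:
        return None
    return crc32_of_file(path) == expected
```

Under these assumptions, a file whose content is the ASCII string 123456789 has CRC32 CBF43926, so clip.mkv would become "clip [CBF43926].mkv".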
Requirements
This project was developed for Python 3.10 and may not work on earlier versions.
Installation
From a terminal, execute:
python -m pip install crctools-DavidRodriguezSoaresCUI
On some systems it may be necessary to call the interpreter as python3 instead of python.
Usage
$> python -m crctools --help
usage: __main__.py [-h] [--skip_verify] [--overwrite] [--recursive] [--extensions [EXTENSIONS ...]]
                   [--min_size MIN_SIZE] [--write_report] [--skip_frozen_dirs]
                   [--frozen_dirs [FROZEN_DIRS ...]]
                   [--frozen_dir_file_ext [FROZEN_DIR_FILE_EXT ...]] [--debug]
                   PATH

positional arguments:
  PATH                  Can be file path or directory (all files in directory will be processed)

options:
  -h, --help            show this help message and exit
  --skip_verify         Skip verification; only process files with no hash in filename (useful to
                        resume interrupted execution)
  --overwrite           Overwrite CRC in filename when verification fails
  --recursive           (Only with PATH a directory) Enables recursive search for files to verify
  --extensions [EXTENSIONS ...]
                        Restrict files to process with extension whitelist (default: no restriction;
                        you may list extensions with leading dot separator)
  --min_size MIN_SIZE   Restrict files to ones of at least <min_size> bytes; accepts values like
                        '-4.4k', '99G' or '0.5M' (case insensitive); default: 0)
  --write_report        Writes JSON file with list of files processed by category: COMPUTED,
                        VERIFIED, ERROR
  --skip_frozen_dirs    Skip frozen dirs (see --frozen_dirs)
  --frozen_dirs [FROZEN_DIRS ...]
                        Name of directories that should be treated as read-only (so no file
                        renaming); default: VIDEO_TS BDMV
  --frozen_dir_file_ext [FROZEN_DIR_FILE_EXT ...]
                        List of file extensions typically associated with frozen directories (used
                        for warning); default: IFO BUP VOB M2TS BDMV MPLS CLPI
Example: check large (>10 MB) video files in directory D:\Vidéos (and its subdirectories):
python -m crctools "D:\Vidéos" --recursive --extensions mkv mp4 --min_size 10000000
Note: these are all equivalent:
--extensions mkv mp4
--extensions .mkv .mp4
--extensions MKV MP4
--extensions .MKV .MP4
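The equivalence above suggests that extensions are normalised before comparison. A minimal sketch of one way to do that follows; the function names are hypothetical and this is not taken from the crctools source.

```python
from pathlib import Path
from typing import Iterable


def normalize_extension(ext: str) -> str:
    """Drop the optional leading dot and ignore case: '.MKV' -> 'mkv'."""
    return ext.strip().lstrip(".").lower()


def matches_whitelist(filename: str, whitelist: Iterable[str]) -> bool:
    """Return True if the file's extension is allowed by the whitelist.

    An empty whitelist means "no restriction", mirroring the documented default.
    """
    allowed = {normalize_extension(ext) for ext in whitelist}
    if not allowed:
        return True
    return normalize_extension(Path(filename).suffix) in allowed
```

With this normalisation, all four spellings in the note reduce to the same set, {"mkv", "mp4"}.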
I typically use:
python -m crctools . --recursive --extensions AVI MP4 MKV WMV ZIP TS MOV WEBM RAR --min_size 20M
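Size values such as 20M, or the '-4.4k', '99G' and '0.5M' shown in the help text, could be parsed along the following lines. This is only a sketch: the function name is hypothetical, and it assumes decimal multiples (k = 1000, M = 1000000, G = 1000000000), which the help text does not specify; the tool may use powers of 1024 instead.

```python
import re

# Assumption: decimal multiples. The real tool may use 1024-based units.
_UNITS = {"": 1, "k": 10**3, "m": 10**6, "g": 10**9}
_SIZE = re.compile(r"^\s*(-?\d+(?:\.\d+)?)\s*([kmg]?)\s*$", re.IGNORECASE)


def parse_size(text: str) -> int:
    """Convert a string like '20M' or '0.5m' into a number of bytes."""
    match = _SIZE.match(text)
    if match is None:
        raise ValueError(f"unrecognised size: {text!r}")
    number, unit = match.groups()
    # round() guards against floating-point noise in fractional values.
    return round(float(number) * _UNITS[unit.lower()])
```

Under that assumption, parse_size("20M") gives 20000000, which is consistent with the earlier example passing 10000000 for a 10 MB threshold.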
Concept of frozen directories
Some directories, such as DVD/BD disc backups, have a fixed directory structure that should not be changed by adding hashes to file names. Instead, all files inside are hashed, their hashes are collected into a file outside the directory, and that file is itself hashed to give a "composite" hash that represents the whole directory.
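The composite hash idea can be sketched as follows. The layout of the hash listing (one "HASH relative/path" line per file, sorted by path) and the function names are assumptions made for illustration; the file format crctools actually writes is not described here.

```python
import zlib
from pathlib import Path
from typing import Tuple


def crc32_of_file(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the CRC32 of a file as 8 uppercase hex characters."""
    crc = 0
    with path.open("rb") as handle:
        while chunk := handle.read(chunk_size):
            crc = zlib.crc32(chunk, crc)
    return f"{crc:08X}"


def composite_hash(frozen_dir: Path) -> Tuple[str, str]:
    """Hash every file in a frozen directory without renaming anything.

    Returns the listing text (hypothetical format) and the CRC32 of that
    listing, which stands in for the directory as a whole.
    """
    lines = []
    # Sort paths so the listing, and therefore the composite hash, is stable.
    for file in sorted(p for p in frozen_dir.rglob("*") if p.is_file()):
        relative = file.relative_to(frozen_dir).as_posix()
        lines.append(f"{crc32_of_file(file)} {relative}")
    listing = "\n".join(lines) + "\n"
    composite = f"{zlib.crc32(listing.encode('utf-8')):08X}"
    return listing, composite
```

The listing would then be saved next to the frozen directory (for example beside VIDEO_TS), with the composite hash carried in the name of that listing file rather than in any file inside the directory.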
Hashes for crctools-DavidRodriguezSoaresCUI-0.0.3.tar.gz
Algorithm | Hash digest
---|---
SHA256 | 38f605638da796d51bccac9c492832d5cb0bf21e04a8b1c3377a493e1fe2a3f8
MD5 | 2d3906825bddcad594c4bfb2933d2c47
BLAKE2b-256 | b345eeb7a4c030709153234acdad80a6952eadf317dc813bbcaadcb331fe5f18
Hashes for crctools_DavidRodriguezSoaresCUI-0.0.3-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | 34fb80f7ff5eb975441bebe1a09fa89c204b0ae4e517db74d4278daf6c4c9b88
MD5 | 19c53d80191d6c22ead7df6882b4d89f
BLAKE2b-256 | 11231c25381c48e90ad30375a285aaea540f957e032dd36fca1543e90e1cdc32