Split binary files on content boundaries
A command-line tool like Unix csplit, except that it splits binary files on content boundaries rather than on index boundaries.
This is particularly useful for salvaging JPEGs from a corrupted disk image.
$ csplitb --prefix photo --suffix .jpg --number 4 ffd8ffe1 corrupted-disk-image.raw
This will split the disk image, keeping the header, into photo0000.jpg, photo0001.jpg, etc. Because a disk image represents a potentially fragmented file system, some (or all) of the jpeg files may be corrupted.
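The idea of splitting on a content boundary can be sketched in a few lines of Python. This is only an illustration of the technique, not csplitb's actual implementation: the function names `split_on_marker` and `write_chunks` are hypothetical, the whole input is assumed to fit in memory, and bytes before the first marker are assumed to be discarded.

```python
from pathlib import Path


def split_on_marker(data: bytes, marker: bytes) -> list:
    """Return the pieces of `data` that begin at each occurrence of `marker`.

    Assumption of this sketch: anything before the first marker is dropped.
    """
    if not marker:
        raise ValueError("marker must not be empty")
    # Collect the offset of every non-overlapping occurrence of the marker.
    offsets = []
    pos = data.find(marker)
    while pos != -1:
        offsets.append(pos)
        pos = data.find(marker, pos + len(marker))
    # Each piece runs from one marker to the next, or to the end of the data.
    ends = offsets[1:] + [len(data)]
    return [data[start:end] for start, end in zip(offsets, ends)]


def write_chunks(chunks, prefix="photo", suffix=".jpg", digits=4, out_dir="."):
    """Write each chunk to <prefix><zero-padded index><suffix> in out_dir."""
    paths = []
    for index, chunk in enumerate(chunks):
        path = Path(out_dir) / f"{prefix}{index:0{digits}d}{suffix}"
        path.write_bytes(chunk)
        paths.append(path)
    return paths
```

Because every piece starts at a marker, each output file keeps its header. A real disk image may be far larger than available memory, so a practical tool would likely scan the file incrementally or through a memory map rather than read it in one call.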
The “ffd8ffe1” here is a JPEG header: the start-of-image marker (ffd8) followed by the APP1 marker (ffe1) that Exif files use. I’m not an expert on the JPEG format, so you may need a different hex search term; JFIF files, for example, start with ffd8ffe0. Use a hex editor or a viewer such as xxd to find the content you want to split on.
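If no hex viewer is at hand, one way to find a candidate search term is to print the first bytes of a file you know is intact, for example a photo from the same camera. The helper below is a hypothetical sketch for that purpose and is not part of csplitb.

```python
def leading_hex(path, count=4):
    """Return the first `count` bytes of the file at `path` as a hex string."""
    with open(path, "rb") as handle:
        return handle.read(count).hex()
```

The returned string is in the same form as the hex search term passed to csplitb in the example above.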
pip install csplitb