Skip to main content

Filesystem to filesystem backup (deduplicated, compressed with xz)

Project description

Backshift is a filesystem to filesystem backup program, analogous to rsync –link-dest.

Compared to rsync, backshift deduplicates much better, and compresses files - rsync does not.

Backshift also allows easy removal of old backups, despite its strong deduplication and compression.

Files to back up are selected using something like ‘find / -xdev -print0’ and piping that to backshift.

Files are restored by piping to ‘tar xfp’.

Metadata is partially compressed. Each directory’s metadata is compressed separately for easy partial restores.

Content-based, variable length chunks are deduplicated - so simply inserting a byte at a random place in a large file is not going to require backing up the entire file anew.

Backshift runs on CPython 3.x and Pypy3. It may run on nuitka - backshift+nuitka has not been tested much.

On many modern systems, Backshift runs fastest on Pypy, but on some (older?) machines you may be better off with CPython 3.x plus the Cython versions of treap and rolling_checksum_mod.

For pypy, simply install backshift with pip. This should give you a pure-python version of backshift that pypy likes. For CPython+Cython, first install backshift with pip just as you would for pypy. Then additionally install pyx-treap and rolling-checksum-pyx-mod with pip - for a speed boost.

Backshift is not as fast as rsync –link-dest; rsync does not have to do as much work to accomplish what it sets out to do. But if you are paying for your storage, backshift will probably be significantly cheaper.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

backshift-1.55.tar.gz (76.4 kB view details)

Uploaded Source

File details

Details for the file backshift-1.55.tar.gz.

File metadata

  • Download URL: backshift-1.55.tar.gz
  • Upload date:
  • Size: 76.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.9.2

File hashes

Hashes for backshift-1.55.tar.gz
Algorithm Hash digest
SHA256 9df77e2378fed6bb03d5c481aad7665cfd8faed616b68e2a3cf4aa72cad8e2c2
MD5 20471fff707c91808df8157045a660bc
BLAKE2b-256 af599518dd84df3da603a736198288a10399d014bc5a629ff725ecdb94410d3c

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page