Skip to main content

A block based deduplicating backup software for Ceph RBD, image files and devices

Project description

https://img.shields.io/github/workflow/status/elemental-lf/benji/All-in-One/master?style=plastic https://img.shields.io/pypi/l/benji.svg?style=plastic&label=License https://img.shields.io/pypi/v/benji.svg?style=plastic&label=PyPI%20version https://img.shields.io/pypi/pyversions/benji.svg?style=plastic&label=Supported%20Python%20versions

Benji Backup

Benji Backup is a block based deduplicating backup software. It builds on the excellent foundations and concepts of backy² by Daniel Kraft.

While Benji can backup any block device or image file (this includes LVM logical volumes and snapshots) it excels at backing up Ceph RBD images and it also includes preliminary support to backup iSCSI targets.

Benji is written in Python and is available in PyPI for installation with pip. Benji also features a generic container image with all dependencies included as well as an image and Helm chart to integrate Benji into a Kubernetes environment to backup Ceph RBD based persistent volumes.

The documentation is available here.

Status

Benji is beta quality and will probably stay that way due to time constraints. Please open an issue on GitHub if you have any usage question that is not or incorrectly covered by the documentation. And have a look at the CHANGES file for any upgrade notes.

Benji requires Python 3.6.5 or newer because older Python versions have some shortcomings in the concurrent.futures implementation which lead to an excessive memory usage.

The master branch contains the development version of Benji and may be broken at times and may even destroy your backups. Please use the latest pre-releases to get some resemblance of stability and a migration path from one pre-release to the next.

The benji-k8s container image together with the Helm chart provides a solid way for backing up persistent volumes provided by Ceph RBD. This includes volumes provisioned by Rook, Ceph CSI or the older volume plugin integrated into kubelet.

Main Features

Small backups

Benji deduplicates all data read and each unique block is only written to the storage location once. The deduplication takes into account all historic data present on the backup storage and so spans all backups and all backup sources.

In addition Benji supports fast state-of-the-art compression to further reduce the storage space requirements.

Fast backups

With the help of snapshots and the rbd diff command Benji only backups blocks that have changed since the last backup when used with Ceph RBD images. The same mechanism can be extended to other backup sources.

Fast restores

Sparse blocks are be skipped on restore providing fast restores of sparsely populated disk images.

Low bandwidth requirements

As only changed and not yet known blocks are written to the backup storage, the bandwidth requirements for the network connection between Benji and the storage location are usually low. Even with newly created block devices the traffic to the backup storage location is generally small as these devices mostly contain sparse blocks. Enabling compression further reduces the bandwidth requirements.

Support for a variety of backup storage locations

Benji supports AWS S3 as a backup storage location and it has options to enable compatibility with other S3 implementations like Google Storage, Ceph’s RADOS Gateway or Minio.

Benji also supports Backblaze’s B2 Cloud Storage which opens up a very cost effective way to store backups.

Benji is able to use any file based storage including external hard drives and network based storage solutions like NFS, SMB or even CephFS.

Multiple different storage locations can be used simultaneously and in parallel to accomodate different backup strategies.

Confidentiality

Benji supports AES-256 in GCM mode to encrypt all data blocks on the backup storage. By using envelope encryption every block is encrypted with its own unique random key. This makes plaintext attacks even more difficult.

Integrity

Each data block in Benji is protected by a checksum. This checksum is not only used for deduplication but also to ensure the integrity of the whole backup. Long-term availability of backups is ensured by regularly checking existing backups for bit rot.

Integrated NBD server

Benji brings its own NBD (network block device) server which makes backup images directly accessible as a block device - even over the network. The block device can be mounted if it contains a filesystem and any individual files needed can be easily restored even though Benji is a block based backup solution.

Benji can also provide a writable version of a backup via NBD. This enables repair operations like fsck. The original backup is not changed in this case. All changes are transparently written to a new backup via copy-on-write and this new backup can be restored just like any other backup after the repair is complete.

Concurrency

Benji supports running multiple operations simultaneously. Instances can be distributed across different hosts or containers without the need for a central server.

Extensibility

Benji comes with a module framework to easily add new protocols for accessing backup sources or storages. New compression and encryption algorithms are also easily integrated into Benji.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

benji-0.17.0.tar.gz (347.1 kB view details)

Uploaded Source

Built Distribution

benji-0.17.0-py3-none-any.whl (144.7 kB view details)

Uploaded Python 3

File details

Details for the file benji-0.17.0.tar.gz.

File metadata

  • Download URL: benji-0.17.0.tar.gz
  • Upload date:
  • Size: 347.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.9.14

File hashes

Hashes for benji-0.17.0.tar.gz
Algorithm Hash digest
SHA256 0580347832ccd884797c13614c2ed6311ec36b6aa9adf8fc61fec04dd6a46dac
MD5 8fbd4824ce9220fba3f0c399fc0374c1
BLAKE2b-256 083e635ce7f8863827ed4fc7d84c113d4978615764fd4bbc0e9c5c64c82f54f1

See more details on using hashes here.

File details

Details for the file benji-0.17.0-py3-none-any.whl.

File metadata

  • Download URL: benji-0.17.0-py3-none-any.whl
  • Upload date:
  • Size: 144.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.9.14

File hashes

Hashes for benji-0.17.0-py3-none-any.whl
Algorithm Hash digest
SHA256 8ee3b805e40a3b42e99b8cb68ab808952830d7cab3299e64c83eb7917a8261c0
MD5 064f4a5026fa9632cbe90992b7fa047b
BLAKE2b-256 a8dce3a28bb9675a0c9c217856d148e731d591018e4a6f0161262967512fb8eb

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page