S4 · PyPI

Fast and cheap synchronisation of files using Amazon S3

These details have not been verified by PyPI

Project links

Homepage

Project description

Fast and cheap synchronisation of files using Amazon S3.

S4 stands for “Simple Storage Solution (S3) Syncer”.

The intention of this project is to be an open source alternative to typical proprietary sync solutions like Dropbox. Because S4 interacts with S3 directly, you can expect very fast upload and download speeds as well as very cheap costs (See Amazon S3 Pricing for an idea of how much this would cost you). See Why? for further motivation for this project.

You can also take advantage of other cool features that S3 provides like versioning. Everytime you sync a version of a new file, you will now have the ability to easily rollback to any previous version.

Requirements
Installation
Setup
Synchronising
Handling Conflicts
Other Subcommands
How S4 Works
Ignoring Files
Why?
Contributing

See it in action here:

Requirements

S4 requires python 3.5+ to work

Installation

The easiest way to install S4 is through pip:

$ pip install s4

You will need libmagic installed. This is installed by default on most linux distributions but on MacOSX you need to install it with brew as follows:

brew install libmagic

Setup

Run s4 add to add a new sync local folder and target S3 uri:

$ s4 add
local folder: /home/username/myfolder1
s3 uri: s3://mybucket/folder1
AWS Access Key ID: AKIAJD53D9GCGKCD
AWS Secret Access Key:
region name: eu-west-2
Provide a name for this entry [myfolder1]:

Synchronising

Run s4 sync in the project directory to synchronise the local folders you specified with the folders in the bucket.

$ s4 sync
Syncing myfolder1 [/home/username/myfolder1/ <=> s3://mybucket/folder1/]
Creating foobar.jpg (/home/username/myfolder1/ => s3://mybucket/folder1/)
Creating boarding-pass.pdf (/home/username/myfolder1/ => s3://mybucket/folder1/)
Flushing Index to Storage

All files will be automatically synced between the source and target destinations where possible.

You may specify a specific folder to synchronise by the name you provided during add.

$ s4 sync myfolder1

If you wish to synchronise your targets continuously, use the daemon command:

$ s4 daemon myfolder1

NOTE: This command is only supported on machines that can run INotify. This typically means Linux based operating systems.

Handling Conflicts

In the case where S4 cannot decide on a reasonable action by itself, it will ask you to intervene:

Syncing /home/username/myfolder1/ with s3://mybucket/folder1/

Conflict for "test.txt". Which version would you like to keep?
   (1) /home/username/myfolder1/test.txt updated at 2017-01-23 12:26:24 (CREATED)
   (2) s3://mybucket/folder1/test.txt updated at 2017-01-23 12:26:30 (CREATED)
   (d) View difference (requires diff command)
   (X) Skip this file

Choice (default=skip):

If you do not wish to fix the issue, you can simply skip the file for now.

Other Subcommands

Some other subcommands that you could find useful:

s4 targets - print existing targets
s4 edit - edit the settings of a targets
s4 rm - remove a target
s4 ls - print tracked files and metadata of a target

Use the --help parameter on each subcommand to get more details.

How S4 Works

S4 keeps track of changes between files with a .index file at the root of each folder you are syncing. This contains the keys of each file being synchronised along with the timestamps found locally and remotely in JSON format.

This is compressed (currently using gzip) to save space and increase performance when loading.

If you are curious, you can view the contents of an index file using the s4 ls subcommand or you can view the file directly using a command like zcat.

NOTE: Deleting this file will result in that folder being treated as if it was never synced before so make sure you do not delete it unless you know what you are doing.

All information about your configuration (such as targets, your keys etc..) are stored in a JSON formatted file under ~/.config/s4/sync.conf.

Ignoring Files

Create a .syncignore file in the root of the directory being synced to list patterns of subdirectories and files you wish to ignore. The .syncignore file uses the exact same pattern that you would expect in .gitignore. Each line specifies a GLOB pattern to ignore during sync.

Note that if you add a pattern which matches an item that was previously synced, that item will be deleted from the target you are syncing with next time you run S4.

Why?

There are a number of open source S3 backup tools out there - but none of them really satisfied the requirements that this project tries to solve.

Here are is a list of open source solutions that I have tried in the past.

s3cmd: Provides a sync function that works very well for backing up - but stops working correctly as soon as there is second machine you want to sync to S3.
owncloud/nextcloud: Requires you to setup a server to perform your syncing. In terms of costs on AWS, this quickly becomes costly compared with just using S3. The speed of your uploads and downloads are also heavily bottlenectked by the network and hardware performance of your ec2 instance.
seafile: suffers from the same problem as owncloud/nextcloud.
duplicity: great backup tool, but does not provide a sync solution of any kind.

Contributing

Pull requests are welcome! Make sure you pass all the tests, CircleCI will tell you if you don’t ;)

When opening a pull request, please make sure it is from a separate branch in your fork.

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

This version

0.4.4

May 26, 2019

0.4.2

May 6, 2019

0.4.1

Nov 15, 2018

0.4.0

Oct 30, 2018

0.3.0

Oct 29, 2018

0.2.20

Jul 26, 2018

0.2.19

Jul 25, 2018

0.2.18

Jul 1, 2018

0.2.16

Jun 18, 2018

0.2.15

May 20, 2018

0.2.14

May 8, 2018

0.2.13

May 8, 2018

0.2.12

Apr 13, 2018

0.2.11

Apr 3, 2018

0.2.10

Apr 3, 2018

0.2.9

Mar 21, 2018

0.2.8

Feb 24, 2018

0.2.7

Jan 24, 2018

0.2.6

Dec 23, 2017

0.2.4

Sep 27, 2017

0.2.3

Sep 26, 2017

0.2.2

Sep 26, 2017

0.2.1

Aug 10, 2017

0.2.0

Aug 10, 2017

0.1.31

Aug 2, 2017

0.1.30

Jul 30, 2017

0.1.29

Jul 30, 2017

0.1.28

Jul 30, 2017

0.1.27

Jul 30, 2017

0.1.26

Jul 27, 2017

0.1.25

Jul 25, 2017

0.1.24

Jul 25, 2017

0.1.23

Jul 24, 2017

0.1.22

Jul 24, 2017

0.1.21

Jul 24, 2017

0.1.20

Jul 24, 2017

0.1.19

Jul 23, 2017

0.1.18

Jul 23, 2017

0.1.17

Jul 23, 2017

0.1.16

Jul 23, 2017

0.1.15

Jul 21, 2017

0.1.14

Jul 20, 2017

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

S4-0.4.4.tar.gz (47.1 kB view details)

Uploaded May 26, 2019 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

S4-0.4.4-py2.py3-none-any.whl (36.1 kB view details)

Uploaded May 26, 2019 Python 2Python 3

File details

Details for the file S4-0.4.4.tar.gz.

File metadata

Download URL: S4-0.4.4.tar.gz
Upload date: May 26, 2019
Size: 47.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: Python-urllib/3.7

File hashes

Hashes for S4-0.4.4.tar.gz
Algorithm	Hash digest
SHA256	`c0ac4dbc2cfb90d63d296ca83547a7f856b0e166e8e22f8f659ead9f8c2b8f6f`
MD5	`54f95f2e520c48cf634421e68fb21523`
BLAKE2b-256	`70a1c2202c7de3588d97095402d03450366045cace89c6272faaac8cb479d143`

See more details on using hashes here.

File details

Details for the file S4-0.4.4-py2.py3-none-any.whl.

File metadata

Download URL: S4-0.4.4-py2.py3-none-any.whl
Upload date: May 26, 2019
Size: 36.1 kB
Tags: Python 2, Python 3
Uploaded using Trusted Publishing? No
Uploaded via: Python-urllib/3.7

File hashes

Hashes for S4-0.4.4-py2.py3-none-any.whl
Algorithm	Hash digest
SHA256	`33419350cba6b946be4b275b96920b96812d261408bcc6b24fa95d04bd722ab6`
MD5	`07b3b333ff00c5136bd9138059d69dc3`
BLAKE2b-256	`63a4316409c489e6c7100b0e29e69ea0b75b3e9486e78169f4c3011ee7b84f2b`

See more details on using hashes here.

S4 0.4.4

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Requirements

Installation

Setup

Synchronising

Handling Conflicts

Other Subcommands

How S4 Works

Ignoring Files

Why?

Contributing

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes