partialCopy

A tool to copy big data to multiple smaller disks

Motivation

As storage grows in large projects, we need a tool to break large folders (on the order of 100 TB) into smaller chunks, so they can be migrated to another location or stored on tapes.

How does it work?

The tool finds the best placement for the files and creates a file list in the --save-to directory, which can be passed to rsync using its --files-from parameter.
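
The placement step can be pictured as a bin-packing problem. The sketch below is illustrative only: it assumes a first-fit-decreasing strategy, which may differ from what partialCopy actually does. The function names place_files and write_lists and the chunk_N.txt file names are hypothetical.

```python
import os


def place_files(sizes, capacity):
    """Group files into chunks of at most `capacity` bytes.

    Assumption: first-fit decreasing. The real tool may use another strategy.
    `sizes` maps a relative file path to its size in bytes.
    """
    chunks = []  # one list of file paths per destination disk
    free = []    # remaining bytes in each chunk
    # Largest files first; ties broken by path so the result is deterministic.
    for path, size in sorted(sizes.items(), key=lambda kv: (-kv[1], kv[0])):
        if size > capacity:
            raise ValueError(f"{path} ({size} bytes) exceeds the chunk capacity")
        for i, room in enumerate(free):
            if size <= room:
                chunks[i].append(path)
                free[i] -= size
                break
        else:
            # No existing chunk has room, so open a new one.
            chunks.append([path])
            free.append(capacity - size)
    return chunks


def write_lists(chunks, save_to):
    """Write one file list per chunk (hypothetical names chunk_0.txt, ...)."""
    os.makedirs(save_to, exist_ok=True)
    written = []
    for i, chunk in enumerate(chunks):
        list_path = os.path.join(save_to, f"chunk_{i}.txt")
        with open(list_path, "w") as fh:
            fh.write("\n".join(chunk) + "\n")
        written.append(list_path)
    return written
```

Each resulting list holds paths relative to the source directory, which is the form rsync expects from --files-from, for example: rsync -a --files-from=chunk_0.txt SRC/ DEST/.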

Installation

sudo pip install partialCopy

Usage

usage: pcp.py [-h] [--dest DEST | --dest-size DEST_SIZE] [-s SAVE_TO] [-f]
              [-n] [-ma MODIFIED_AFTER] [-fp FIND_PARAMS]
              src

positional arguments:
  src                   Source Directory

optional arguments:
  -h, --help            show this help message and exit
  --dest DEST           Destination mountpoint
  --dest-size DEST_SIZE
                        Destination size, given in bytes or using 1 letter
                        unit B,K,M,G,T,P
  -s SAVE_TO, --save-to SAVE_TO
                        Where to save rsync list, default
                        '$src/pcp_rsync_list/'
  -f, --force           Rewrite all lists again
  -n, --new             Find New Files
  -ma MODIFIED_AFTER, --modified-after MODIFIED_AFTER
                        Find files modified after certain time (YYYY-mm-dd)
  -fp FIND_PARAMS, --find-params FIND_PARAMS
                        Parameters to find command
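
To make the --dest-size format concrete, here is a minimal sketch of how a value such as 4T or 500 could be converted to bytes. It is not taken from the tool's source: the function name parse_size is hypothetical, and the use of powers of 1024 (rather than 1000) is an assumption.

```python
# Assumption: each unit is a power of 1024; the tool may use powers of 1000.
UNITS = {
    "B": 1,
    "K": 1024,
    "M": 1024 ** 2,
    "G": 1024 ** 3,
    "T": 1024 ** 4,
    "P": 1024 ** 5,
}


def parse_size(text):
    """Convert '500', '10B', '1.5K' or '4T' to a number of bytes."""
    text = text.strip().upper()
    if text and text[-1] in UNITS:
        number, unit = text[:-1], text[-1]
        return int(float(number) * UNITS[unit])
    # No unit letter: the value is already given in bytes.
    return int(text)
```

With these assumptions, parse_size("4T") gives the byte count for a 4 TiB destination, and a plain number is taken as bytes.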

Changes

  • Allows providing a destination size rather than a mount point.
  • Finds only new files with the -n parameter.
  • Finds files modified after a given date with the -ma parameter.
  • Allows rewriting all lists with the -f flag.
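
As an illustration of what the -ma filter selects, the sketch below lists files whose modification time is later than a YYYY-mm-dd date. The help text suggests the tool relies on the find command, so this pure-Python version is only an approximation, and the function name modified_after is hypothetical.

```python
import os
from datetime import datetime


def modified_after(src, date_text):
    """Yield paths (relative to `src`) of files modified after `date_text`.

    `date_text` uses the YYYY-mm-dd format from the -ma option and is
    interpreted as local midnight at the start of that day (an assumption).
    """
    cutoff = datetime.strptime(date_text, "%Y-%m-%d").timestamp()
    for root, _dirs, files in os.walk(src):
        for name in files:
            path = os.path.join(root, name)
            # lstat avoids failing on broken symbolic links.
            if os.lstat(path).st_mtime > cutoff:
                yield os.path.relpath(path, src)
```

The relative paths it yields have the same form as the entries of a file list passed to rsync with --files-from.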
