Skip to main content

Improves file transfer rates when copying files to/from JUNOS/EVO/*nix hosts

Project description

Splitcopy

Improves file transfer rates when copying files to/from JUNOS/EVO/*nix hosts.

At a minimum, sshd must be running on the remote host.
On JUNOS/EVO this requires 'system services ssh' configuration.

If using ftp to copy files then an ftp daemon must be running on the remote host.
On JUNOS this requires 'system services ftp' configuration.
FTP is the default transfer method due to its lower resource usage and its ability to restart transfers.

Script overheads include authentication, sha hash generation/comparison, disk space check, file split and join.
It can be slower than normal ftp/scp for small files as a result.

Because it opens a number of simultaneous connections, if the JUNOS/EVO host has connection/rate limits configured like this:

system {
    services {
        ssh { # or ftp
            connection-limit 10;
            rate-limit 10;
        }
    }
}

or system login retry-options:

system {
    login {
        retry-options {
            <..>
        }
    }
}

The script will deactivate these limits so it can proceed, then rollback these changes upon completion.

Arguments

source Mandatory
target Mandatory
--pwd Optional, password
--scp Optional, use scp instead of ftp to transfer files
--ssh_key Optional, path to private ssh key (only required if not located in ~/.ssh/)
--log Optional, enables additional logging, specify a logging level as argument
--noverify Optional, skips sha1 hash comparison of src and dst file
--split_timeout Optional, time to wait for remote file split operation to complete, default 120s
--ssh_port Optional, ssh port number to connect to
--nocurses Optional, disables the use of a curses window to show per-file progress statistics

The format of source and target arguments match those of the 'scp' cmd.
Both accept either a local path, or a remote path in the format - user@host:path or host@path

To copy from local host to remote host:

splitcopy  <local path> <user>@<host>:<path>

To copy from remote host to local host:

splitcopy  <user>@<host>:<path> <local path>

Supports connecting through jumphosts via 'ProxyCommand' entries in ~/.ssh/config. Example:

Host myserver  
  ProxyCommand ssh myjumphost.mydomain.com -W %h:%p

INSTALLATION

Installation requires Python >= 3.6 and associated pip tool

python3 -m pip install splitcopy

Installing from Git is also supported (OS must have git installed).

To install the latest MASTER code
python3 -m pip install git+https://github.com/Juniper/splitcopy.git
-or-
To install a specific version, branch, tag, etc.
python3 -m pip install git+https://github.com/Juniper/splitcopy.git@<branch,tag,commit>

Upgrading has the same requirements as installation and has the same format with the addition of --upgrade

python3 -m pip install splitcopy --upgrade

Usage Examples

FTP transfer (default method)

$ splitcopy /var/tmp/jselective-update-ppc-J1.1-14.2R5-S3-J1.1.tgz lab@192.168.1.1:/var/tmp/
Password:
checking remote port(s) are open...
using FTP for file transfer
checking remote storage...
sha1 not found, generating sha1...
splitting file...
starting transfer...
100% done
transfer complete
joining files...
deleting remote tmp directory...
generating remote sha hash...
local and remote sha hash match
file has been successfully copied to 192.168.1.1:/var/tmp/jselective-update-ppc-J1.1-14.2R5-S3-J1.1.tgz
data transfer = 0:00:16.831192
total runtime = 0:00:31.520914

SCP transfer

$ splitcopy lab@192.168.1.1/var/log/messages /var/tmp/ --scp  
ssh auth succeeded
checking remote storage...
checking local storage...
sha1 not found, generating sha1...
splitting file...
starting transfer...
100% done
transfer complete
joining files...
deleting remote tmp directory...
generating remote sha hash...
local and remote sha hash match
file has been successfully copied to /var/tmp/messages
data transfer = 0:00:18.768987
total runtime = 0:00:44.891370

Notes on using FTP

FTP is the default transfer method.
The processing of each file chunk is performed by a dedicated thread
Each cpu core is allowed up to 5 threads, with a system max of 32 threads used

Using FTP method will generate the following processes on the remote host:

  • for mgmt session: 1x sshd, 1x cli, 1x mgd, 1x csh
  • for transfers: up to 40x ftpd processes (depends on Python version and number of cpus as described above)

In theory, this could result in the per-user maxproc limit of 64 being exceeded:

May  2 04:46:59   /kernel: maxproc limit exceeded by uid 2001, please see tuning(7) and login.conf(5).

The script modulates the number of chunks to match the number of threads available
The maximum number of user owned processes that could be created is <= 44

Notes on using SCP

The processing of each file chunk is performed by a dedicated thread
Each cpu core is allowed up to 5 threads, with a system max of 32 threads used

Using SCP method will generate the following processes on the remote host:

  • for mgmt session: 1x sshd, 1x cli, 1x mgd, 1x csh
  • for transfers: depends on Python version, number of cpus (see above), OpenSSH and Junos FreeBSD version (see below)

In FreeBSD 11 based Junos each scp transfer creates 2 user owned processes:

lab 30366   0.0  0.0  475056   7688  -  Ss   10:39      0:00.03 cli -c scp -t /var/tmp/
lab 30367   0.0  0.0   61324   4860  -  S    10:39      0:00.01 scp -t /var/tmp/

In FreeBSD 10 based Junos each scp transfer creates 2 user owned processes

lab  28639   0.0  0.0  734108   4004  -  Is   12:00PM     0:00.01 cli -c scp -t /var/tmp/splitcopy_jinstall-11.4R5.5-domestic-signed.tgz/
lab  28640   0.0  0.0   24768   3516  -  S    12:00PM     0:00.01 scp -t /var/tmp/splitcopy_jinstall-11.4R5.5-domestic-signed.tgz/

In FreeBSD 6 based Junos each scp transfer creates 3 user owned processes:

lab  78625  0.0  0.1  2984  2144  ??  Ss    5:29AM   0:00.01 cli -c scp -t /var/tmp/splitcopy_jinstall-11.4R5.5-domestic-signed.tgz/  
lab  78626  0.0  0.0  2252  1556  ??  S     5:29AM   0:00.00 sh -c scp -t /var/tmp/splitcopy_jinstall-11.4R5.5-domestic-signed.tgz/  
lab  78627  0.0  0.1  3500  1908  ??  S     5:29AM   0:00.01 scp -t /var/tmp/splitcopy_jinstall-11.4R5.5-domestic-signed.tgz/  

In addition, if OpenSSH version is >= 7.4, an additional user owned process is created:

lab  2287  2.4  0.1 11912  2348  ??  S     3:49AM   0:00.15 sshd: lab@notty (sshd)

This could result in the per-user maxproc limit of 64 being exceeded:

May  2 04:46:59   /kernel: maxproc limit exceeded by uid 2001, please see tuning(7) and login.conf(5).

To mitigate this, the script modulates the number of chunks to match the maximum number of simultaneous transfers possible (based on OpenSSH, Junos FreeBSD versions and the number of cpu's).
The maximum number of user owned processes that could be created is <= 45

LICENSE

Apache 2.0

CONTRIBUTORS

Juniper Networks is actively contributing to and maintaining this repo. Please contact jnpr-community-netdev@juniper.net for any queries.

Contributors:

Chris Jenn

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

splitcopy-1.5.0.tar.gz (53.9 kB view details)

Uploaded Source

Built Distribution

splitcopy-1.5.0-py3-none-any.whl (58.5 kB view details)

Uploaded Python 3

File details

Details for the file splitcopy-1.5.0.tar.gz.

File metadata

  • Download URL: splitcopy-1.5.0.tar.gz
  • Upload date:
  • Size: 53.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.10.4

File hashes

Hashes for splitcopy-1.5.0.tar.gz
Algorithm Hash digest
SHA256 3aa8f7af3ca909aebd36f84e00e5bca0a737fcb8f800e14d26bb0b48ebd355a8
MD5 27830b411254ec973fe59e90bb9a4ca6
BLAKE2b-256 89f35e42fe71843d54901062ed9bf424a7d267ed782874d1c37806eedf759856

See more details on using hashes here.

File details

Details for the file splitcopy-1.5.0-py3-none-any.whl.

File metadata

  • Download URL: splitcopy-1.5.0-py3-none-any.whl
  • Upload date:
  • Size: 58.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.10.4

File hashes

Hashes for splitcopy-1.5.0-py3-none-any.whl
Algorithm Hash digest
SHA256 1ae47b3e2a8fe35952fcaaf5a0b599ab23f505e3464ce8c4b7e3e97b247bdaa7
MD5 edfb85c47733ea08618b2fd32e07a8b6
BLAKE2b-256 78f01df76e20cd3cdd50f0aa8673eac1aa70a245f5420befe096566c4d5a7ee4

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page