Skip to main content

A CLI to handle PATRIC data from the FTP

Project description

cirtap

A command-line utility to handle PATRIC data from their FTP

Check out the wiki page for more info.

Installation

$ pip install cirtap

Usage

Usage: cirtap [OPTIONS] COMMAND [ARGS]...

  Run `cirtap COMMAND -h` for subcommand help

Options:
  --version   Show the version and exit.
  --help  Show this message and exit.

Commands:
  best     Select best genomes based on stats retrieved from genome_summary
  collect  Create sequence sets based on the installed files
  index    Create an index of contents for all directories
  mirror   Mirror all data from ftp.patricbrc.org in the specified DB_DIR
  pack     Create a gzipped tar archive from a list of genome ids in a file

Quickstart

mirror

  • Start a new mirror of all data in a local path, wiht 8 parallel downloads
$ cirtap mirror -j 8 some/path

  • Resume a failing job
$ cirtap mirror -j 8 -r some/path
  • Archive previous release notes and send notification emails when a mirror job launches or fails to some users
$ cirtap mirror -j 8 --notify user1@example.com,user2@gmail.com --archive-notes some/path

The rest of the commands assume a mirror is set up

index

  • Create a presence/absence index of files installed
$ cirtap index -j 16 path/to/genomes index.tsv

This is useful for selecting genomes based on file presence

collect

  • Collect all proteins for e.g. building a blastp database
$ cirtap collect -t proteins -j 4 -i path/to/index.tsv path/to/genomes all_proteins.fa.gz
  • Collect all 16S SSU sequences
$ cirtap collect -t SSU -j 4 -i path/to/index.tsv path/to/genomes SSU.fa.gz

best

  • Make a selection of best genomes based on completeness, consistency, fine/coarse consistency.
$ cirtap best -i path/to/index.tsv -d path/to/local/patric best_genomes

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cirtap-0.3.1.tar.gz (22.5 kB view details)

Uploaded Source

Built Distribution

cirtap-0.3.1-py2.py3-none-any.whl (23.6 kB view details)

Uploaded Python 2 Python 3

File details

Details for the file cirtap-0.3.1.tar.gz.

File metadata

  • Download URL: cirtap-0.3.1.tar.gz
  • Upload date:
  • Size: 22.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.2 importlib_metadata/4.8.1 pkginfo/1.7.1 requests/2.24.0 requests-toolbelt/0.9.1 tqdm/4.62.2 CPython/3.8.6

File hashes

Hashes for cirtap-0.3.1.tar.gz
Algorithm Hash digest
SHA256 d110c3c52b6ba98ca1cd4cf67ca9bcf667c4f016b399e968f7d753be01ac4dc4
MD5 1eae337bea428a3c5b1e5e0b62a86faf
BLAKE2b-256 472eb4ff803d9698d31e3ea1d8c2920b4f056b3fbb5ad111c01f0512b1d8d706

See more details on using hashes here.

File details

Details for the file cirtap-0.3.1-py2.py3-none-any.whl.

File metadata

  • Download URL: cirtap-0.3.1-py2.py3-none-any.whl
  • Upload date:
  • Size: 23.6 kB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.2 importlib_metadata/4.8.1 pkginfo/1.7.1 requests/2.24.0 requests-toolbelt/0.9.1 tqdm/4.62.2 CPython/3.8.6

File hashes

Hashes for cirtap-0.3.1-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 a280c5ff20ae91be643040fb4bf2afd441faeb4e14b1ff22dfe8d740773b38fd
MD5 34110f18877a4736cc3066896fa9ac0a
BLAKE2b-256 e2b5f0803e949785af2f716f4b88dd7f0db9db2be793cf60f76977a816f4a4e7

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page