Skip to main content

Calculates both scaffold and contig statistics (N50, L50, etc.) from a scaffold FASTA file.

Project description

assembly_stats

A Python library that takes a FASTA file as input and calculates both scaffold and contig statistics (N50, L50, etc.) from a scaffold FASTA file. It does this by breaking each scaffold wherever there is more than one N and then calculating statistics for both the scaffolds and contigs.

This is a re-write of fasta_metadata_parser to speed up the old implementation, and -- most importantly -- to learn how to install Python scripts onto the Smithsonian HPC.

Installation

pip install assembly_stats

Usage

  $ assembly_stats -h

    usage: assembly_stats [-h] filename

    Calculate statistics about genome assemblies.

    positional arguments:
      filename    Genome file in FASTA format.

    optional arguments:
      -h, --help  show this help message and exit

After calculating the statistics for the genome assembly, they will be printed out in JSON format.

Next steps

  • Add ability to save NumPy sequence length arrays for further visualization, since generating these are what takes the most time.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

assembly_stats-0.1.4.tar.gz (3.9 kB view details)

Uploaded Source

Built Distribution

assembly_stats-0.1.4-py3-none-any.whl (5.0 kB view details)

Uploaded Python 3

File details

Details for the file assembly_stats-0.1.4.tar.gz.

File metadata

  • Download URL: assembly_stats-0.1.4.tar.gz
  • Upload date:
  • Size: 3.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.7.4

File hashes

Hashes for assembly_stats-0.1.4.tar.gz
Algorithm Hash digest
SHA256 5ef35e04b5d86acea839b04fcccd2a73e64e559ea31a8c256de8f0dda78e6968
MD5 c809ac3970fbb658b72a449316bd0413
BLAKE2b-256 b3cf1e09948caded07b195f95851c882da4142e08d61404ffb964404eb9ee312

See more details on using hashes here.

File details

Details for the file assembly_stats-0.1.4-py3-none-any.whl.

File metadata

  • Download URL: assembly_stats-0.1.4-py3-none-any.whl
  • Upload date:
  • Size: 5.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.7.4

File hashes

Hashes for assembly_stats-0.1.4-py3-none-any.whl
Algorithm Hash digest
SHA256 77ad05de78ce4a37c69db6e178fec7a11ffc1c3fc3a2a94072e64e11d6e5f0f7
MD5 4a16bc0117698de2d07e5dc1383ab70e
BLAKE2b-256 d7dd7025ead6f8cf24ad0580af1193bcfd7176f7393df207f82be3ffbaed55e0

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page