Skip to main content

Utility package to parse multi fasta files resulting from de novo assembly

Project description

Contig Tools

Installation

pip3 install contig-tools

source code: https://gitlab.com/antunderwood/contig_tools

Usage

usage: contig-tools [-h] [-v] {filter,metrics,check_metrics} ...

A package to maniuplate and assess contigs arising from de novo assemblies


positional arguments:
  {filter,metrics,check_metrics}
                        The following commands are available. Type
                        contig_tools <COMMAND> -h for more help on a specific
                        commands
    filter              Filter contigs based on either length and/or coverage
    metrics             Print contig metrics
    check_metrics       check contig metrics

optional arguments:
  -h, --help            show this help message and exit
  -v, --version         display the version number

Examples

filter contigs

contig-tools filter -l 500 -c 3 -f contigs.fasta

print contig metrics

contig-tools metrics -f contig_tools/tests/test_data/contigs_for_checks.fas
contig-tools metrics -f contig_tools/tests/test_data/contigs_for_checks.fas -o json

check if contigs meet conditions based on conditions enoded in a yaml file

example yaml file

N50 score:
  condition_type: gt
  condition_value: 10
Largest contig:
  condition_type: gt
  condition_value: 15
Total length:
  condition_type: lt_gt
  condition_value:
    - 100
    - 50

example command

contig-tools check_metrics -f contigs.fasta -y conditions.yml

metrics that can be checked are

  • Number of contigs
  • Number of contigs > 500bp
  • Total length
  • %GC
  • Largest contig
  • N50 score

conditions that can be used are

  • gt => greater than
  • lt => less than
  • lt_gt => less than and greater than

check if a two or more loci are co-located

Make a fasta query file with the 2 or more loci you want to see if they are co-located e.g

>gene1
GCAGCTAGCGACTGCGAC.....
>gene2
CTACGTAGGACACGACTA....

There are two options

  1. Search a single genome file for the co-location of loci

    contig-tools co_located -q queries.fas -f /path/to/single/genome/contigs.fas
    

or

  1. Search a list of genomes for the co-location of loci Make a text file with paths to genomes e.g

    /path/to/single/genome1.fas
    /path/to/single/genome1.fas
    ....
    

    and then run the command

    contig-tools co_located -q queries.fas -l /path/to/single/genome_list_file.txt
    

code

Code can be found here

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

contig tools-0.3.5.tar.gz (7.6 kB view details)

Uploaded Source

Built Distribution

contig_tools-0.3.5-py3-none-any.whl (11.1 kB view details)

Uploaded Python 3

File details

Details for the file contig tools-0.3.5.tar.gz.

File metadata

  • Download URL: contig tools-0.3.5.tar.gz
  • Upload date:
  • Size: 7.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.4.2 requests/2.19.1 setuptools/40.4.3 requests-toolbelt/0.8.0 tqdm/4.24.0 CPython/3.7.0

File hashes

Hashes for contig tools-0.3.5.tar.gz
Algorithm Hash digest
SHA256 3b7f68d9f27e6586c402bcc242fed8efd35d93c0956ff4ce91dedd6aa08f28df
MD5 d1df44ab11841ace0e962d0479f8fb13
BLAKE2b-256 60c07c3b8369d74fd79fc3630a879b43c712ae15c09f3d084be7c953cd244b27

See more details on using hashes here.

File details

Details for the file contig_tools-0.3.5-py3-none-any.whl.

File metadata

  • Download URL: contig_tools-0.3.5-py3-none-any.whl
  • Upload date:
  • Size: 11.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.4.2 requests/2.19.1 setuptools/40.4.3 requests-toolbelt/0.8.0 tqdm/4.24.0 CPython/3.7.0

File hashes

Hashes for contig_tools-0.3.5-py3-none-any.whl
Algorithm Hash digest
SHA256 097f155d45a4b153df01d98cb12fb60adfe15284d8bb781524f29a24c1e1bd8d
MD5 9fc3001e4fbefc4d729d3730eb7e529f
BLAKE2b-256 17f23f6b5cff4de8506c880bffb0ab431f5ee15c918aa45f7fdda84cae1e4744

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page