extract_genome_region

Given a CSV file of variable information defining the regions of interest, return a file that contains a fasta-formatted representation of these regions.

These details have not been verified by PyPI

Project links

Homepage

Project description

docs
tests
package

Given a CSV file of variable information defining the regions of interest, return a file that contains a fasta-formatted representation of these regions.

Free software: BSD license

Usage

$ extract_genome_region --help
Usage: extract_genome_region [OPTIONS] REGIONS IN_FASTA OUT_FASTA

  Given a CSV file of variable information defining the regions of interest
  along with input and output fasta file paths, write a file that contains a
  fasta-formatted representation of these regions.

  Structure of the `regions` CSV file:

    record_name   The name you want the seq to have in the new fasta.
       scaffold   The name of the seq record in the source fasta (chromosome, scaffold, contig, etc).
          start   The first bp of the seq feature you want in the new fasta.
           stop   The last bp of the seq feature you want in the new fasta.
       left_bfr   How many "extra" bp with coords smaller than `start` you want (0 for none).
      right_bfr   How many "extra" bp with coords larger than `stop` you want (0 for none).

  Naming Strategies:

            csv   use only the contents of the `record_name` field in the csv file (>CPR23).
      seq_range   use only the `scaffold` name and sequence range (>scaffold1:230-679).
  csv_seq_range   use both the contents of the `record_name` field and the `scaffold`.
                  name and sequence range (>CPR23 scaffold1:230-679).

Options:
  -n, --naming [csv|seq_range|csv_seq_range]
                                  Options regarding how each new fasta record
                                  will be named. See main help-text for
                                  explainations of options. [default='csv']
  --help                          Show this message and exit.

Installation

$ conda install -c bioconda -c gusdunn extract_genome_region

$ pip install extract_genome_region

Documentation

https://extract-genome-region.readthedocs.org/

Development

To run the all tests run:

tox

Changelog

0.0.3 (2016-05-24)

altered meaning of ‘csv_seq_range’ to allow splitting fasta key on whitespace to ignore scaffold:range info

0.0.2 (2016-05-24)

expanded the help text
built the docs

0.0.1 (2016-05-23)

First release on anaconda.org/gusdunn.

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

This version

0.0.3

May 24, 2016

0.0.2

May 24, 2016

0.0.1

May 24, 2016

extract_genome_region 0.0.3

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Usage

Installation

Documentation

Development

Changelog

0.0.3 (2016-05-24)

0.0.2 (2016-05-24)

0.0.1 (2016-05-23)

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed