Skip to main content

Command line scripts for the oldowan.mitomotifs package.

Project description

oldowan.mitomotifs-cmdline provides the seq2sites and sites2seq command line scripts to covert human mitochondrial DNA sequences to lists of variant sites relative to the revised Cambridge Reference Seqeunce and vice versa. Further information on the rCRS and variant site nomenclature for human mtDNA sequences is available at the MitoMotifs website.

This package is only the command line extension of the core oldwan.mitomotifs module. If you want just the Python libraries and not the command line tools, you should go directly to that package.

Installation Instructions

This package is pure Python and has only pure Python dependencies (oldowan.mitomotifs and oldowan.fasta) outside of the standard library. All of these dependencies will be automatically installed if you use the setuptools easy_install tool. This usually goes something like this:

$ easy_install oldowan.mitomotifs-cmdline

or on a unix-like system, assuming you are installing to the main Python site-packages directory as a non-privileged user, this:

$ sudo easy_install oldowan.mitomotifs-cmdline

You may also use the standard python distutils setup method. You will have to download ind install oldowan.fasta and oldowan.mitomotifs first. Then, download the current source archive from the file list towards the bottom of this page, unarchive it, and install. On Mac OS X and many other unix-like systems, having downloaded the archive and changed to the directory containing this archive in your shell, this might go something like:

$ tar xvzf oldowan.mitomotifs-cmdline*
$ cd oldowan.mitomotifs-cmdline*
$ python install

Quick Start

Convert sequences to sites:

$ seq2sites sequences.fasta

Convert sequences to sites, saving the output to a file:

$ seq2sites sequences.fasta > sites.txt

Sequences must be contiguous! Separate runs of sequence, such as HVR1 and HVR2 without the intervening sequence interval, must be analyzed separately.

There is also a cutoff on the number of ambigous sites (N) allowed in the sequence. By default, this is 10 - but this is an option that can be set:

$ seq2sites --ambig-cutoff=20 sequences.fasta

Convert a list of variable sites to sequence. The input file should be a comma-seprated-values list with one entry per line, with name, N, and sites. Sites should be separated by whitespace:

$ cat hvr1_sites.txt
Seq2,1,16129A 16223T
$ sites2seq hvr1_sites.txt

The default range of sequence returned is HVR1, defined as positions 16023 to 16365 on the rCRS. All predefined ranges are:

  • HVR1: 16024-16365
  • HVR2: 73-340
  • HVR1to2: 16024-340
  • coding: 577-15992
  • all: 1-16559

So, to convert a list of HVR2 sites to sequence:

$ sites2seq --region=hvr2 hvr2_sites.txt

An arbitrary range may be selected. If the stop value is smaller than the start, it is assumed that the range runs through the origin:

$ sites2seq --begin=16024 --end=340 dloop_sites.txt

The rCRS sequence will can be selected with ‘rCRS’ as the sites value:

$ cat sites.txt
Seq2,1,16129A 16223T

Release History ===============

1.0.0 (August 16, 2008)
initial release of module.

Project details

Release history Release notifications

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for oldowan.mitomotifs-cmdline, version 1.0.0
Filename, size File type Python version Upload date Hashes
Filename, size oldowan.mitomotifs_cmdline-1.0.0-py2.5.egg (12.9 kB) File type Egg Python version 2.5 Upload date Hashes View hashes
Filename, size oldowan.mitomotifs-cmdline-1.0.0.tar.gz (4.4 kB) File type Source Python version None Upload date Hashes View hashes

Supported by

Elastic Elastic Search Pingdom Pingdom Monitoring Google Google BigQuery Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN DigiCert DigiCert EV certificate StatusPage StatusPage Status page