A python for plotting GC-skew from DNA sequences.
Project description
gcskewer
create GC skew plots from DNA sequences in python
Installation
The easiest way to install gcskewer is though the python package index.
pip install gcskewer
This will fetch and install the latest version from: LINK
You can also install gcskewer
by cloning this repository.
gcskewer
requires Bio
,matplotlib
and plotly
. They should be installed automatically.
Usage
Input
gcskewer
can take DNA sequences in .fasta or .gbk format. You can specify with -f
/--fasta
or -g
/--gbk
. You can't do both at the same time - only define you sequence one! For example:
gcskewer -g example.gbk
or
gcskewer -f example.fasta
Output
gcskewer
has three output formats: .csv (a comma seperated table of the results), .svg (an editable vector format graph) and .html (an interactive graph of the results). You can specify which outputs you want with -c
/--csv
, -s
/--svg
and -p
/--plot
(for the .html). If you are unsure, you can just specify all three:
gcskewer -g example.gbk -c -s -p
Window and Step Size
gcskewer
will automatically decide the window and step size for the analysis, however you can set these values yourself. For best results, I recommend using a step size that will result in around 1,000 steps. E.g. for a sequence of 50 kb use a step size of 50. Ensure that the window size is at least the same size as the step. You can set the window and step size with -ws
/--window-size
and -ss
/--step-size
, respectively. For example:
gcskewer -g example.gbk -ss 50 -ws 500
Example Data
Example data and output is provided in the example_data
directory in this repository. There are two subdirectories fasta
and genbank
to illustrate how gcskewer
operates on different input types. Each directry contains the .csv, .svg and .html output and the command used to generate then data is stored as command.bash
.
This script was origionally inspired by Nivina et al.'s paper: GRINS: Genetic elements that recode assembly-line polyketide synthases and accelerate their diversification. As such, I used the polyketide synthase tylactone as a test case. The sequence was obtained from MiBiG.
Versions
- 1.0.0
- initial release
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file gcskewer-1.0.0.tar.gz
.
File metadata
- Download URL: gcskewer-1.0.0.tar.gz
- Upload date:
- Size: 18.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.10.12
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 700884285fe573447dd4251e38763618ffb400faadd230177657fa20cebe4b36 |
|
MD5 | 41bce68a0f59dbc247eb7d4cfd4d0a3a |
|
BLAKE2b-256 | 4f90ccc3bc8e22c8bdafc3d7849707f25a677e584bb44d72cdb8651acf2c3cd6 |
File details
Details for the file gcskewer-1.0.0-py3-none-any.whl
.
File metadata
- Download URL: gcskewer-1.0.0-py3-none-any.whl
- Upload date:
- Size: 19.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.10.12
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 082f1a559014e1b019553a78ac79755dacb76d94e5df43f0e5270838f09cc7ad |
|
MD5 | 10008363b9db861a1de13cf9ede5d832 |
|
BLAKE2b-256 | b4abe9e9572e0ed60221ec3e1c42b02405ddd0f50782fdb4665ae7a801fad0f3 |