Plot variants on the human mitochondrial genome.
Project description
mitoviz
Plot variants on the human mitochondrial genome.
Free software: MIT license
Documentation: https://mitoviz.readthedocs.io
GitHub repo: https://github.com/robertopreste/mitoviz
Features
mitoviz is a simple python package to plot human mitochondrial variants on a graphical representation of the human mitochondrial genome. It currently supports plotting variants stored in VCF and tabular files, as well as from general pandas dataframes when importing mitoviz in Python.
Variants are shown according to their heteroplasmic fraction (HF), plotting variants with HF = 1.0 on the outer border of the mitochondrial circle, those with HF = 0.0 on the inner border and all the others in between, according to their actual HF value.
If the HF information is not available, variants will all be shown in the middle of the mitochondrial circle.
A linear representation of the mitochondrial genome can also be plotted; in this case, variants are shown using a lollipop plot style, with the height of the marker reflecting their HF.
Variants with no HF information will be shown as if their HF was 0.5.
Usage
mitoviz can be used both from the command line and as a python module.
Command Line
Given a VCF file with human mitochondrial variants (sample.vcf), plotting them is fairly simple:
$ mitoviz sample.vcf
An image named mitoviz.png will be created in the current directory; if you want to provide a specific filename where the plot will be saved, just add the --output option with the desired path:
$ mitoviz sample.vcf --output my_mt_plot.png
Linear plots can be created using the --linear option:
$ mitoviz sample.vcf --linear
It is also possible to plot variants stored in a tabular file, such as CSV or TSV formats; mitoviz will automatically recognise them, treating the file as comma-separated by default. If a different separator is used (as in the case of TSV files), just specify it with the --sep option:
$ mitoviz sample.tsv --sep "\t"
Python Module
Import mitoviz and use its plot_vcf function to use it in your own script:
from mitoviz import plot_vcf
my_plot = plot_vcf("sample.vcf")
In this case, no plot will be shown until a call to plt.show() is made. It is possible to save the resulting plot using the save option and to provide a specific file where the plot will be saved using the output option:
plot_vcf("sample.vcf", save=True, output="my_mt_plot.png")
By default, a polar plot is returned; linear plots are easily created using the linear option:
plot_vcf("sample.vcf", save=True, linear=True)
A similar function to plot variants contained in a pandas DataFrame is available as plot_df. Supposing you have a pandas DataFrame with human mitochondrial variants named variants_df, it is possible to plot them as follows:
from mitoviz import plot_df
plot_df(variants_df)
Variants stored in tabular files can be plotted using plot_table, which accepts the same options available for plot_vcf and plot_df, with the addition of sep, which is used to specify the column separator. By default, the comma is used as column delimiter:
from mitoviz import plot_table
# plotting a CSV file
plot_table("sample.csv")
# plotting a TSV (tab-separated) file
plot_table("sample.tsv", sep="\t")
plot_table also accept additional keyword options, which will be passed to pandas.read_table when processing the given input file:
plot_table("sample.tsv", sep="\t", comment="#", skiprows=0)
Please refer to the Usage section of the documentation for further information.
Installation
PLEASE NOTE: HmtNote only supports Python >= 3.6!
The preferred installation method for mitoviz is using pip:
$ pip install mitoviz
Please refer to the Installation section of the documentation for further information.
Credits
This package was created with Cookiecutter and the cc-pypackage project template.
History
0.1.0 (2019-12-27)
First release.
0.2.0 (2019-12-29)
Add functionality to plot multiple samples.
0.2.1 (2020-01-06)
Add legend to plots and update colors.
0.2.2 (2020-01-08)
Add option to plot variant labels.
0.2.3 (2020-01-11)
Make legend plotting optional.
0.3.0 (2020-01-15)
Add plot_df function to plot variants from a pandas DataFrame.
0.4.0 (2020-01-26)
Add plot_table function to plot variants from tabular files;
add CLI functionality to plot variants from tabular files;
refactor code.
0.4.1 (2020-02-13)
Refactor to use abstract classes;
Rename internal classes to _PolarLocus and _PolarVariant.
0.4.2 (2020-02-14)
Fix bug with non coding loci not being shown in plots.
0.5.0 (2020-02-19)
Add split option to plot split strands on polar plots.
0.6.0 (2020-02-29)
_PolarVariant is deprecated and replaced by _Variant;
Add linear option to create linear plots.
0.6.1 (2020-03-02)
Refactor and clean code;
Add CI module for internal management.
0.6.2 (2020-03-03)
Fix borders on linear plots.
0.6.3 (2020-03-04)
Fix stemlines on split linear plots.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for mitoviz-0.6.3-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | caad81cf913d35f97fbba424703cf8535fab90f9cc160f592694dcc38d344552 |
|
MD5 | eef9f72af33a86a8fd44ae458407dffd |
|
BLAKE2b-256 | 8baa09427baa6968f1ebb8d5f5b84fc0e72b673adac93c3fc091d32eb5347539 |