A simple way to visualize features on human chromosome ideograms
tagore
tagore is a simple way to visualize features on human chromosome ideograms, as shown in this article: https://www.nature.com/articles/srep12376
tagore was designed to allow everyone to create 23andMe-style chromosome painting diagrams.
Installation
tagore is a simple Python script that uses the RSVG library and has no other dependencies.
pip install tagore
tagore --version
# tagore (version 1.1.2)
Requirements
Quick start
The demo data consists of Catalogue of Somatic Mutations in Cancer (COSMIC) Cancer Gene Census genes and 100 randomly simulated mutations. Points represent single nucleotide variants (i.e. variants present in fewer than 3 samples); triangles represent single nucleotide polymorphisms (i.e. variants found in many samples); and short lines (drawn on a single chromosome copy) represent known INDEL sites.
tagore --input example_ideogram/test.bed --prefix example_ideogram/example -vf
Usage
usage: tagore [-h] [--version] -i <input.bed> [-p [output file prefix]] [-b [hg37/hg38]] [-f] [-v]
tagore: a utility for illustrating human chromosomes https://github.com/jordanlab/tagore
optional arguments:
-h, --help show this help message and exit
--version Print the software version
-i <input.bed>, --input <input.bed> Input BED-like file
-p [output file prefix], --prefix [output file prefix] Output prefix [Default: "out"]
-b [hg37/hg38], --build [hg37/hg38] Human genome build to use [Default: hg38]
-f, --force Overwrite output files if they exist already
-v, --verbose Display verbose output
The input file uses a BED-like format, described below. If an output prefix is not specified, the script uses "out" as the default prefix.
Helper scripts for converting RFMix and ADMIXTURE outputs are included in the scripts/ folder.
A more complete example of a full chromosome painting using an RFMix output can be seen by running:
rfmix2tagore --chr1 example_ideogram/1KGP-MXL104_chr1.bed \
--chr2 example_ideogram/1KGP-MXL104_chr2.bed \
--out example_ideogram/1KGP-MXL104_tagore.bed
tagore --input example_ideogram/1KGP-MXL104_tagore.bed \
--prefix example_ideogram/1KGP-MXL104 \
--build hg37 \
--verbose
Input file description
#chr start stop feature size color chrCopy
chr1 10000000 20000000 0 1 #FF0000 1
chr2 20000000 30000000 0 1 #FF0000 2
chr2 40000000 50000000 0 0.5 #FF0000 1
Each column is explained below:
- chr - The chromosome on which a feature has to be drawn
- start - Start position (in bp) for feature
- stop - Stop position (in bp) for feature
- feature - The shape of the feature to be drawn
- 0 will draw a rectangle
- 1 will draw a circle
- 2 will draw a triangle pointing to the genomic location
- 3 will draw a line at that genomic location
- size - The horizontal size of the feature. Should range between 0 and 1.
- color - Specify the color of the genomic feature with a hex value (#FF0000 for red, etc.)
- chrCopy - Specify the chromosome copy on which the feature should be drawn (1 or 2). To draw the same feature on both chromosomes, you must specify the feature twice
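The column rules above can be checked before a file is handed to tagore. The following is a minimal sketch, not part of tagore itself: the helper names make_row and write_features are hypothetical, the shape names are just labels for the feature codes 0 to 3, and tab-separated output is an assumption based on common BED conventions.

```python
import csv
import io

# Feature codes as listed in the column description above.
SHAPES = {"rectangle": 0, "circle": 1, "triangle": 2, "line": 3}


def make_row(chrom, start, stop, shape, size, color, chr_copy):
    """Validate one feature and return it as a list of column values."""
    if shape not in SHAPES:
        raise ValueError(f"unknown shape: {shape!r}")
    if not 0 <= start <= stop:
        raise ValueError("expected 0 <= start <= stop")
    if not 0 <= size <= 1:
        raise ValueError("size should range between 0 and 1")
    if chr_copy not in (1, 2):
        raise ValueError("chrCopy must be 1 or 2")
    return [chrom, start, stop, SHAPES[shape], size, color, chr_copy]


def write_features(handle, rows):
    """Write a header line plus rows (tab delimiter is an assumption)."""
    writer = csv.writer(handle, delimiter="\t", lineterminator="\n")
    writer.writerow(["#chr", "start", "stop", "feature", "size", "color", "chrCopy"])
    writer.writerows(rows)


rows = [
    make_row("chr1", 10_000_000, 20_000_000, "rectangle", 1, "#FF0000", 1),
    # The same feature on both copies of chr2 needs one row per copy.
    make_row("chr2", 20_000_000, 30_000_000, "circle", 0.5, "#0000FF", 1),
    make_row("chr2", 20_000_000, 30_000_000, "circle", 0.5, "#0000FF", 2),
]

buffer = io.StringIO()
write_features(buffer, rows)
print(buffer.getvalue(), end="")
```

To produce a real input file, pass an open file handle to write_features instead of the in-memory buffer, then run tagore with --input pointing at that file.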
Etymology
Tagore (/tæˈgɔːr/) was a prolific songwriter, artist, and influential poet of 19th and 20th century India. Notably, Tagore spoke out against racial prejudice and espoused the principle of respect for all people, regardless of ancestry or ethnic background.