A useful module for any CompBio
Project description
genepy
what is genepy?
A set of awesome functions & tools for Computational Geneticists
Content
- utils: where a bunch of helper functions and usefull general scripts are stored
- plots: a set of plotting tools based on matplotlib and bokeh to make volcano plots / CNV maps etc..
- helper: and additional helper functions to save data, do merging of dataframes...
- terra: contains a set of functions that uses dalmatian to interact with the GCP powered genomics HPC platform: Terra.
- sequencing: contains a set of function to works with bed/bam/fastqs...
- rna: contains function to work with RNAseq (and related) data.
- mutations: a set of functions to work with maf files, vcf files etc..
- google: functions and packages linked to google's apis
- google_sheet: function to upload a df as a google sheet
- gcp: sets of functions to interact with google storage (relies on
gsutil
)
- epigenetics: where we have things related to epigenomics
- chipseq: has functions to read, merge, denoise, ChIP seq data.
- plot: has functions to plot ChIP seq data.
Helper tools
tools that you do not need to use directly as they have binding functions in genepy.
- epigenetics/rose:: where an updated version of the rose algorithm is stored (as a git submodule)
- cell_line_mapping-master/python/cell_line_mapper: a set of functions to map cell line ids to other cell line ids based on an up to date google spreadsheet.
Install
with pip
pip install broad-genepy
and then use with from genepy.utils/epigenetics/... import ...
dev mode
git clone git://github.com/BroadInstitute/genepy.git
pip install -e genepy
then you can import files in python with e.g:
from genepy import terra
from genepy.utils import helper as h
from genepy.google import gcp
from genepy.utils import plot
from genepy.epigenetics import chipseq
data:
hg38 genome sizes: from https://github.com/igvteam/igv/blob/master/genomes/sizes/hg38.chrom.sizes
installation: to get access to all bindings
Install the following tools:
- htslib/samtools
- bwa just used once:
- bowtie2
Finaly you can install R packages (GSEABase, erccdashboard, GSVA, DESeq2):
R -e 'if(!requireNamespace("BiocManager", quietly = TRUE)){install.packages("BiocManager")};BiocManager::install(c("GSEABase", "erccdashboard", "GSVA", "DESeq2"));'
About
please do contribute, we do not have time to fix all issues or work on feature requests
Jeremie Kalfon jkalfon@broadinstitute.org jkobject@gmail.com https://jkobject.com
Javad Noorbakhsh jnoorbak@broadinstitute.org
Apache license 2.0.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distributions
Hashes for Broad_genepy-1.1.0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | cbf18862df287eb571e419606fdff1ee28c31f38703158adca1129592e70266b |
|
MD5 | 661242c1114c4577db03ca54074688a3 |
|
BLAKE2b-256 | 75877f999a5cf9d0f957a0fe5cf6acb0a3bba37be7c5080f27054b580c594a3d |
Hashes for Broad_genepy-1.1-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | cc93599934523d6c2da1a4e7d4b5e9f511b06b6946413cdc3ad60005e6bb6155 |
|
MD5 | d84412481611f783dfbf905dae4967ad |
|
BLAKE2b-256 | d4110e4d08fd9539b4a3bcd14099642971a6b09ec157c36295bfb838fca5eceb |