A useful module for any CompBio
Project description
genepy
what is genepy?
A set of awesome functions & tools for Computational Geneticists
Content
- utils: where a bunch of helper functions and useful general scripts are stored
  - plots: a set of plotting tools based on matplotlib and bokeh to make volcano plots, CNV maps, etc.
  - helper: additional helper functions to save data, merge dataframes, etc.
- terra: contains a set of functions that use dalmatian to interact with Terra, the GCP-powered genomics HPC platform.
- sequencing: contains a set of functions to work with bed/bam/fastq files.
- rna: contains functions to work with RNAseq (and related) data.
- mutations: a set of functions to work with maf files, vcf files, etc.
- google: functions and packages linked to Google's APIs
  - google_sheet: function to upload a dataframe as a Google sheet
  - gcp: a set of functions to interact with Google Storage (relies on gsutil)
- epigenetics: where we have things related to epigenomics
  - chipseq: has functions to read, merge, and denoise ChIP-seq data.
  - plot: has functions to plot ChIP-seq data.
Helper tools
Tools that you do not need to use directly, as they have binding functions in genepy.
- epigenetics/rose: where an updated version of the ROSE algorithm is stored (as a git submodule)
- cell_line_mapping-master/python/cell_line_mapper: a set of functions to map cell line IDs to other cell line IDs based on an up-to-date Google spreadsheet.
Install
with pip
pip install broad-genepy
and then use it with from genepy.utils import ..., from genepy.epigenetics import ..., etc.
dev mode
git clone https://github.com/BroadInstitute/genepy.git
pip install -e genepy
then you can import modules in Python with, e.g.:
from genepy import terra
from genepy.utils import helper as h
from genepy.google import gcp
from genepy.utils import plot
from genepy.epigenetics import chipseq
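To confirm that either install route worked before running the imports above, a quick check along these lines may help. This is only a sketch: is_installed is a hypothetical helper written for this example, not part of genepy, and it relies solely on the standard library.

```python
import importlib.util


def is_installed(package: str) -> bool:
    """Return True if a top-level package can be found on the Python path."""
    # find_spec returns None when the top-level package is not installed.
    return importlib.util.find_spec(package) is not None


# "genepy" is the import name used in the examples above.
print("genepy importable:", is_installed("genepy"))
```

If this prints False after a pip install, the package was most likely installed into a different Python environment than the one running the check.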
data:
hg38 genome sizes: from https://github.com/igvteam/igv/blob/master/genomes/sizes/hg38.chrom.sizes
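A chrom.sizes file is plain text with one chromosome per line: a name, then its length in base pairs. As a rough illustration of how such a file can be read, the sketch below parses that layout into a dictionary. parse_chrom_sizes is a hypothetical helper for this example and not a genepy function; the two sample lines are the GRCh38 lengths of chr1 and chr2.

```python
def parse_chrom_sizes(text: str) -> dict[str, int]:
    """Parse chrom.sizes content (name, whitespace, length) into a dict."""
    sizes = {}
    for line in text.splitlines():
        line = line.strip()
        # Skip blank lines and comment lines.
        if not line or line.startswith("#"):
            continue
        name, length = line.split()[:2]
        sizes[name] = int(length)
    return sizes


# Two lines in the same layout as hg38.chrom.sizes.
example = "chr1\t248956422\nchr2\t242193529\n"
sizes = parse_chrom_sizes(example)
print(sizes["chr1"])
```

For a file on disk, the same function can be applied to the result of open(path).read().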
installation: to get access to all bindings
Install the following tools:
- htslib/samtools
- bwa (only used once)
- bowtie2
Finally, you can install the R packages (GSEABase, erccdashboard, GSVA, DESeq2):
R -e 'if(!requireNamespace("BiocManager", quietly = TRUE)){install.packages("BiocManager")};BiocManager::install(c("GSEABase", "erccdashboard", "GSVA", "DESeq2"));'
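Since the bindings call external programs, it can be useful to check that those programs are on the PATH. The following is a minimal sketch under stated assumptions: missing_tools is a hypothetical helper, not part of genepy, and the executable names (samtools, bwa, bowtie2, Rscript) are assumed to match how the tools are installed on your system.

```python
import shutil


def missing_tools(tools):
    """Return the tools from `tools` that are not found on the PATH."""
    return [tool for tool in tools if shutil.which(tool) is None]


# Executable names are assumptions; adjust them to your installation.
required = ["samtools", "bwa", "bowtie2", "Rscript"]
missing = missing_tools(required)
if missing:
    print("Not found on PATH:", ", ".join(missing))
else:
    print("All external tools found.")
```

This only checks that the executables exist; it does not verify versions or that the R packages themselves are installed.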
About
Please do contribute; we do not have time to fix all issues or work on feature requests.
Jeremie Kalfon jkalfon@broadinstitute.org jkobject@gmail.com https://jkobject.com
Javad Noorbakhsh jnoorbak@broadinstitute.org
Apache license 2.0.
Download files
Source Distribution
Built Distribution
File details
Details for the file Broad-genepy-1.2.6.tar.gz.
File metadata
- Download URL: Broad-genepy-1.2.6.tar.gz
- Upload date:
- Size: 932.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.10.6
File hashes
Algorithm | Hash digest
---|---
SHA256 | eb9b2db586f72e623eb29fc4b9d09f0e714c470290dc96eee8ec4a10c00094e8
MD5 | 3d831018594fa3a05c12a09af4da285e
BLAKE2b-256 | 60d7e407bf4a6170569204dad6995211a31262d47e618ec02fa8a285e5a40de9
File details
Details for the file Broad_genepy-1.2.6-py3-none-any.whl.
File metadata
- Download URL: Broad_genepy-1.2.6-py3-none-any.whl
- Upload date:
- Size: 90.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.10.6
File hashes
Algorithm | Hash digest
---|---
SHA256 | 4ec81052745baa400f5e0893a6f7833567322d2c7c2efd19e9a5193d8d03357f
MD5 | ebdb8b8eb7964423865aff945bea8d93
BLAKE2b-256 | dac1173e562cad5c7f8fd444276fc893d0989f09053600f03d617abbf4123101