Analysis of allele-specific methylation in bisulfite DNA sequencing.
Project description
pyllelic
⭐ the project to show your appreciation. :arrow_upper_right:
pyllelic: a tool for detection of allelic-specific methylation variation in bisulfite DNA sequencing files.
Pyllelic documention is available at https://paradoxdruid.github.io/pyllelic/ and see pyllelic_notebook.ipynb
for an interactive demonstration.
Dependencies and Installation
Using Conda (preferred)
Create a new conda environment using python 3.8:
Easiest:
# Get environment.yml file from this repo
curl -L https://github.com/Paradoxdruid/pyllelic/blob/master/environment.yml?raw=true > env.yml
# Create and activate conda environment
conda env create --file=env.yml
conda activate pyllelic
or more explictly step by step instructions
conda create --name pyllelic python=3.8
conda activate pyllelic
conda config --env --add channels conda-forge
conda config --env --add channels bioconda
conda config --env --add channels paradoxdruid
conda install pyllelic
# Optional but usual use case:
conda install notebook jupyter_contrib_nbextensions ipywidgets
Docker container
docker pull ghcr.io/paradoxdruid/pyllelic:latest
PyPi installation
PyPi instructions
This will require independent installation of samtools, bowtie2, and bismark packages.
# PyPi
python3 -m pip install pyllelic
# or Github
python3 -m pip install git+https://github.com/Paradoxdruid/pyllelic.git
Example exploratory use in jupyter notebook
from pyllelic import pyllelic
config = pyllelic.configure( # Specify file and directory locations
base_path="/Users/abonham/documents/test_allelic/",
prom_file="TERT-promoter-genomic-sequence.txt",
prom_start="1293200",
prom_end="1296000",
chrom="5",
offset=1298163,
)
files_set = pyllelic.make_list_of_bam_files(config) # finds bam files
# Run pyllelic; make take some time depending on number of bam files
data = pyllelic.GenomicPositionData(config=config, files_set=files_set)
positions = data.positions
cell_types = data.cell_types
means_df = data.means # mean methylation of reads
modes_df = data.modes # mode methylation of reads
diff_df = data.diffs # difference mean - mode of reads
individual_data = data.individual_data # read methylation values
data.save("output.xlsx") # save methylation results
data.save_pickle("my_run.pickle") # save data object for later analysis
data.write_means_modes_diffs(filename="Run1_") # write output data files
data.histogram("CELL_LINE", "POSITION") # visualize data for a point
data.heatmap(min_values=1) # methylation level heatmap
data.quma_results["CELL_LINE"] # see summary data for a cell line
Authors
This software is developed as academic software by Dr. Andrew J. Bonham at the Metropolitan State University of Denver. It is licensed under the GPL v3.0.
This software incorporates implementation from QUMA, licensed under the GPL v3.0.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file pyllelic-0.2.8.tar.gz
.
File metadata
- Download URL: pyllelic-0.2.8.tar.gz
- Upload date:
- Size: 42.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.1 importlib_metadata/4.8.2 pkginfo/1.7.0 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.7.6
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | fe4ae81e0ee5013beadb16e1a978dca6736711708fbf3c69d2cd796fd7c4c84f |
|
MD5 | 712c5368b8a8c69ba79bdcd675d126d3 |
|
BLAKE2b-256 | e871f96d2db1e222a1b5f8ee32e283b1eda3830ab78e77a441f7150a3886f7f3 |
File details
Details for the file pyllelic-0.2.8-py3-none-any.whl
.
File metadata
- Download URL: pyllelic-0.2.8-py3-none-any.whl
- Upload date:
- Size: 31.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.1 importlib_metadata/4.8.2 pkginfo/1.7.0 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.7.6
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 1102178636bb34f27b878d06fd79901bf3d2734c2333197848889134a3881ad9 |
|
MD5 | a32f14fd19160cbec7d1f4086a80706c |
|
BLAKE2b-256 | 2bc18e32915180116792db87f58bcc85ec75e5c525a95608b5c1d5ead43babdf |