A genome browser in your Jupyter notebook
Project description
Getting Started
Documentation
https://dbikard.github.io/genomenotebook/
Install
pip install genomenotebook
How to use
Create a simple genome browser with a search bar. The sequence appears when zooming in.
#Using the example E. coli genome data from the package
import genomenotebook as gn
import os
from bokeh.io import output_notebook
output_notebook()
data_path = gn.get_example_data_dir()
genome_path = os.path.join(data_path, "MG1655_U00096.fasta")
gff_path = os.path.join(data_path, "MG1655_U00096.gff3")
g=gn.GenomeBrowser(genome_path=genome_path, gff_path=gff_path, init_pos=10000)
g.show()
<style>
.bk-notebook-logo {
display: block;
width: 20px;
height: 20px;
background-image: url();
}
</style>
<div>
<a href="https://bokeh.org" target="_blank" class="bk-notebook-logo"></a>
<span id="p175635">Loading BokehJS ...</span>
</div>
Unable to display output for mime type(s): application/javascript, application/vnd.bokehjs_load.v0+json
Unable to display output for mime type(s): application/javascript, application/vnd.bokehjs_exec.v0+json
Adding a track with random points as a demonstration. genomeNotebook
uses the Bokeh library and track.fig
is a simple Bokeh figure on which
you can plot anything you want using Bokeh.
import numpy as np
track = g.add_track()
x= np.arange(0,100000,100)
y= np.random.randint(0,10,size=x.shape)
track.fig.scatter(x=x,y=y)
g.show()
Unable to display output for mime type(s): application/javascript, application/vnd.bokehjs_exec.v0+json
Plotting some ChIP-seq data
Track objects also have a few custom plotting functions: line, scatter and bar.
A pandas DataFrame is passed as the source of the data as follow.
import pyBigWig
import pandas as pd
g=gn.GenomeBrowser(genome_path=genome_path,
gff_path=gff_path,
bounds=(0,100000),
search=False,
show_seq=False)
#Importing some coverage data from a BigWig file
bw_file_path=os.path.join(data_path,"ChIP-ACCCA-1.bw")
refname='NC_000913'
with pyBigWig.open(bw_file_path) as bw:
cov=bw.values(refname,0,g.genome_size,numpy=True)
source=pd.DataFrame({"pos": np.arange(0,g.genome_size,10),
"cov": cov[::10]})
track=g.add_track()
track.line(source,pos="pos",y="cov")
g.show()
Unable to display output for mime type(s): application/javascript, application/vnd.bokehjs_exec.v0+json
Plotting some CRISPR screening data
#Opening the Cui 2018 CRISPRi screen data
cui2018data="https://gitlab.pasteur.fr/dbikard/badSeed_public/raw/master/screen_data.csv"
cui2018data=pd.read_csv(cui2018data,index_col=0)
cui2018data.head()
<style scoped>
.dataframe tbody tr th:only-of-type {
vertical-align: middle;
}
.dataframe tbody tr th {
vertical-align: top;
}
.dataframe thead th {
text-align: right;
}
</style>
gene | essential | pos | ori | coding | fit18 | fit75 | ntargets | seq | |
---|---|---|---|---|---|---|---|---|---|
guide | |||||||||
AAAAAACCTGCTGGTGAGGC | NaN | NaN | 2202483 | - | NaN | -4.850012 | -1.437546 | 1 | AAAGCAGATCACAGTAAATAAAAAAACCTGCTGGTGAGGCAGGTTC... |
AAAAAACGTATTCGCTTGCA | curA | False | 1517891 | + | False | -0.094026 | -0.100313 | 1 | TGTTGATGGCTACAGTGCTGAAAAAACGTATTCGCTTGCAAGGTTT... |
AAAAAAGCGCACTTTTTGAC | NaN | NaN | 1919717 | + | NaN | -1.109310 | -0.246740 | 1 | GTAACGCCTGACAGCGCACAAAAAAAGCGCACTTTTTGACTGGCAC... |
AAAAAAGCGGTGACTTACGA | bglA | False | 3042929 | + | False | -1.328831 | -0.905068 | 1 | GCGCCCATATCGAAGAGATGAAAAAAGCGGTGACTTACGATGGCGT... |
AAAAAATCTGCCCGTGTCGT | gyrA | True | 2337231 | - | False | -0.840373 | -0.598858 | 1 | ATGACTGGAACAAAGCCTATAAAAAATCTGCCCGTGTCGTTGGTGA... |
g=gn.GenomeBrowser(genome_path=genome_path,
gff_path=gff_path,
bounds=(0,1000000),
search=False,
show_seq=False)
track=g.add_track(height=100)
track.scatter(source=cui2018data,pos="pos",y="fit75",factors="ori")
track2=g.add_track(height=100)
track2.scatter(source=cui2018data,pos="pos",y="fit18",factors="ori")
g.show()
Unable to display output for mime type(s): application/javascript, application/vnd.bokehjs_exec.v0+json
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
genomenotebook-0.4.1.tar.gz
(18.9 MB
view details)
Built Distribution
File details
Details for the file genomenotebook-0.4.1.tar.gz
.
File metadata
- Download URL: genomenotebook-0.4.1.tar.gz
- Upload date:
- Size: 18.9 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.13
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 8f120597b814818e0595c95692657652e62efec93303213f471f08c106033246 |
|
MD5 | 55019fe4f4c11e5e4fe22975680c3440 |
|
BLAKE2b-256 | 22b2e310c158a94279545ffd9b0f43e42b27ef1af60312aae17a2afaccdaf756 |
File details
Details for the file genomenotebook-0.4.1-py3-none-any.whl
.
File metadata
- Download URL: genomenotebook-0.4.1-py3-none-any.whl
- Upload date:
- Size: 19.1 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.13
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 4cacf4cf5ee981723254c963f7c498783b4f0bb1fbcd59d1c1203b4db9a43060 |
|
MD5 | 56372889f7ec444c71c1a35258435a95 |
|
BLAKE2b-256 | fc8d465c0425a51b8a9460013f23a82c8590b66d0639fbf29335b636e32726b0 |