A package for motif discovery and motif enrichment analysis
Project description
MotifScan
Introduction
Scan input genomic regions with known DNA motifs
Given a set of input genomic regions, MotifScan scans the sequences to detect the occurrences of known motifs. It can also applies a statistical test on each motif to check whether the motif is significantly over- or under-represented (enriched or depleted) in the input genomic regions compared to another set of control regions.
Citation
Documentation
To see the full documentation of MotifScan, please refer to: https://motifscan.readthedocs.io
Installation
The latest version release of MotifScan is available at PyPI:
$ pip install motifscan
Or you can install MotifScan via conda:
$ conda install -c bioconda motifscan
Usage
Install genome assemblies
Install from a remote database
You can download genome assemblies from the UCSC database.
First, display all available genome assemblies:
$ motifscan genome --list-remote
Then, install a genome assembly (e.g. hg19):
$ motifscan genome --install -n hg19 -r hg19
Install with local files
To install a genome assembly locally, you have to prepare a FASTA file containing the genome sequences and a genome annotation file (refGene.txt).
$ motifscan genome --install -n hg19 -i <hg19.fa> -a <refGene.txt>
Install and build motif sets
Install from a remote database
Users can install motif PFMs sets in the JASPAR 2020 database.
First, display all available motif PFMs sets in JASPAR 2020:
$ motifscan motif --list-remote
Then, install a JASPAR motif PFMs set (e.g. vertebrates_non-redundant):
$ motifscan motif --install -n <motif_set> -r vertebrates_non-redundant -g hg19
Install with local files
Install a motif set with local PFMs file:
$ motifscan motif --install -n <motif_set> -i <pfms.jaspar> -g hg19
Build PFMs for additional genome
Build the motif PFMs set for another installed genome assembly hg38:
$ motifscan motif --build <motif_set> -g hg38
Scanning Motifs
After the data preparation steps, you can now scan a set of genomic regions to detect the occurrences of known motifs.
$ motifscan scan -i regions.bed -g hg19 -m <motif_set> -o <output_dir>
Note: Using -h/–help for the details of all arguments.
License
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
File details
Details for the file motifscan-1.3.0.tar.gz
.
File metadata
- Download URL: motifscan-1.3.0.tar.gz
- Upload date:
- Size: 232.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.3.0 pkginfo/1.7.0 requests/2.25.0 setuptools/51.0.0.post20201207 requests-toolbelt/0.9.1 tqdm/4.54.1 CPython/3.8.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 8059ab3232dfe7580835c5e953a8edd3f843676d57a6afbc41c5518a8a4dfde2 |
|
MD5 | 6275fdb3d3e2c040c55edfce8fe3cba0 |
|
BLAKE2b-256 | a9259f149660e79c145c6bcc9a9d4ceb35f8c3eff10dcddce37014663c062d3d |