Gibbs Sampler for motif discovery
Project description
Gibbs_Sampler
This program runs the Gibbs Sampler algorithm for de novo motif discovery. Given a set of sequences, the program will calculate the most likely motif instance as well as the position weight matrix and position specific scoring matrix (the log2 normalized frequency scores).
The input sequence file should be provided in fasta format. Output is written to the console. A eps file containing the weblogo of the motif is also created. #Getting Started ##Prerequisites Python 3.8 or later is required. ##Installation The program can be installed with pip by running the following command:
$ pip install gibbs_sampler
Usage
To run, simply call gibbs_sampler from the command line along with the path to the sequence file and expected width of the motif.
$ gibbs_sampler <primary sequence file> <width>
Optional arguments for the title of the weblogo file and number of iterations of the sampler can be specified using the -t and -n flags respectively.
License
gibbs_sampler
was created by Monty Python. It is licensed under the terms of the CC0 v1.0 Universal license.
Credits
Thank you to Professor Hendrix for teaching me how to use python to investigate biologically relevant questions.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for gibbs_sampler-0.2.0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 2bb00132efbcf6aa44a12d3afafca948902abb685092d9e7bc79c01fb967545b |
|
MD5 | b9d4eac4946ae2aa68f0219bb73b10e9 |
|
BLAKE2b-256 | f6ad48743fcf250c2792310b1b8565e35c9afa8e544f10cf8562165fee5b5b51 |