Python package that searches for microsatellites in fasta sequences
Project description
This program searches for microsatellites in fasta sequences. The available options are the following:
-i, --input-fasta: path of the fasta file containing the sequence(s) to analyze. The file can be either a plain text file or a gzip compressed file.
-o, --output-file: path of the file to save with the results of the analysis. The coordinates of the microsatellites will be 0-based.
-l, --seed-length: length of the seed of the microsatellites, i.e. the number of nucleotides that will be repeated. For instance -l 2 will find microsatellites like ACACACAC where the seed AC is repeated 4 times.
-r, --minimum_repetitions: minimum number of repetitions of the seed. For instance -r 3 will find microsatellites like CTACTACTA or CTACTACTACTA where the seed CTA is repeated at least 3 times. The minimum allowed value is 2 and the default value is 3.
-im, --imperfect: include imperfect microsatellites. With this option microsatellites repeated at least -r -1 times that share the same seed and have a distance up to the “–imperfect” value will be merged together and will be considered as a single microsatellite. By default this option is disabled and microsatellites are kept separated.
-a, --alphabet: alphabet to use for the microsatellites search. The alphabet can be either dna for DNA or aa for PROTEINS. Default is “dna”.
-c, --cores: number of CPUs to use in the computation. By default it will use all the available CPUs.
-p, --progress: track the progress of the computation with a progress bar.
Requirements:
Python 3.4
Installation:
pip install macrosatellites_finder
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Hashes for microsatellites_finder-1.0.0.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | ed74270116e27777787ca349a581b8804312e5e644d7c4fa7f4d7458d1f9cabf |
|
MD5 | 6c4cad6f826571369c05df375b622cc2 |
|
BLAKE2b-256 | 23326e9c76dc8cec49ff4029cd3210c68d5f3574f6d60255af66ea6b3b8c35e3 |