Code to create a PRG from a Multiple Sequence Alignment file
Project description
make_prg
A tool to create and update PRGs for input to Pandora and Gramtools from a set of Multiple Sequence Alignments.
Support
We fully support make_prg
on linux
with python
versions 3.8
-3.11
. For other operating systems, make_prg
can be run through containers.
Table of Contents
Install
No installation needed - precompiled portable binary
You can use make_prg
with no installation at all by simply downloading the precompiled binary, and running it.
In this binary, all libraries are linked statically. Compilation is done using PyInstaller.
Requirements
GLIBC >= 2.29
(present on Ubuntu >= 19.04
, Debian >= 11
, CentOS >= 9
, etc);
Download
wget https://github.com/iqbal-lab-org/make_prg/releases/download/0.5.0/make_prg_0.5.0
Run
chmod +x make_prg_0.5.0
./make_prg_0.5.0 -h
pip
Requirements: python>=3.8,<=3.11
pip install make_prg
conda
conda install -c bioconda make_prg
Container
Docker images are hosted at quay.io.
singularity
Prerequisite: singularity
URI="docker://quay.io/iqballab/make_prg"
singularity exec "$URI" make_prg --help
The above will use the latest version. If you want to specify a version then use a tag (or commit) like so.
VERSION="0.5.0"
URI="docker://quay.io/iqballab/make_prg:${VERSION}"
docker
Prerequisite: docker
docker pull quay.io/iqballab/make_prg
docker run quay.io/iqballab/make_prg --help
You can find all the available tags on the quay.io repository.
Running on a sample example
To see how to input files to both make_prg from_msa
and make_prg update
, and the outputs
they create on a sample example, see sample example.
Usage
$ make_prg --help
usage: make_prg <subcommand> <options>
Subcommand entrypoint
options:
-h, --help show this help message and exit
-V, --version show program's version number and exit
Available subcommands:
from_msa Make PRG from multiple sequence alignment
update Update PRGs given new sequences.
from_msa
$ make_prg from_msa --help
usage: make_prg from_msa
options:
-h, --help show this help message and exit
-i INPUT, --input INPUT
Multiple sequence alignment file or a directory containing such files
-s SUFFIX, --suffix SUFFIX
If the input parameter (-i, --input) is a directory, then filter for files with this suffix. If this parameter is not given, all files in the input directory is considered.
-o OUTPUT_PREFIX, --output-prefix OUTPUT_PREFIX
Prefix for the output files
-f ALIGNMENT_FORMAT, --alignment-format ALIGNMENT_FORMAT
Alignment format of MSA, must be a biopython AlignIO input alignment_format. See http://biopython.org/wiki/AlignIO. Default: fasta
-N MAX_NESTING, --max-nesting MAX_NESTING
Maximum number of levels to use for nesting. Default: 5
-L MIN_MATCH_LENGTH, --min-match-length MIN_MATCH_LENGTH
Minimum number of consecutive characters which must be identical for a match. Default: 7
-O OUTPUT_TYPE, --output-type OUTPUT_TYPE
p: PRG, b: Binary, g: GFA, a: All. Combinations are allowed i.e., gb: GFA and Binary. Default: a
-F, --force Force overwrite previous output
-t THREADS, --threads THREADS
Number of threads. 0 will use all available. Default: 1
-v, --verbose Increase output verbosity (-v for debug, -vv for trace - trace is for developers only)
--log LOG Path to write log to. Default is stderr
update
$ make_prg update --help
usage: make_prg update
options:
-h, --help show this help message and exit
-u UPDATE_DS, --update-DS UPDATE_DS
Filepath to the update data structures (a *.update_DS.zip file created from make_prg from_msa or update)
-o OUTPUT_PREFIX, --output-prefix OUTPUT_PREFIX
Prefix for the output files
-d DENOVO_PATHS, --denovo-paths DENOVO_PATHS
Filepath containing denovo sequences. Should point to a denovo_paths.txt file
-D LONG_DELETION_THRESHOLD, --deletion-threshold LONG_DELETION_THRESHOLD
Ignores long deletions of the given size or longer. If long deletions should not be ignored, put a large value. Default: 10
-O OUTPUT_TYPE, --output-type OUTPUT_TYPE
p: PRG, b: Binary, g: GFA, a: All. Combinations are allowed i.e., gb: GFA and Binary. Default: a
-F, --force Force overwrite previous output
-t THREADS, --threads THREADS
Number of threads. 0 will use all available. Default: 1
-v, --verbose Increase output verbosity (-v for debug, -vv for trace - trace is for developers only)
--log LOG Path to write log to. Default is stderr
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file make_prg-0.5.0.tar.gz
.
File metadata
- Download URL: make_prg-0.5.0.tar.gz
- Upload date:
- Size: 15.8 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.17
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 2cbc2cfdd426714b17126f04cf4031198839ab3c73c41ccfdcfca9ac0d950866 |
|
MD5 | bd370c981cb910cee42381af3298d710 |
|
BLAKE2b-256 | 4f2edcaea80f51948d5b132200a2866b6feb44a687cee0e0967feb80ebda36c7 |
File details
Details for the file make_prg-0.5.0-py3-none-any.whl
.
File metadata
- Download URL: make_prg-0.5.0-py3-none-any.whl
- Upload date:
- Size: 15.9 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.17
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 925252978f662c7e6918065ea4d4f2beb8956bb305505e5f6ba77a5e239c1400 |
|
MD5 | a680a5147097c6b4f805eb08a3039f1b |
|
BLAKE2b-256 | e6f4d2fdc19f4069bff7e2ad3b6258cdd3293e34e1f1b4e56dc1b8a3f98a0304 |