System for turnkey analysis of semi-automated genome annotations

These details have not been verified by PyPI

Project links

Homepage

Project description

Segzoo

What is segzoo?

Segzoo is a tool that allows to run various genomic analysis on a segmentation obtained by segway. The results of each analysis are made available as well as a summarizing visualization of the results. The requirements for this tool include segtools, bedtools and python packages, but all of them are dependencies that will be treated during installation.

How to install

Segzoo is a python 3 tool, so if you have python 2 installed it is highly recommended for you to install segzoo in a separate python 3 environment. To create such an environment run conda create -n python3_env python==3.6 where you can change the name of the environment, python3_env. Accept all the installation steps.

Next, you need to activate this environment. Run source activate python3_env specifying the name of the environment you chose before. Now that you already are in it, you can install segzoo. You can do that by running pip install segzoo, which will require you to have bedtools already installed, as it's only in anaconda. To install bedtools beforehand you can use conda install -c bioconda bedtools. Another option is to install it using conda install -c bioconda segzoo which will take care of all the dependencies (WIP).

After accepting all installations, segzoo will be good to go!

How to use

To access the help to know how to run segzoo you can run segzoo -h or segzoo --help. Here's a look at the most important parameters:

--parameters to specify a params.params file resulting from segway's training to obtain GMTK parameters in the final visualization.
--prefix to specify where you want all needed data (like genome assembly) to be downloaded. The default is in your current environment share directory.
-o or --outdir to specify the folder where all the results and the final visualization will be created

After running the command segzoo by specifying the segmentation file and all the optional arguments that you want, the execution of the pipeline will begin. All necessary data will be downloaded, tools will run the different analysis and the final visualization will be created. This execution may take some time.

Results

After the execution has finished, the new directory will be created (outdir is the default name). In the data folder you will be able to find the results for all the tools' analysis. In results you will find the tables of processed results used in the visualization. In logs some relevant information about the run. Finally, the visualization will be in the plots directory. It will look something like this:

Plot

Y-axis are the labels of the segmentation for all the plots. As a note: the aggregation results displayed are the percentage of aggregations in one component in comparison to all the gene biotype, so notice that each row adds up to 100.

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

1.0.13

Apr 17, 2024

1.0.12

Feb 26, 2024

1.0.11

Feb 3, 2023

1.0.10 yanked

Feb 2, 2023

Reason this release was yanked:

extra file tmp.py contains non python code and can cause errors when installing.

1.0.9

Sep 19, 2022

1.0.7

Apr 28, 2022

1.0.4

Jun 20, 2018

1.0.3

Jun 15, 2018

1.0.2

May 25, 2018

1.0.1

May 24, 2018

1.0.0

May 23, 2018

This version

1.0.0.dev11 pre-release

May 11, 2018

1.0.0.dev10 pre-release

May 10, 2018

1.0.0.dev8 pre-release

Apr 16, 2018

1.0.0.dev7 pre-release

Apr 12, 2018

1.0.0.dev6 pre-release

Apr 11, 2018

1.0.0.dev5 pre-release

Apr 11, 2018

1.0.0.dev4 pre-release

Apr 11, 2018

1.0.0.dev3 pre-release

Apr 10, 2018

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

segzoo-1.0.0.dev11.tar.gz (15.7 kB view hashes)

Uploaded May 11, 2018 Source

Hashes for segzoo-1.0.0.dev11.tar.gz

Hashes for segzoo-1.0.0.dev11.tar.gz
Algorithm	Hash digest
SHA256	`b4b6f360beee69ad87cbc7929f819549692c10c97d318a7511f39a851d19dd10`
MD5	`203371971af4d8bd84e19b97be494aa7`
BLAKE2b-256	`c6129a7ce2d16225c01b98e582160b69670dd1034918c8c87081f76f96a06faf`