Skip to main content

Core Sequence Identifier

Project description

CORSID

CORSID is a computational tool to simultaneously identifying TRS sites and gene locations in unannotated coronavirus genomes. We also provide another tool CORSID-A that only identifies TRS sites given annotated genes. Given an genome (optionally with their annotation), CORSID(-A) will find the TRS alignment and the core sequence.

The data and results can be found in the repo CORSID-data. The visualized results of our tool applied to 468 coronavirus genomes can be found in CORSID-viz.

Figure

Contents

  1. Pre-requisites
  2. Installation
  3. Usage instructions

Pre-requisites

Installation

Using conda (recommended)

  1. Create a new conda environment named "corsid" and install dependencies:

    conda create -n corsid
    
  2. Then activate the created environment: conda activate corsid.

  3. Install the package into current environment "corsid":

    conda install -c bioconda corsid
    

Using pip (alternative)

We recommend installing in a virtual environment, as decribed in step 1 and 2 in the previous section. Use pip to install the package:

pip install corsid

Usage instructions

I/O formats

CORSID takes a FASTA file containing the complete genome as input. Optionally it also takes an annotation file (GFF format) to validate the identified genes.

CORSID-A takes a FASTA file and an annotation file (GFF format) as input. It will find candidate regions for each gene given the annotation file, and run CORSID-A on candidate regions.

The output is an JSON file containing sorted solutions and auxilary information. This file can be used as the input to the visualization webapp (link). The program also outputs to the standard output, where it shows tables of solutions and visualization of TRS alignment.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

corsid-0.1.0.tar.gz (14.7 kB view hashes)

Uploaded Source

Built Distribution

corsid-0.1.0-py3-none-any.whl (17.0 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page